|
ABSTRACT
Title |
: |
Combining K-Harmonic mean and Hierarchical algorithms for robust and efficient data clustering with cohesion selfmerging |
Authors |
: |
Jyothi Bankapalli, R.Venu Babu, S. Anjali Devi |
Keywords |
: |
Data mining, K-harmonic means clustering algorithm, Hierarchical clustering algorithm,
Cohesion Similarity measure. |
Issue Date |
: |
June 2011 |
Abstract |
: |
A cluster is a collection of data objects that are similar to one another within the cluster and are dissimilar to the objects in the other cluster. Data clustering is studied in statistical, machine learning and data mining. The dissimilarity between two clusters is defined as the distance between their centroids or the distance between two closest (or farthest) data points. The measures are vulnerable to outliers and removing the outliers is yet another difficult task. A new similarity measure, cohesion is used to measure the inter cluster distance. By using cohesion measure, a new two phase clustering algorithm is designed called as “Cohesion Based Self Merging Algorithm (CSM)”. This is a combination of partitional (KHarmonic mean) clustering and hierarchical clustering (CURE) algorithms. CSM partitions the input dataset into several small clusters in the first phase and then continuously merges the sub clusters based on similarity measures cohesion in a hierarchical manner in the second phase. Run time behaviors of these algorithms are analyzed and compared using the existing method combining k-mean and hierarchical algorithms for robust and efficient data clustering with cohesion-self merging algorithm. |
Page(s) |
: |
2544-2553 |
ISSN |
: |
0975–3397 |
Source |
: |
Vol. 3, Issue.6 |
|