Vis enkel innførsel

dc.contributor.authorMøllersen, Kajsa
dc.contributor.authorDhar, Subhra
dc.contributor.authorGodtliebsen, Fred
dc.date.accessioned2017-03-17T13:35:26Z
dc.date.available2017-03-17T13:35:26Z
dc.date.issued2016-09-12
dc.description.abstractHybrid clustering combines partitional and hierarchical clustering for computational effectiveness and versatility in cluster shape. In such clustering, a dissimilarity measure plays a crucial role in the hierarchical merging. The dissimilarity measure has great impact on the final clustering, and data-independent properties are needed to choose the right dissimilarity measure for the problem at hand. Properties for distance- based dissimilarity measures have been studied for decades, but properties for density-based dissimilarity measures have so far received little attention. Here, we propose six data-independent properties to evaluate density-based dissimilarity measures associated with hybrid clustering, regarding equality, orthogonality, symmetry, outlier and noise observations, and light-tailed models for heavy-tailed clusters. The significance of the properties is investigated, and we study some well-known dissimilarity measures based on Shannon entropy, misclassification rate, Bhattacharyya distance and Kullback-Leibler divergence with respect to the proposed properties. As none of them satisfy all the proposed properties, we introduce a new dissimilarity measure based on the Kullback-Leibler information and show that it satisfies all proposed properties. The effect of the proposed properties is also illustrated on several real and simulated data sets.en_US
dc.descriptionSource: <a href=http://dx.doi.org/10.4236/am.2016.715143>doi: 10.4236/am.2016.715143</a>en_US
dc.identifier.citationMøllersen, K., Dhar, S.S. and Godtliebsen, F. (2016) On Data-Independent Properties for Density-Based Dissimilarity Measures in Hybrid Clustering. Applied Mathematics , 7, 1674-1706. http://dx.doi.org/10.4236/am.2016.715143en_US
dc.identifier.cristinIDFRIDAID 1438272
dc.identifier.doi10.4236/am.2016.715143
dc.identifier.issn2152-7385
dc.identifier.issn2152-7393
dc.identifier.urihttps://hdl.handle.net/10037/10769
dc.language.isoengen_US
dc.publisherScientific Research Publishingen_US
dc.relation.journalApplied Mathematics
dc.rights.accessRightsopenAccessen_US
dc.subjectVDP::Matematikk og Naturvitenskap: 400::Matematikk: 410en_US
dc.subjectVDP::Mathematics and natural science: 400::Mathematics: 410en_US
dc.titleOn Data-Independent Properties for Density-Based Dissimilarity Measures in Hybrid Clusteringen_US
dc.typeJournal articleen_US
dc.typeTidsskriftartikkelen_US
dc.typePeer revieweden_US


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel