Variable selection in cluster analysis: an approach based on a new index / Morlini, Isabella; Zani, S. - Print. - (2013), pp. 71-79. (Paper presented at the Joint Meetings on Classification and Data Analysis Group of the Italian Statistical Society, CLADAG 2010, held in Florence, Italy) [10.1007/978-3-642-28894-4_9].
Variable selection in cluster analysis: an approach based on a new index
MORLINI, Isabella; ZANI, S.
2013
Abstract
In cluster analysis, the inclusion of unnecessary variables may mask the true group structure. For the selection of the best subset of variables, we suggest the use of two overall indices. The first index is a distance between two hierarchical clusterings, and the second is a similarity index obtained as the complement to one of that distance. Both criteria can be used for measuring the similarity between clusterings obtained with different subsets of variables. An application to a real data set on the economic welfare of the Italian regions shows the benefits gained with the suggested procedure.

File | Size | Format
---|---|---
2013 Springer Morlini Zani.pdf (open access; type: version published by the publisher) | 195.04 kB | Adobe PDF
The metadata in IRIS UNIMORE are released under the Creative Commons CC0 1.0 Universal license, while the publication files are released under the Attribution 4.0 International license (CC BY 4.0), unless otherwise indicated.