In cluster analysis, the inclusion of unnecessaryvariables may mask the true group structure. For the selection ofthe best subset of variables, we suggest the use of two overallindices. The first index is a distance between two hierarchicalclusterings and the second one is a similarity index obtained asthe complement to one of the previous distance. Both criteria canbe used for measuring the similarity between clusterings obtainedwith different subsets of variables. An application with a realdata set regarding the economic welfare of the Italian Regionsshows the benefits gained with the suggested procedure.

Variable selection in cluster analysis: an approach based on a new index / Morlini, Isabella; Zani, S.. - STAMPA. - (2013), pp. 71-79. (Intervento presentato al convegno Joint Meetings on Classification and Data Analysis Group of the Italian Statistical Society, CLADAG 2010 tenutosi a Firenze, ita) [10.1007/978-3-642-28894-4_9].

Variable selection in cluster analysis: an approach based on a new index

MORLINI, Isabella;
2013

Abstract

In cluster analysis, the inclusion of unnecessaryvariables may mask the true group structure. For the selection ofthe best subset of variables, we suggest the use of two overallindices. The first index is a distance between two hierarchicalclusterings and the second one is a similarity index obtained asthe complement to one of the previous distance. Both criteria canbe used for measuring the similarity between clusterings obtainedwith different subsets of variables. An application with a realdata set regarding the economic welfare of the Italian Regionsshows the benefits gained with the suggested procedure.
2013
Joint Meetings on Classification and Data Analysis Group of the Italian Statistical Society, CLADAG 2010
Firenze, ita
71
79
Morlini, Isabella; Zani, S.
Variable selection in cluster analysis: an approach based on a new index / Morlini, Isabella; Zani, S.. - STAMPA. - (2013), pp. 71-79. (Intervento presentato al convegno Joint Meetings on Classification and Data Analysis Group of the Italian Statistical Society, CLADAG 2010 tenutosi a Firenze, ita) [10.1007/978-3-642-28894-4_9].
File in questo prodotto:
File Dimensione Formato  
2013 Springer Morlini Zani.pdf

Open access

Tipologia: Versione pubblicata dall'editore
Dimensione 195.04 kB
Formato Adobe PDF
195.04 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/709013
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
social impact