The determination of the number of groups in a dataset, theircomposition and the most relevant measurements to be considered in clusteringthe data, is a high-demanding task, especially when the a priori information onthe dataset is limited. Three different genetic approaches are introduced in thispaper as tools for automatic data clustering and features selection. They differin the adopted codification of the grouping problem, not in the evolutionaryoperator and parameters. Two of them deals with the grouping problem in adeterministic framework. The first directly approaches the grouping problem asa combinatorial one. The second tries to determine some relevant points in thedata domain to be used in clustering data as group separators. A probabilisticframework is then introduced with the third one, which starts specifying thestatistical model from which data are assumed to be drawn. The evolutionaryapproaches are, finally, compared with respect to classical partitional clusteringalgorithms on simulated data and on Fisher’s Iris dataset used as a benchmark.

Evolutionary Approaches for Cluster Analysis / Minerva, Tommaso; Paterlini, Sandra. - In: SOFT COMPUTING. - ISSN 1432-7643. - STAMPA. - 8:(2003), pp. 165-176.

Evolutionary Approaches for Cluster Analysis

MINERVA, Tommaso;PATERLINI, Sandra
2003

Abstract

The determination of the number of groups in a dataset, theircomposition and the most relevant measurements to be considered in clusteringthe data, is a high-demanding task, especially when the a priori information onthe dataset is limited. Three different genetic approaches are introduced in thispaper as tools for automatic data clustering and features selection. They differin the adopted codification of the grouping problem, not in the evolutionaryoperator and parameters. Two of them deals with the grouping problem in adeterministic framework. The first directly approaches the grouping problem asa combinatorial one. The second tries to determine some relevant points in thedata domain to be used in clustering data as group separators. A probabilisticframework is then introduced with the third one, which starts specifying thestatistical model from which data are assumed to be drawn. The evolutionaryapproaches are, finally, compared with respect to classical partitional clusteringalgorithms on simulated data and on Fisher’s Iris dataset used as a benchmark.
2003
8
165
176
Minerva, Tommaso; Paterlini, Sandra
Evolutionary Approaches for Cluster Analysis / Minerva, Tommaso; Paterlini, Sandra. - In: SOFT COMPUTING. - ISSN 1432-7643. - STAMPA. - 8:(2003), pp. 165-176.
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/640897
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 30
social impact