Three approaches to partitional clustering using genetic algorithms (GA) are compared with k-means and the EM algorithm for three real world datasets (Iris, Glass and Vowel). The GA techniques differ in their encoding of the clustering problem using either a class id for each object (GAIE), medoids to assign objects to the class associated with the nearest medoid (GAME), or parameters for multivariate distributions that describe each cluster (GAPE). For the simple Iris dataset, all algorithms except GAIE obtained results with comparable accuracy, but k-means and EM had more runs with inferior results compared to GAME and GAPE. For the more complex Glass dataset, the results for GAME and GAPE were superior compared to k-means, EM and GAIE regarding their accuracy and variance of the results for repeated runs. None of the algorithms was superior for the most complex dataset (Vowel). We conclude that GAs in clustering are a valuable alternative to k-means and EM, but that the choice of the problem representation is crucial.
Genetic Algorithms in Partitional Clustering: a comparison / Paterlini, Sandra; Minerva, Tommaso. - STAMPA. - (2010), pp. 28-36. (Intervento presentato al convegno Proc. of the 11th WSEAS Int. Conf. on Neural Networks, NN '10, Proceedings of the 11th WSEAS Int. Conf. on Evolutionary Computing, EC '10, Proc. of the 11th WSEAS Int. Conf. on Fuzzy Systems, FS '10 tenutosi a Iasi, rou nel 2010).
Genetic Algorithms in Partitional Clustering: a comparison
PATERLINI, Sandra;MINERVA, Tommaso
2010
Abstract
Three approaches to partitional clustering using genetic algorithms (GA) are compared with k-means and the EM algorithm for three real world datasets (Iris, Glass and Vowel). The GA techniques differ in their encoding of the clustering problem using either a class id for each object (GAIE), medoids to assign objects to the class associated with the nearest medoid (GAME), or parameters for multivariate distributions that describe each cluster (GAPE). For the simple Iris dataset, all algorithms except GAIE obtained results with comparable accuracy, but k-means and EM had more runs with inferior results compared to GAME and GAPE. For the more complex Glass dataset, the results for GAME and GAPE were superior compared to k-means, EM and GAIE regarding their accuracy and variance of the results for repeated runs. None of the algorithms was superior for the most complex dataset (Vowel). We conclude that GAs in clustering are a valuable alternative to k-means and EM, but that the choice of the problem representation is crucial.File | Dimensione | Formato | |
---|---|---|---|
genetic_clustering.pdf
Accesso riservato
Tipologia:
Versione dell'autore revisionata e accettata per la pubblicazione
Dimensione
1.17 MB
Formato
Adobe PDF
|
1.17 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris