A new class of weighted similarity indices using polytomous variables / Morlini, Isabella; Zani, S. - In: JOURNAL OF CLASSIFICATION. - ISSN 0176-4268. - Print. - 29:2 (2012), pp. 199-226. [10.1007/s00357-012-9107-2]
A new class of weighted similarity indices using polytomous variables
MORLINI, Isabella;
2012
Abstract
We introduce new similarity measures between two subjects, with reference to variables with multiple categories. In contrast to traditionally used similarity indices, they also consider the frequency of the categories of each attribute in the sample. This feature is useful when dealing with rare categories, since it makes sense to evaluate the pairwise presence of a rare category differently from the pairwise presence of a widespread one. A weighting criterion for each category, derived from Shannon's information theory, is suggested. There are two versions of the weighted index: one for independent categorical variables and one for dependent variables. The suitability of the proposed indices is shown in this paper using both simulated and real-world data sets.
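The idea of weighting a shared category by how rare it is can be illustrated with a minimal Python sketch. This is not the index defined in the paper: the abstract does not give the formula, so the weight used here (the Shannon self-information, -log2 of a category's relative frequency) and the normalization are assumptions chosen only for illustration, and the function names `category_weights` and `weighted_similarity` are hypothetical. The sketch treats the variables as independent.

```python
import math
from collections import Counter


def category_weights(column):
    """Map each category to -log2 of its relative frequency in the sample.

    Assumption: this self-information weight stands in for the paper's
    weighting criterion, which is not spelled out in the abstract.
    """
    n = len(column)
    counts = Counter(column)
    return {cat: -math.log2(count / n) for cat, count in counts.items()}


def weighted_similarity(a, b, weights):
    """Hypothetical index: information carried by the shared categories,
    divided by the largest information the pair could have shared."""
    shared = 0.0
    total = 0.0
    for x, y, w in zip(a, b, weights):
        total += max(w[x], w[y])
        if x == y:
            shared += w[x]
    # If every variable has a single category, no information is available.
    return shared / total if total > 0 else 0.0


# Toy sample: variable 1 has a rare and a common category,
# variable 2 has two equally frequent categories.
data = [
    ("rare", "u"),
    ("rare", "v"),
    ("common", "u"),
    ("common", "v"),
    ("common", "u"),
    ("common", "v"),
    ("common", "u"),
    ("common", "v"),
]

# One weight table per variable (column of the data).
weights = [category_weights(col) for col in zip(*data)]

# Both pairs agree on variable 1 and disagree on variable 2.
sim_rare = weighted_similarity(data[0], data[1], weights)
sim_common = weighted_similarity(data[2], data[3], weights)
print(sim_rare, sim_common)
```

An unweighted simple matching coefficient would score both pairs at 0.5, since each pair agrees on one of two variables. Under the assumed weights, the pair that shares the rare category scores higher than the pair that shares the common one, which is the behavior the abstract motivates.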