Multivariate exploratory data analysis allows revealing patterns and extracting information from complex multivariate data sets. However, highly complex data may not show evident groupings or trends in the principal component space, e.g. because the variation of the variables are not grouped but rather continuous. In these cases, classical exploratory methods may not provide satisfactory results when the aim is to find distinct groupings in the data. To enhance information extraction in such situations, we propose a novel approach inspired by the concept of combining weak classifiers, but in the unsupervised context. The approach is based on the fusion of several adjacency matrices obtained by different distance measures on data from different analytical platforms. This paper is intended to present and discuss the potential of the approach through a benchmark data set of beer samples. The beer data were acquired using three spectroscopic techniques: Visible, near-Infrared and Nuclear Magnetic Resonance. The results of fusing the three data sets via the proposed approach are compared with those from the single data blocks (Visible, NIR and NMR) and from a standard mid-level data fusion methodology. It is shown that, with the suggested approach, groupings related to beer style and other features are efficiently recovered, and generally more evident.

Fused Adjacency Matrices to enhance information extraction: the beer benchmark / Cavallini, Nicola; Savorani, Francesco; Bro, Rasmus; Cocchi, Marina. - In: ANALYTICA CHIMICA ACTA. - ISSN 0003-2670. - 1061:1061(2019), pp. 70-83. [10.1016/j.aca.2019.02.023]

Fused Adjacency Matrices to enhance information extraction: the beer benchmark

Cavallini, Nicola;Savorani, Francesco;Bro, Rasmus;Cocchi, Marina
2019

Abstract

Multivariate exploratory data analysis allows revealing patterns and extracting information from complex multivariate data sets. However, highly complex data may not show evident groupings or trends in the principal component space, e.g. because the variation of the variables are not grouped but rather continuous. In these cases, classical exploratory methods may not provide satisfactory results when the aim is to find distinct groupings in the data. To enhance information extraction in such situations, we propose a novel approach inspired by the concept of combining weak classifiers, but in the unsupervised context. The approach is based on the fusion of several adjacency matrices obtained by different distance measures on data from different analytical platforms. This paper is intended to present and discuss the potential of the approach through a benchmark data set of beer samples. The beer data were acquired using three spectroscopic techniques: Visible, near-Infrared and Nuclear Magnetic Resonance. The results of fusing the three data sets via the proposed approach are compared with those from the single data blocks (Visible, NIR and NMR) and from a standard mid-level data fusion methodology. It is shown that, with the suggested approach, groupings related to beer style and other features are efficiently recovered, and generally more evident.
2019
19-feb-2019
1061
1061
70
83
Fused Adjacency Matrices to enhance information extraction: the beer benchmark / Cavallini, Nicola; Savorani, Francesco; Bro, Rasmus; Cocchi, Marina. - In: ANALYTICA CHIMICA ACTA. - ISSN 0003-2670. - 1061:1061(2019), pp. 70-83. [10.1016/j.aca.2019.02.023]
Cavallini, Nicola; Savorani, Francesco; Bro, Rasmus; Cocchi, Marina
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S0003267019301977-main.pdf

Open access

Descrizione: FusedAdjACA2019
Tipologia: Versione dell'autore revisionata e accettata per la pubblicazione
Dimensione 16.61 MB
Formato Adobe PDF
16.61 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1171572
Citazioni
  • ???jsp.display-item.citation.pmc??? 2
  • Scopus 9
  • ???jsp.display-item.citation.isi??? 7
social impact