With the modern information technologies, data availability is increasing at formidable speed giving raise to the Big Data challenge (Bergamaschi, 2014). As a matter of fact, Big Data analysis now drives every aspect of modern so- ciety, such as: manufacturing, retail, financial services, etc., (Labrinidis & Jagadish, 2012). In this scenario, we need to rethink advanced and efficient human-computer-interaction to be able to handling huge amount of data. In fact, one of the most valuable means to make sense of Big Data, to most peo- ple, is data visualization. As a matter of fact, data visualization may guide decision-making and become a powerful tool to convey information in all data analysis tasks. However, to be actually actionable, data visualization tools should allow the right amount of interactivity and to be easy to use, under- standable, meaningful, and approachable. In this article, we present a new approach to visualize and explore a huge amount of data. In particular, the novelty of our approach is to enhance the faceted browsing search in Apache Solr∗ (a widely used enterprise search platform) by exploiting Bayesian networks, supporting the user in the explo- ration of the data. We show how the proposed Bayesian suggestion algorithm (Cooper & Herskovits, 1991) be a key ingredient in a Big Data scenario, where a query can generate too many results that the user cannot handle. Our pro- posed solution aim to select best results, which together with the result-path, chosen by the user by means of multi-faceted querying and faceted navigation, can be a valuable support for both Big Data exploration and visualization. In the fellowing, we introduce the faceted browsing technique, then we de- scribe how it can be enhanced exploiting Bayesian networks.
Enhancing big data exploration with faceted browsing / Bergamaschi, Sonia; Zhu, Song; Simonini, Giovanni. - (2018), pp. 13-21. (Intervento presentato al convegno 10th Scientific Meeting of the Classification and Data Analysis Group of the Italian Statistical Society, CLADAG 2015 tenutosi a Cagliari, ITALY nel OCT 10-12, 2015) [10.1007/978-3-319-55708-3_2].
Enhancing big data exploration with faceted browsing
Sonia Bergamaschi;Song Zhu;Giovanni Simonini
2018
Abstract
With the modern information technologies, data availability is increasing at formidable speed giving raise to the Big Data challenge (Bergamaschi, 2014). As a matter of fact, Big Data analysis now drives every aspect of modern so- ciety, such as: manufacturing, retail, financial services, etc., (Labrinidis & Jagadish, 2012). In this scenario, we need to rethink advanced and efficient human-computer-interaction to be able to handling huge amount of data. In fact, one of the most valuable means to make sense of Big Data, to most peo- ple, is data visualization. As a matter of fact, data visualization may guide decision-making and become a powerful tool to convey information in all data analysis tasks. However, to be actually actionable, data visualization tools should allow the right amount of interactivity and to be easy to use, under- standable, meaningful, and approachable. In this article, we present a new approach to visualize and explore a huge amount of data. In particular, the novelty of our approach is to enhance the faceted browsing search in Apache Solr∗ (a widely used enterprise search platform) by exploiting Bayesian networks, supporting the user in the explo- ration of the data. We show how the proposed Bayesian suggestion algorithm (Cooper & Herskovits, 1991) be a key ingredient in a Big Data scenario, where a query can generate too many results that the user cannot handle. Our pro- posed solution aim to select best results, which together with the result-path, chosen by the user by means of multi-faceted querying and faceted navigation, can be a valuable support for both Big Data exploration and visualization. In the fellowing, we introduce the faceted browsing technique, then we de- scribe how it can be enhanced exploiting Bayesian networks.File | Dimensione | Formato | |
---|---|---|---|
ENHANCING BIG DATA EXPLORATION WITH FACETED BROWSING.pdf
Open access
Tipologia:
Versione dell'autore revisionata e accettata per la pubblicazione
Dimensione
176.51 kB
Formato
Adobe PDF
|
176.51 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris