Sharing, discovering, and integrating data is a crucial task and poses many challenging spots and open research direction. Data owners need to know what data consumers want and data consumers need to find datasets that are satisfactory for their tasks. Several data market platforms, or data marketplaces (DMs), have been used so far to facilitate data transactions between data owners and customers. However, current DMs are mostly shop windows, where customers have to rely on metadata that owners manually curate to discover useful datasets and there is no automated mechanism for owners to determine if their data could be merged with other datasets to satisfy customers’ desiderata. The availability of novel artificial intelligence techniques for data management has sparked a renewed interest in proposing new DMs that stray from this conventional paradigm and overcome its limitations. This paper envisions a conceptual framework called DataStreet where DMs can create personalized datasets by combining available datasets and presenting summarized statistics to help users make informed decisions. In our framework, owners share some of their data with a trusted DM, and customers provide a dataset template to fuel content-based (rather than metadata-based) search queries. Upon each query, the DM creates a preview of the personalized dataset through a flexible use of dataset discovery, integration, and value measurement, while ensuring owners’ fair treatment and preserving privacy. The previewed datasets might not be pre-defined in the DM and are finally materialized upon successful transaction.

Bridging the Gap between Buyers and Sellers in Data Marketplaces with Personalized Datasets / Firmani, Donatella; Mathew, Jerin George; Santoro, Donatello; Simonini, Giovanni; Zecchini, Luca. - 3478:(2023), pp. 525-534. (Intervento presentato al convegno 31st Italian Symposium on Advanced Database Systems (SEBD 2023) tenutosi a Galzignano Terme (Padova), Italy nel July 2-5, 2023).

Bridging the Gap between Buyers and Sellers in Data Marketplaces with Personalized Datasets

Simonini, Giovanni;Zecchini, Luca
2023

Abstract

Sharing, discovering, and integrating data is a crucial task and poses many challenging spots and open research direction. Data owners need to know what data consumers want and data consumers need to find datasets that are satisfactory for their tasks. Several data market platforms, or data marketplaces (DMs), have been used so far to facilitate data transactions between data owners and customers. However, current DMs are mostly shop windows, where customers have to rely on metadata that owners manually curate to discover useful datasets and there is no automated mechanism for owners to determine if their data could be merged with other datasets to satisfy customers’ desiderata. The availability of novel artificial intelligence techniques for data management has sparked a renewed interest in proposing new DMs that stray from this conventional paradigm and overcome its limitations. This paper envisions a conceptual framework called DataStreet where DMs can create personalized datasets by combining available datasets and presenting summarized statistics to help users make informed decisions. In our framework, owners share some of their data with a trusted DM, and customers provide a dataset template to fuel content-based (rather than metadata-based) search queries. Upon each query, the DM creates a preview of the personalized dataset through a flexible use of dataset discovery, integration, and value measurement, while ensuring owners’ fair treatment and preserving privacy. The previewed datasets might not be pre-defined in the DM and are finally materialized upon successful transaction.
2023
5-lug-2023
31st Italian Symposium on Advanced Database Systems (SEBD 2023)
Galzignano Terme (Padova), Italy
July 2-5, 2023
3478
525
534
Firmani, Donatella; Mathew, Jerin George; Santoro, Donatello; Simonini, Giovanni; Zecchini, Luca
Bridging the Gap between Buyers and Sellers in Data Marketplaces with Personalized Datasets / Firmani, Donatella; Mathew, Jerin George; Santoro, Donatello; Simonini, Giovanni; Zecchini, Luca. - 3478:(2023), pp. 525-534. (Intervento presentato al convegno 31st Italian Symposium on Advanced Database Systems (SEBD 2023) tenutosi a Galzignano Terme (Padova), Italy nel July 2-5, 2023).
File in questo prodotto:
File Dimensione Formato  
paper01.pdf

Open access

Tipologia: Versione pubblicata dall'editore
Dimensione 1.35 MB
Formato Adobe PDF
1.35 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1317067
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact