The problem of identifying the manifold generated copies of an object is known as Object Identification (OI). This problem concerns the quality of the data. Subsequently, the quality of the ob- ject (data) could be restored through the identification of the corrupted copies.In literature the solutions are mainly oriented to discover pairs of du- plicates (pairs-oriented OI) rather than sets of similar objects (group- oriented OI). We proposed a new technique to resolve the OI problem among many sources in a quasi-decentralized manner. The new technique is based on the concept of constraints and is composed by two phases: extraction phase and grouping. First we extract constraints by analyz- ing data at hand (the decentralized phase). Then, we reason about those to find the groups of similar objects (the centralized phase). We have conducted several tests that show the effectiveness of our proposal.

Object Identification across Multiple Sources / Beneventano, Domenico; Matteo Di, Gioia; Monica, Scannapieco. - STAMPA. - (2010), pp. 414-425. (Intervento presentato al convegno Eighteenth Italian Symposium on Advanced Database Systems, SEBD 2010 tenutosi a Rimini, Italy nel June 20-23 - 2010).

Object Identification across Multiple Sources

BENEVENTANO, Domenico;
2010

Abstract

The problem of identifying the manifold generated copies of an object is known as Object Identification (OI). This problem concerns the quality of the data. Subsequently, the quality of the ob- ject (data) could be restored through the identification of the corrupted copies.In literature the solutions are mainly oriented to discover pairs of du- plicates (pairs-oriented OI) rather than sets of similar objects (group- oriented OI). We proposed a new technique to resolve the OI problem among many sources in a quasi-decentralized manner. The new technique is based on the concept of constraints and is composed by two phases: extraction phase and grouping. First we extract constraints by analyz- ing data at hand (the decentralized phase). Then, we reason about those to find the groups of similar objects (the centralized phase). We have conducted several tests that show the effectiveness of our proposal.
2010
Eighteenth Italian Symposium on Advanced Database Systems, SEBD 2010
Rimini, Italy
June 20-23 - 2010
414
425
Beneventano, Domenico; Matteo Di, Gioia; Monica, Scannapieco
Object Identification across Multiple Sources / Beneventano, Domenico; Matteo Di, Gioia; Monica, Scannapieco. - STAMPA. - (2010), pp. 414-425. (Intervento presentato al convegno Eighteenth Italian Symposium on Advanced Database Systems, SEBD 2010 tenutosi a Rimini, Italy nel June 20-23 - 2010).
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/743564
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact