We propose a novel knowledge-based technique for inter-document similarity, called Context Semantic Analysis (CSA). Several specialized approaches built on top of specific knowledge base (e.g. Wikipedia) exist in literature but CSA differs from them because it is designed to be portable to any RDF knowledge base. Our technique relies on a generic RDF knowledge base (e.g. DBpedia and Wikidata) to extract from it a vector able to represent the context of a document. We show how such a Semantic Context Vector can be effectively exploited to compute inter-document similarity. Experimental results show that our general technique outperforms baselines built on top of traditional methods, and achieves a performance similar to the ones of specialized methods.

Context Semantic Analysis: A Knowledge-Based Technique for Computing Inter-document Similarity / Bergamaschi, Sonia; Beneventano, Domenico; Benedetti, Fabio. - 9939(2016), pp. 164-178. ((Intervento presentato al convegno 9th International Conference on Similarity Search and Applications (SISAP) tenutosi a Tokyo, Japan nel October 24-26, 2016.

Context Semantic Analysis: A Knowledge-Based Technique for Computing Inter-document Similarity

BERGAMASCHI, Sonia;BENEVENTANO, Domenico;BENEDETTI, FABIO
2016

Abstract

We propose a novel knowledge-based technique for inter-document similarity, called Context Semantic Analysis (CSA). Several specialized approaches built on top of specific knowledge base (e.g. Wikipedia) exist in literature but CSA differs from them because it is designed to be portable to any RDF knowledge base. Our technique relies on a generic RDF knowledge base (e.g. DBpedia and Wikidata) to extract from it a vector able to represent the context of a document. We show how such a Semantic Context Vector can be effectively exploited to compute inter-document similarity. Experimental results show that our general technique outperforms baselines built on top of traditional methods, and achieves a performance similar to the ones of specialized methods.
27-set-2016
9th International Conference on Similarity Search and Applications (SISAP)
Tokyo, Japan
October 24-26, 2016
9939
164
178
Bergamaschi, Sonia; Beneventano, Domenico; Benedetti, Fabio
Context Semantic Analysis: A Knowledge-Based Technique for Computing Inter-document Similarity / Bergamaschi, Sonia; Beneventano, Domenico; Benedetti, Fabio. - 9939(2016), pp. 164-178. ((Intervento presentato al convegno 9th International Conference on Similarity Search and Applications (SISAP) tenutosi a Tokyo, Japan nel October 24-26, 2016.
File in questo prodotto:
File Dimensione Formato  
benedetti2016(1).pdf

non disponibili

Tipologia: Versione dell'editore (versione pubblicata)
Dimensione 1.3 MB
Formato Adobe PDF
1.3 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

Caricamento pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: http://hdl.handle.net/11380/1133903
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 8
  • ???jsp.display-item.citation.isi??? 4
social impact