We propose a novel knowledge-based technique for inter-document similarity, called Context Semantic Analysis (CSA). Several specialized approaches built on top of specific knowledge base (e.g. Wikipedia) exist in literature but CSA differs from them because it is designed to be portable to any RDF knowledge base. Our technique relies on a generic RDF knowledge base (e.g. DBpedia and Wikidata) to extract from it a vector able to represent the context of a document. We show how such a Semantic Context Vector can be effectively exploited to compute inter-document similarity. Experimental results show that our general technique outperforms baselines built on top of traditional methods, and achieves a performance similar to the ones of specialized methods.

Context Semantic Analysis: A Knowledge-Based Technique for Computing Inter-document Similarity / Bergamaschi, Sonia; Beneventano, Domenico; Benedetti, Fabio. - 9939:(2016), pp. 164-178. (Intervento presentato al convegno 9th International Conference on Similarity Search and Applications, SISAP 2016 tenutosi a Tokyo, Japan nel October 24-26, 2016) [10.1007/978-3-319-46759-7_13].

Context Semantic Analysis: A Knowledge-Based Technique for Computing Inter-document Similarity

BERGAMASCHI, Sonia;BENEVENTANO, Domenico;BENEDETTI, FABIO
2016

Abstract

We propose a novel knowledge-based technique for inter-document similarity, called Context Semantic Analysis (CSA). Several specialized approaches built on top of specific knowledge base (e.g. Wikipedia) exist in literature but CSA differs from them because it is designed to be portable to any RDF knowledge base. Our technique relies on a generic RDF knowledge base (e.g. DBpedia and Wikidata) to extract from it a vector able to represent the context of a document. We show how such a Semantic Context Vector can be effectively exploited to compute inter-document similarity. Experimental results show that our general technique outperforms baselines built on top of traditional methods, and achieves a performance similar to the ones of specialized methods.
2016
27-set-2016
9th International Conference on Similarity Search and Applications, SISAP 2016
Tokyo, Japan
October 24-26, 2016
9939
164
178
Bergamaschi, Sonia; Beneventano, Domenico; Benedetti, Fabio
Context Semantic Analysis: A Knowledge-Based Technique for Computing Inter-document Similarity / Bergamaschi, Sonia; Beneventano, Domenico; Benedetti, Fabio. - 9939:(2016), pp. 164-178. (Intervento presentato al convegno 9th International Conference on Similarity Search and Applications, SISAP 2016 tenutosi a Tokyo, Japan nel October 24-26, 2016) [10.1007/978-3-319-46759-7_13].
File in questo prodotto:
File Dimensione Formato  
benedetti2016(1).pdf

Accesso riservato

Tipologia: Versione pubblicata dall'editore
Dimensione 1.3 MB
Formato Adobe PDF
1.3 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1133903
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 9
  • ???jsp.display-item.citation.isi??? 4
social impact