We propose a novel knowledge-based technique for inter-document similarity, called Context Semantic Analysis (CSA). Several specialized approaches built on top of specific knowledge base (e.g. Wikipedia) exist in literature but CSA differs from them because it is designed to be portable to any RDF knowledge base. Our technique relies on a generic RDF knowledge base (e.g. DBpedia and Wikidata) to extract from it a vector able to represent the context of a document. We show how such a Semantic Context Vector can be effectively exploited to compute inter-document similarity. Experimental results show that our general technique outperforms baselines built on top of traditional methods, and achieves a performance similar to the ones of specialized methods.
Context Semantic Analysis: A Knowledge-Based Technique for Computing Inter-document Similarity / Bergamaschi, Sonia; Beneventano, Domenico; Benedetti, Fabio. - 9939:(2016), pp. 164-178. (Intervento presentato al convegno 9th International Conference on Similarity Search and Applications, SISAP 2016 tenutosi a Tokyo, Japan nel October 24-26, 2016) [10.1007/978-3-319-46759-7_13].
Context Semantic Analysis: A Knowledge-Based Technique for Computing Inter-document Similarity
BERGAMASCHI, Sonia;BENEVENTANO, Domenico;BENEDETTI, FABIO
2016
Abstract
We propose a novel knowledge-based technique for inter-document similarity, called Context Semantic Analysis (CSA). Several specialized approaches built on top of specific knowledge base (e.g. Wikipedia) exist in literature but CSA differs from them because it is designed to be portable to any RDF knowledge base. Our technique relies on a generic RDF knowledge base (e.g. DBpedia and Wikidata) to extract from it a vector able to represent the context of a document. We show how such a Semantic Context Vector can be effectively exploited to compute inter-document similarity. Experimental results show that our general technique outperforms baselines built on top of traditional methods, and achieves a performance similar to the ones of specialized methods.File | Dimensione | Formato | |
---|---|---|---|
benedetti2016(1).pdf
Accesso riservato
Tipologia:
Versione pubblicata dall'editore
Dimensione
1.3 MB
Formato
Adobe PDF
|
1.3 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris