In [12] a novel approach to Web search result clustering based on Word Sense Induction, i.e. the automatic discovery of word senses from raw text was presented; key to the proposed approach is the idea of, first, automatically in- ducing senses for the target query and, second, clustering the search results based on their semantic similarity to the word senses induced. In [1] we proposed an innovative Word Sense Induction method based on multilingual data; key to our approach was the idea that a multilingual context representation, where the context of the words is expanded by considering its translations in different languages, may im- prove the WSI results; the experiments showed a clear per- formance gain. In this paper we give some preliminary ideas to exploit our multilingual Word Sense Induction method to Web search result clustering.
Multilingual Word Sense Induction to Improve Web Search Result Clustering / Albano, Lorenzo; Beneventano, Domenico; Bergamaschi, Sonia. - (2015), pp. 835-839. (Intervento presentato al convegno 24th International Conference on World Wide Web tenutosi a Firenze nel 18-22 May 2015) [10.1145/2740908.2743009].
Multilingual Word Sense Induction to Improve Web Search Result Clustering
ALBANO, LORENZO;BENEVENTANO, Domenico;BERGAMASCHI, Sonia
2015
Abstract
In [12] a novel approach to Web search result clustering based on Word Sense Induction, i.e. the automatic discovery of word senses from raw text was presented; key to the proposed approach is the idea of, first, automatically in- ducing senses for the target query and, second, clustering the search results based on their semantic similarity to the word senses induced. In [1] we proposed an innovative Word Sense Induction method based on multilingual data; key to our approach was the idea that a multilingual context representation, where the context of the words is expanded by considering its translations in different languages, may im- prove the WSI results; the experiments showed a clear per- formance gain. In this paper we give some preliminary ideas to exploit our multilingual Word Sense Induction method to Web search result clustering.File | Dimensione | Formato | |
---|---|---|---|
p835.pdf
Accesso riservato
Tipologia:
Versione pubblicata dall'editore
Dimensione
485.11 kB
Formato
Adobe PDF
|
485.11 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris