Lexical annotation is the explicit inclusion of the “meaning" of a data source element according to a lexical resource. Accuracy of semi-automatic lexical annotator tools is poor on real-world schemata due to the abundance of non-dictionary compound nouns. It follows that a large set of relationships among different schemata is discovered, including a great amount of false positive relationships. In this paper we propose a new method for the annotation of non- dictionary compound nouns, which draws its inspiration from works in the natural languagedisambiguation area. The method extends the lexical annotation module of the MOMIS data integration system.
Semi-automatic compound nouns annotation for data integration systems / Bergamaschi, Sonia; Sorrentino, Serena. - STAMPA. - (2009), pp. 221-228. (Intervento presentato al convegno Sistemi evoluti per Basi di dati (SEBD 2009) tenutosi a Camogli, Genova, Italy nel 21-24 Giungno 2009).
Semi-automatic compound nouns annotation for data integration systems
BERGAMASCHI, Sonia;SORRENTINO, Serena
2009
Abstract
Lexical annotation is the explicit inclusion of the “meaning" of a data source element according to a lexical resource. Accuracy of semi-automatic lexical annotator tools is poor on real-world schemata due to the abundance of non-dictionary compound nouns. It follows that a large set of relationships among different schemata is discovered, including a great amount of false positive relationships. In this paper we propose a new method for the annotation of non- dictionary compound nouns, which draws its inspiration from works in the natural languagedisambiguation area. The method extends the lexical annotation module of the MOMIS data integration system.Pubblicazioni consigliate
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris