Schema matching is the problem of finding relationships among concepts across data sources that are heterogeneous in format and structure. Schema matching systems usually exploit lexical and semantic information provided by lexical databases and thesauri to discover intra- and inter-schema semantic relationships among schema elements. However, most of them perform poorly in real-world scenarios because of the significant presence of “non-dictionary words” in real-world schemata. Non-dictionary words include compound nouns, abbreviations and acronyms. In this paper, we present NORMS (NORmalizer of Schemata), a tool that performs schema label normalization to increase the number of comparable labels extracted from schemata.
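To make the idea of label normalization concrete, the following is a minimal sketch in Python, not the NORMS implementation. It assumes a small hand-written abbreviation dictionary (`ABBREVIATIONS`) and two hypothetical helper functions (`tokenize`, `normalize`); the abstract does not state how NORMS actually splits labels or expands abbreviations.

```python
import re

# Hypothetical abbreviation dictionary, for illustration only.
# A real tool would draw expansions from richer sources.
ABBREVIATIONS = {
    "qty": "quantity",
    "po": "purchase order",
    "cust": "customer",
    "no": "number",
}


def tokenize(label):
    """Split a schema label on separators and camelCase boundaries."""
    tokens = []
    # First split on anything that is not a letter or digit (e.g. "_", "-").
    for part in re.split(r"[^A-Za-z0-9]+", label):
        # Then split each part into acronym runs, words and digit runs.
        tokens.extend(
            re.findall(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+|[0-9]+", part)
        )
    return [t.lower() for t in tokens]


def normalize(label):
    """Return the label as space-separated dictionary words where possible."""
    words = []
    for token in tokenize(label):
        # Expand known abbreviations; keep unknown tokens unchanged.
        words.extend(ABBREVIATIONS.get(token, token).split())
    return " ".join(words)


if __name__ == "__main__":
    for label in ["CustQty", "PO_no", "orderDate"]:
        print(label, "->", normalize(label))
```

Under these assumptions, labels such as `CustQty` and `PO_no` become phrases made of dictionary words, which a lexical database can then look up and compare.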
NORMS: an automatic tool to perform schema label normalization / Bergamaschi, Sonia; Gawinecki, Maciej; Sorrentino, Serena. - Electronic. - (2011), pp. 1344-1347. (Paper presented at the 2011 IEEE 27th International Conference on Data Engineering, ICDE 2011, held in Hannover, Germany, 11-16 April 2011) [10.1109/ICDE.2011.5767952].