Developing intelligent tools for integrating information extracted from multiple heterogeneous sources is a key challenge in effectively exploiting the numerous sources available on-line in global information systems. In this paper, we propose intelligent, tool-supported techniques for information extraction and integration that take into account both structured and semistructured data sources. An object-oriented language called odli3, derived from the ODMG standard and with an underlying Description Logic, is introduced for information extraction. The odli3 descriptions of the information sources are exploited first to set a shared vocabulary for the sources. Information integration is then performed in a semi-automatic way, by exploiting the odli3 descriptions of source schemas with a combination of Description Logics and clustering techniques. The techniques described in the paper have been implemented in the MOMIS system, which is based on a conventional mediator architecture.
Intelligent Techniques for the Extraction and Integration of Heterogeneous Information / Bergamaschi, Sonia; Castano, S.; Vincini, Maurizio; Beneventano, Domenico. - PRINT. - (1999), pp. 109-129. (Paper presented at the IJCAI 1999 Workshop: Intelligent Information Integration, held in Stockholm in July 1999).