Text Clustering as a Mining Task / Mandreoli, Federica; Martoglia, Riccardo; Tiberio, Paolo. - Print. - (2005), pp. 75-104.

Text Clustering as a Mining Task

MANDREOLI, Federica; MARTOGLIA, Riccardo; TIBERIO, Paolo
2005

Abstract

In this chapter we introduce readers to the various aspects of cluster analysis performed on textual data in a mining framework. We first provide a brief overview of the techniques and background notions of general clustering. We then focus on the importance and goals of clustering in a text mining scenario, analyzing and describing the issues specific to this field. The topics covered include effective information extraction from high-dimensional textual data, clustering algorithms designed to work efficiently on very large, unstructured, and possibly hyperlinked data sets, and comprehension of the clustering output.
Text Mining and its Applications to Intelligence, CRM and Knowledge Management
185312995X
WIT Press
United States of America

Use this identifier to cite or link to this document: https://hdl.handle.net/11380/308399