We propose a novel Named Entity Recognition (NER) system based on a machine learning technique and a semantic network. The NER system is able to exploit the advantages of semantic information, coming from Expert System proprietary technology, Cogito. NER is a task of Natural Language Processing (NLP) which consists in detecting, from an unformatted text source and classify, Named Entities (NE), i.e. real-world entities that can be denoted with a rigid designator. To address this problem, the chosen approach is a combination of machine learning and deep semantic processing. The machine learning method used is Conditional Random Fields (CRF). CRF is particularly suitable for the task because it analyzes an input sequence considering the whole sequence, instead of one item at a time. CRF has been trained not only with classical information, available after a simple computation or anyway with little effort, but with semantic information too. Semantic information is obtained with Sensigrafo and Semantic Disambiguator, which are the proprietary semantic network and semantic engine of Expert System, respectively. The results are encouraging, as we can experimentally prove the improvements in the NER task obtained by exploiting semantics.

Effects of Semantic Analysis on Named-Entity Recognition with Conditional Random Fields / Bergamaschi, S.; Cappelli, A.; Circiello, A.; Varone, M.. - 2037:(2017). (Intervento presentato al convegno 25th Italian Symposium on Advanced Database Systems, SEBD 2017 tenutosi a Sun Beach Clubesse, ita nel 2017).

Effects of Semantic Analysis on Named-Entity Recognition with Conditional Random Fields

Bergamaschi S.;Cappelli A.;Circiello A.;
2017

Abstract

We propose a novel Named Entity Recognition (NER) system based on a machine learning technique and a semantic network. The NER system is able to exploit the advantages of semantic information, coming from Expert System proprietary technology, Cogito. NER is a task of Natural Language Processing (NLP) which consists in detecting, from an unformatted text source and classify, Named Entities (NE), i.e. real-world entities that can be denoted with a rigid designator. To address this problem, the chosen approach is a combination of machine learning and deep semantic processing. The machine learning method used is Conditional Random Fields (CRF). CRF is particularly suitable for the task because it analyzes an input sequence considering the whole sequence, instead of one item at a time. CRF has been trained not only with classical information, available after a simple computation or anyway with little effort, but with semantic information too. Semantic information is obtained with Sensigrafo and Semantic Disambiguator, which are the proprietary semantic network and semantic engine of Expert System, respectively. The results are encouraging, as we can experimentally prove the improvements in the NER task obtained by exploiting semantics.
2017
25th Italian Symposium on Advanced Database Systems, SEBD 2017
Sun Beach Clubesse, ita
2017
2037
Bergamaschi, S.; Cappelli, A.; Circiello, A.; Varone, M.
Effects of Semantic Analysis on Named-Entity Recognition with Conditional Random Fields / Bergamaschi, S.; Cappelli, A.; Circiello, A.; Varone, M.. - 2037:(2017). (Intervento presentato al convegno 25th Italian Symposium on Advanced Database Systems, SEBD 2017 tenutosi a Sun Beach Clubesse, ita nel 2017).
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1248571
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact