Optical chemical structure recognition is the problem of converting a bitmap image containing a chemical structure formula into a standard structured representation of the molecule. We introduce a novel approach to this problem based on the pipelined integration of pattern recognition techniques with probabilistic knowledge representation and reasoning. Basic entities and relations (such as textual elements, points, lines, etc.) are first extracted by a low-level processing module. A probabilistic reasoning engine based on Markov logic, embodying chemical and graphical knowledge, is subsequently used to refine these pieces of information. An annotated connection table of atoms and bonds is finally assembled and converted into a standard chemical exchange format. We report a successful evaluation on two large image data sets, showing that the method compares favorably with the current state-of-the-art, especially on degraded low-resolution images. The system is available as a web server at http://mlocsr.dinfo.unifi.it. © 2014 American Chemical Society.

Markov logic networks for optical chemical structure recognition / Frasconi, Paolo; Gabbrielli, Francesco; Lippi, Marco; Marinai, Simone. - In: JOURNAL OF CHEMICAL INFORMATION AND MODELING. - ISSN 1549-9596. - 54:8(2014), pp. 2380-2390. [10.1021/ci5002197]

Markov logic networks for optical chemical structure recognition

LIPPI, MARCO;
2014

Abstract

Optical chemical structure recognition is the problem of converting a bitmap image containing a chemical structure formula into a standard structured representation of the molecule. We introduce a novel approach to this problem based on the pipelined integration of pattern recognition techniques with probabilistic knowledge representation and reasoning. Basic entities and relations (such as textual elements, points, lines, etc.) are first extracted by a low-level processing module. A probabilistic reasoning engine based on Markov logic, embodying chemical and graphical knowledge, is subsequently used to refine these pieces of information. An annotated connection table of atoms and bonds is finally assembled and converted into a standard chemical exchange format. We report a successful evaluation on two large image data sets, showing that the method compares favorably with the current state-of-the-art, especially on degraded low-resolution images. The system is available as a web server at http://mlocsr.dinfo.unifi.it. © 2014 American Chemical Society.
2014
54
8
2380
2390
Markov logic networks for optical chemical structure recognition / Frasconi, Paolo; Gabbrielli, Francesco; Lippi, Marco; Marinai, Simone. - In: JOURNAL OF CHEMICAL INFORMATION AND MODELING. - ISSN 1549-9596. - 54:8(2014), pp. 2380-2390. [10.1021/ci5002197]
Frasconi, Paolo; Gabbrielli, Francesco; Lippi, Marco; Marinai, Simone
File in questo prodotto:
File Dimensione Formato  
ci5002197.pdf

Solo gestori archivio

Tipologia: Versione pubblicata dall'editore
Dimensione 1.27 MB
Formato Adobe PDF
1.27 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1122402
Citazioni
  • ???jsp.display-item.citation.pmc??? 6
  • Scopus 25
  • ???jsp.display-item.citation.isi??? 20
social impact