Accuracy of a markerless acquisition technique for studying speech articulators / Bandini, A.; Ouni, S.; Cosi, P.; Orlandi, S.; Manfredi, C. - (2015), pp. 2162-2166. (16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015, Dresden, Germany, September 6-10, 2015).

Accuracy of a markerless acquisition technique for studying speech articulators

Bandini, A.; Ouni, S.; Cosi, P.; Orlandi, S.; Manfredi, C.
2015

Abstract

The main disadvantages of existing methods for studying speech articulators (such as electromagnetic and optoelectronic systems) are their high cost and the discomfort they cause to participants or patients. The aim of this work is to introduce a completely markerless, low-cost 3D tracking technique for speech articulation and to compare it with a well-established marker-based technique in order to evaluate its performance. A Kinect-like device was used in conjunction with an existing face tracking algorithm to track lip movements in 3D without markers. The method was tested on two subjects uttering 200 words and 100 sentences. For most lip points, the RMSE ranged between 1 and 3 mm. Although the image resolution used in this experiment was low, these results are very promising. Nevertheless, further studies should consider higher video resolutions in order to obtain better results.
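
The abstract reports a per-point RMSE in millimetres but does not say how it was computed. The following is a minimal sketch of one common definition, assuming the markerless and marker-based recordings are already time-aligned and expressed in the same coordinate system, and that the error for each lip point is the 3D Euclidean distance between the two systems. The function name `per_point_rmse` and the toy coordinates are hypothetical and are not taken from the paper.

```python
import math


def per_point_rmse(markerless, reference):
    """Return one RMSE value per tracked point, in the unit of the input (e.g. mm).

    Both arguments are lists of frames; each frame is a list of (x, y, z)
    tuples, one per lip point. Assumption: the two recordings are already
    time-aligned and share one coordinate system.
    """
    if len(markerless) != len(reference) or not markerless:
        raise ValueError("recordings must be non-empty and equally long")
    n_frames = len(markerless)
    n_points = len(markerless[0])
    squared_sums = [0.0] * n_points
    for frame_a, frame_b in zip(markerless, reference):
        if len(frame_a) != n_points or len(frame_b) != n_points:
            raise ValueError("every frame must contain the same number of points")
        for i, (p, q) in enumerate(zip(frame_a, frame_b)):
            # Squared 3D Euclidean distance between the two systems for point i.
            squared_sums[i] += sum((a - b) ** 2 for a, b in zip(p, q))
    # Root of the mean squared distance over all frames, per point.
    return [math.sqrt(s / n_frames) for s in squared_sums]


# Toy data (made up for illustration): 2 frames, 2 lip points, coordinates in mm.
markerless = [[(0.0, 0.0, 0.0), (10.0, 0.0, 0.0)],
              [(0.0, 1.0, 0.0), (10.0, 0.0, 2.0)]]
reference = [[(0.0, 0.0, 0.0), (10.0, 0.0, 0.0)],
             [(0.0, 0.0, 0.0), (10.0, 0.0, 0.0)]]
rmse = per_point_rmse(markerless, reference)
```

Computing one value per point, rather than a single pooled figure, matches the way the abstract states the result ("for most lip points"), since it lets individual points that track poorly stand out.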
Year: 2015
Conference: 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015
Location: Dresden, Germany
Dates: September 6-10, 2015
Pages: 2162-2166
Authors: Bandini, A.; Ouni, S.; Cosi, P.; Orlandi, S.; Manfredi, C.
Files in this item:
File: 2015_Bandini_INTERSPEECH.pdf
Access: restricted
Type: Doctoral thesis
License: [IR] closed
Size: 206.52 kB
Format: Adobe PDF
Use this identifier to cite or link to this document: https://hdl.handle.net/11380/1401694
Citations
  • PMC: ND
  • Scopus: 7
  • ISI: 4