Enhancing learning accessibility through fully automatic captioning / Federico, Maria; Furini, Marco. - ELECTRONIC. - (2012), pp. 40:1-40:4. (Paper presented at the International Cross-Disciplinary Conference on Web Accessibility (W4A-2012), held in Lyon, France, 16-17 April 2012) [10.1145/2207016.2207053].
Enhancing learning accessibility through fully automatic captioning
FEDERICO, Maria;FURINI, Marco
2012
Abstract
The simple act of listening or of taking notes while attending a lesson may represent an insuperable burden for millions of people with some form of disability (e.g., hearing-impaired, dyslexic, and ESL students). In this paper, we propose an architecture that aims at automatically creating captions for video lessons by exploiting advances in speech recognition technologies. Our approach couples off-the-shelf ASR (Automatic Speech Recognition) software with a novel caption alignment mechanism that introduces unique audio markups into the audio stream before passing it to the ASR, and then transforms the plain transcript produced by the ASR into a timecoded transcript.

File | Size | Format |
---|---|---|
2012-W4A.pdf (Restricted access. Description: Main article. Type: AAM - Author's revised version accepted for publication) | 252.9 kB | Adobe PDF |
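The abstract describes the alignment idea only at a high level, so the following is a minimal sketch under stated assumptions rather than the authors' implementation. It assumes that each audio markup is inserted at a known time and that the ASR prints it as a recognizable token in the plain transcript. The token format (`MARK0000`), the `Caption` type, and the `align` function are all hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass
class Caption:
    start: float  # seconds from the beginning of the lesson audio
    text: str


def marker_token(index: int) -> str:
    """Hypothetical token we assume the ASR emits for the index-th audio markup."""
    return f"MARK{index:04d}"


def align(transcript: str, marker_times: dict[str, float]) -> list[Caption]:
    """Split a plain transcript at marker tokens and attach the known times.

    Assumption: marker_times maps each marker token to the time (in seconds)
    at which that markup was inserted into the audio stream.
    """
    captions: list[Caption] = []
    current_start = 0.0  # words before the first marker are timed at 0.0
    words: list[str] = []
    for word in transcript.split():
        if word in marker_times:
            # A marker closes the running caption and opens a new one.
            if words:
                captions.append(Caption(current_start, " ".join(words)))
            current_start = marker_times[word]
            words = []
        else:
            words.append(word)
    if words:
        captions.append(Caption(current_start, " ".join(words)))
    return captions


if __name__ == "__main__":
    times = {marker_token(0): 0.0, marker_token(1): 4.5, marker_token(2): 9.0}
    plain = (
        "MARK0000 welcome to the lesson "
        "MARK0001 today we study graphs "
        "MARK0002 let us begin"
    )
    for caption in align(plain, times):
        print(f"{caption.start:6.1f}s  {caption.text}")
```

Because the time of each markup is known before the audio reaches the ASR, this sketch never needs word-level timestamps from the recognizer. That is one plausible reason for pairing markups with off-the-shelf ASR software, though the abstract does not state it.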