Automated pharyngeal phase detection and bolus localization in videofluoroscopic swallowing study: Killing two birds with one stone?

Bandini, A.; Smaoui, S.; Steele, C. M.

doi:10.1016/j.cmpb.2022.107058

Background and objective: The videofluoroscopic swallowing study (VFSS) is a gold-standard imaging technique for assessing swallowing, but analysis and rating of VFSS recordings are time-consuming and require specialized training and expertise. Researchers have recently demonstrated that it is possible to automatically detect the pharyngeal phase of swallowing and to localize the bolus in VFSS recordings via computer vision approaches, fostering the development of novel techniques for automatic VFSS analysis. However, training algorithms to perform these tasks requires large amounts of annotated data that are seldom available. In this paper, we demonstrate that the challenges of pharyngeal phase detection and bolus localization can be solved together using a single approach.

Methods: We propose a deep-learning framework that jointly tackles pharyngeal phase detection and bolus localization in a weakly supervised manner, requiring only the initial and final frames of the pharyngeal phase as ground truth annotations for training. Our approach stems from the observation that bolus presence in the pharynx is the most prominent visual feature upon which to infer whether individual VFSS frames belong to the pharyngeal phase. We conducted extensive experiments with multiple convolutional neural networks (CNNs) on a dataset of 1245 bolus-level clips from 59 healthy subjects.

Results: We demonstrated that the pharyngeal phase can be detected with an F1-score higher than 0.9. Moreover, by processing the class activation maps of the CNNs, we were able to localize the bolus with promising results, obtaining correlations with ground truth trajectories higher than 0.9, without any manual annotations of bolus location used for training purposes.

Conclusions: Once validated on a larger sample of participants with swallowing disorders, our framework will pave the way for the development of intelligent tools for VFSS analysis to support clinicians in swallowing assessment.
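The abstract gives no implementation details, so the following is only a minimal sketch of the ideas it names, not the authors' code. It assumes that per-frame labels are derived from the annotated first and last frames of the pharyngeal phase, that frame classification is scored with a standard F1-score, that a bolus position is estimated as the activation-weighted centroid of a class activation map, and that estimated and reference trajectories are compared with Pearson correlation. All function names (`frame_labels`, `f1_score`, `cam_centroid`, `pearson`) are hypothetical, and the centroid rule is an illustrative choice rather than the processing described in the paper.

```python
import math


def frame_labels(n_frames, onset, offset):
    """Weak labels: 1 for frames inside [onset, offset], 0 elsewhere."""
    return [1 if onset <= i <= offset else 0 for i in range(n_frames)]


def f1_score(y_true, y_pred):
    """F1-score of the positive (pharyngeal phase) class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def cam_centroid(cam):
    """Activation-weighted (row, col) centroid of a 2-D map.

    Negative activations are clipped to zero. Returns None when the map
    has no positive activation (assumed here to mean "no bolus found").
    """
    total = row_sum = col_sum = 0.0
    for r, row in enumerate(cam):
        for c, value in enumerate(row):
            weight = max(float(value), 0.0)
            total += weight
            row_sum += r * weight
            col_sum += c * weight
    if total == 0.0:
        return None
    return (row_sum / total, col_sum / total)


def pearson(a, b):
    """Pearson correlation between two equal-length, non-constant sequences."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


if __name__ == "__main__":
    # Toy clip of 10 frames with the pharyngeal phase annotated at frames 3..6.
    truth = frame_labels(10, 3, 6)
    predicted = frame_labels(10, 4, 7)  # a detector that is one frame late
    print("F1:", f1_score(truth, predicted))

    # Toy 5x5 activation map with two equally active cells in column 3.
    cam = [[0.0] * 5 for _ in range(5)]
    cam[1][3] = 1.0
    cam[3][3] = 1.0
    print("Centroid:", cam_centroid(cam))
```

In practice the activation maps would come from a trained CNN and the reference trajectories from manual annotation used only for evaluation; both are replaced by toy values here.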

Automated pharyngeal phase detection and bolus localization in videofluoroscopic swallowing study: Killing two birds with one stone? / Bandini, A., Smaoui, S., Steele, C.M. - In: COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE. - ISSN 0169-2607. - 225:(2022), pp. 1-11. [10.1016/j.cmpb.2022.107058]



Publication year: 2022
Publication language(s): English
Journal: COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE
Volume: 225
First page: 1
Last page: 11
DOI: https://dx.doi.org/10.1016/j.cmpb.2022.107058
WoS ID: WOS:000856927600009
Scopus ID: 2-s2.0-85135685605
PubMed ID: 35961072
Keywords: Bolus localization; Convolutional neural networks; Pharyngeal phase; Video classification; Videofluoroscopic swallowing study
SDG - Sustainable Development Goals: Goal 2: Zero hunger; Goal 3: Good health and well-being
Full text: partially_open
Type: Journal contribution, journal article (info:eu-repo/semantics/article)
Faculty-site type: 262
All authors: Bandini, A.; Smaoui, S.; Steele, C. M.
Number of authors: 3

Files in this record:

File	Access	Type	License	Size	Format
2022_Bandini_CMPB.pdf	Restricted access	VOR - Publisher's published version	[IR] closed	2.08 MB	Adobe PDF
2111.04699v2.pdf	Open access	AAM - Author's version, revised and accepted for publication	[IR] creative-commons	924.25 kB	Adobe PDF


The metadata in IRIS UNIMORE are released under the Creative Commons CC0 1.0 Universal license, while the publication files are released under the Attribution 4.0 International (CC BY 4.0) license, unless otherwise indicated.