Convolutional Neural Networks (CNNs) have recently been proposed to automatically detect the pharyngeal phase in videofluoroscopic swallowing studies (VFSS). However, there is a lack of consensus regarding the best algorithmic strategy to adopt for segmenting this important yet rapid phase of the swallow. Moreover, additional information is needed to understand how small the detection error should be, in view of translating this approach for use in clinical practice. In this manuscript we compare multiple CNN-based algorithms for detecting the pharyngeal phase in VFSS bolus-level clips, specifically looking at 2DCNN and 3DCNN approaches with different temporal windows as input. Our results showed that a 2DCNN analysis on 3-frame windows outperformed both frame-by-frame approaches and 3DCNNs. We also demonstrated that the detection accuracy of the pharyngeal phase is very close to the clinical gold standard (i.e., trained clinical raters). These results demonstrate the feasibility of deep learning-based algorithms for developing intelligent approaches to automatically support clinicians in the analysis of VFSS data.Clinical relevance - Accurate and reliable segmentation of the pharyngeal phase will support clinicians by reducing the time needed for rating VFSS data. Moreover, automatic detection of this phase can be seen as a foundation for building novel and intelligent approaches to detect clinical features of interest in VFSS, such as the presence of penetration-aspiration.

The effect of time on the automated detection of the pharyngeal phase in videofluoroscopic swallowing studies / Bandini, A.; Steele, C. M.. - 2021-:(2021), pp. 3435-3438. ( 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021 mex NOV 01-05, 2021) [10.1109/EMBC46164.2021.9629562].

The effect of time on the automated detection of the pharyngeal phase in videofluoroscopic swallowing studies

Bandini A.;
2021

Abstract

Convolutional Neural Networks (CNNs) have recently been proposed to automatically detect the pharyngeal phase in videofluoroscopic swallowing studies (VFSS). However, there is a lack of consensus regarding the best algorithmic strategy to adopt for segmenting this important yet rapid phase of the swallow. Moreover, additional information is needed to understand how small the detection error should be, in view of translating this approach for use in clinical practice. In this manuscript we compare multiple CNN-based algorithms for detecting the pharyngeal phase in VFSS bolus-level clips, specifically looking at 2DCNN and 3DCNN approaches with different temporal windows as input. Our results showed that a 2DCNN analysis on 3-frame windows outperformed both frame-by-frame approaches and 3DCNNs. We also demonstrated that the detection accuracy of the pharyngeal phase is very close to the clinical gold standard (i.e., trained clinical raters). These results demonstrate the feasibility of deep learning-based algorithms for developing intelligent approaches to automatically support clinicians in the analysis of VFSS data.Clinical relevance - Accurate and reliable segmentation of the pharyngeal phase will support clinicians by reducing the time needed for rating VFSS data. Moreover, automatic detection of this phase can be seen as a foundation for building novel and intelligent approaches to detect clinical features of interest in VFSS, such as the presence of penetration-aspiration.
2021
43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021
mex
NOV 01-05, 2021
2021-
3435
3438
Bandini, A.; Steele, C. M.
The effect of time on the automated detection of the pharyngeal phase in videofluoroscopic swallowing studies / Bandini, A.; Steele, C. M.. - 2021-:(2021), pp. 3435-3438. ( 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021 mex NOV 01-05, 2021) [10.1109/EMBC46164.2021.9629562].
File in questo prodotto:
File Dimensione Formato  
2021_Bandini_EMBC.pdf

Accesso riservato

Tipologia: VOR - Versione pubblicata dall'editore
Licenza: [IR] closed
Dimensione 1.98 MB
Formato Adobe PDF
1.98 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
nihms-1783274.pdf

Open access

Tipologia: AAM - Versione dell'autore revisionata e accettata per la pubblicazione
Licenza: [IR] unspecified-oa
Dimensione 199.9 kB
Formato Adobe PDF
199.9 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1401681
Citazioni
  • ???jsp.display-item.citation.pmc??? 4
  • Scopus 10
  • ???jsp.display-item.citation.isi??? 6
social impact