Background: Early identification of patients with acute hypoxemic respiratory failure (AHRF) who are at risk of failing high-flow nasal cannula (HFNC) therapy could facilitate closer monitoring, and timely adjustment/escalation of treatment. We aimed to establish whether machine learning (ML) models could predict HFNC outcome, early in the course of treatment, with greater accuracy than currently used clinical indices. Methods: We developed ML models trained using measurements made within the first two hours of treatment from 184 AHRF patients (37% HFNC failures) treated at the respiratory ICU of the University Hospital of Modena between 2018 and 2023. For external validation, we used a dataset on 567 AHRF patients (22% failures) comprising 510 patients from the recent RENOVATE trial in Brazil and 57 from the MIMIC-IV and eICU databases in the US. Predictive performance of the ML models was benchmarked against optimized thresholds of the following clinical indices: respiratory rate oxygenation index (ROX) and variants, heart rate to saturation of pulse oxygen (SpO2) ratio, SpO2/FiO2 ratio, PaO2/FiO2 ratio, sequential organ failure assessment (SOFA) and heart rate, acidosis, consciousness, oxygenation and respiratory rate (HACOR) scores. Results: Internal and external predictive performance of a Support Vector Machine (SVM) ML model was superior to all clinical indices across all scenarios tested. In external validation on the 567-patient dataset, a SVM model trained on non-invasive measurements had an accuracy of 73%, sensitivity of 73%, specificity of 73%, and AUC of 0.79. The ROX index had an accuracy of 64%, sensitivity of 79%, specificity of 60%, and AUC of 0.74. When arterial blood gasses (ABG’s) were also used for model training, the SVM model had an accuracy of 83%, sensitivity of 84%, specificity of 82%, and AUC of 0.82 in external validation on the MIMICIV/eICU dataset. The modified ROX index, which requires PaO2, achieved 70% accuracy, 63% sensitivity, 74% specificity, and AUC of 0.65. Conclusions: Decision support tools based on SVM models could provide clinicians with more accurate early predictions of HFNC outcome than currently available clinical indices. If available, ABG measurements could improve the capability to accurately identify patients at risk of failing HFNC therapy.
Machine learning models compared with current clinical indices to predict the outcome of high flow nasal cannula therapy in acute hypoxemic respiratory failure / Yu, Hang; Saffaran, Sina; Tonelli, Roberto; Laffey, John; Esquinas, Antonio; Martins de Lima, Lucas; Kawano-Dourado, Letícia; Maia, Israel; Biasi Cavalcanti, Alexandre; Clini, Enrico; Bates, Declan. - In: CRITICAL CARE. - ISSN 1466-609X. - 29:(2025), pp. 1-14. [10.1186/s13054-025-05336-4]
Machine learning models compared with current clinical indices to predict the outcome of high flow nasal cannula therapy in acute hypoxemic respiratory failure
Roberto Tonelli;Enrico Clini
;
2025
Abstract
Background: Early identification of patients with acute hypoxemic respiratory failure (AHRF) who are at risk of failing high-flow nasal cannula (HFNC) therapy could facilitate closer monitoring, and timely adjustment/escalation of treatment. We aimed to establish whether machine learning (ML) models could predict HFNC outcome, early in the course of treatment, with greater accuracy than currently used clinical indices. Methods: We developed ML models trained using measurements made within the first two hours of treatment from 184 AHRF patients (37% HFNC failures) treated at the respiratory ICU of the University Hospital of Modena between 2018 and 2023. For external validation, we used a dataset on 567 AHRF patients (22% failures) comprising 510 patients from the recent RENOVATE trial in Brazil and 57 from the MIMIC-IV and eICU databases in the US. Predictive performance of the ML models was benchmarked against optimized thresholds of the following clinical indices: respiratory rate oxygenation index (ROX) and variants, heart rate to saturation of pulse oxygen (SpO2) ratio, SpO2/FiO2 ratio, PaO2/FiO2 ratio, sequential organ failure assessment (SOFA) and heart rate, acidosis, consciousness, oxygenation and respiratory rate (HACOR) scores. Results: Internal and external predictive performance of a Support Vector Machine (SVM) ML model was superior to all clinical indices across all scenarios tested. In external validation on the 567-patient dataset, a SVM model trained on non-invasive measurements had an accuracy of 73%, sensitivity of 73%, specificity of 73%, and AUC of 0.79. The ROX index had an accuracy of 64%, sensitivity of 79%, specificity of 60%, and AUC of 0.74. When arterial blood gasses (ABG’s) were also used for model training, the SVM model had an accuracy of 83%, sensitivity of 84%, specificity of 82%, and AUC of 0.82 in external validation on the MIMICIV/eICU dataset. The modified ROX index, which requires PaO2, achieved 70% accuracy, 63% sensitivity, 74% specificity, and AUC of 0.65. Conclusions: Decision support tools based on SVM models could provide clinicians with more accurate early predictions of HFNC outcome than currently available clinical indices. If available, ABG measurements could improve the capability to accurately identify patients at risk of failing HFNC therapy.File | Dimensione | Formato | |
---|---|---|---|
Yu (Machine learning to predict success of HFNO in ARF- 2025).pdf
Open access
Tipologia:
VOR - Versione pubblicata dall'editore
Dimensione
3.83 MB
Formato
Adobe PDF
|
3.83 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris