Background: HER2-low expression has gained clinical relevance in breast cancer (BC) due to the availability of anti-HER2 antibody-drug conjugates for patients with HER2-low metastatic BC. The well-reported instability of HER2-low status during disease evolution highlights the need to identify patients with HER2-0 primary BC who may develop a HER2-low phenotype at relapse. In response to the urgency of maximizing treatment access, we utilized artificial intelligence to predict this occurrence. Patients and methods: We included a large multicentric retrospective cohort of patients with BC who underwent tissue resampling at relapse. The dataset was preprocessed to address relevant issues such as missing data, feature abundance, and target class imbalance. We then trained two models: one focused on explainability [Extreme Gradient Boosting (XGBoost)] and another aimed at performance (an ensemble of XGBoost and support vector machine). Results: A total of 1200 patients were included in this study. Among 386 patients with HER2-0 primary BC and matched HER2 status at relapse, 42.5% (n = 157) converted to a HER2-low phenotype. The explainable model achieved a balanced accuracy of 58%, with a sensitivity of 53% and a specificity of 64%. The most important variables for this model were primary BC phenotype [mean Shapley value (SHAP) 0.540], primary BC histological type (SHAP 0.101), grade (SHAP 0.182), and sites of relapse (SHAP 0.008-0.213). The ensemble model had a balanced accuracy of 64%, with a sensitivity of 75% and a specificity of 53%. Conclusions: This work represents one of the first proof-of-concept applications of machine learning models to predict a highly relevant phenomenon for drug access in modern BC oncology. Starting with an explainable model and subsequently integrating it with an ensemble approach enabled us to enhance performance while maintaining transparency, explainability, and intelligibility.

Development of two machine learning models to predict conversion from primary HER2-0 breast cancer to HER2-low metastases: a proof-of-concept study / Miglietta, F.; Collesei, A.; Vernieri, C.; Giarratano, T.; Giorgi, C. A.; Girardi, F.; Griguolo, G.; Cacciatore, M.; Botticelli, A.; Vingiani, A.; Fotia, G.; Piacentini, F.; Massa, D.; Zanghì, F.; Marino, M.; Pruneri, G.; Fassan, M.; Dei Tos, A. P.; Dieci, M. V.; Guarneri, V.. - In: ESMO OPEN. - ISSN 2059-7029. - 10:1(2025), pp. 1-9. [10.1016/j.esmoop.2024.104087]

Development of two machine learning models to predict conversion from primary HER2-0 breast cancer to HER2-low metastases: a proof-of-concept study

Piacentini F.;Dieci M. V.;Guarneri V.
2025

Abstract

Background: HER2-low expression has gained clinical relevance in breast cancer (BC) due to the availability of anti-HER2 antibody-drug conjugates for patients with HER2-low metastatic BC. The well-reported instability of HER2-low status during disease evolution highlights the need to identify patients with HER2-0 primary BC who may develop a HER2-low phenotype at relapse. In response to the urgency of maximizing treatment access, we utilized artificial intelligence to predict this occurrence. Patients and methods: We included a large multicentric retrospective cohort of patients with BC who underwent tissue resampling at relapse. The dataset was preprocessed to address relevant issues such as missing data, feature abundance, and target class imbalance. We then trained two models: one focused on explainability [Extreme Gradient Boosting (XGBoost)] and another aimed at performance (an ensemble of XGBoost and support vector machine). Results: A total of 1200 patients were included in this study. Among 386 patients with HER2-0 primary BC and matched HER2 status at relapse, 42.5% (n = 157) converted to a HER2-low phenotype. The explainable model achieved a balanced accuracy of 58%, with a sensitivity of 53% and a specificity of 64%. The most important variables for this model were primary BC phenotype [mean Shapley value (SHAP) 0.540], primary BC histological type (SHAP 0.101), grade (SHAP 0.182), and sites of relapse (SHAP 0.008-0.213). The ensemble model had a balanced accuracy of 64%, with a sensitivity of 75% and a specificity of 53%. Conclusions: This work represents one of the first proof-of-concept applications of machine learning models to predict a highly relevant phenomenon for drug access in modern BC oncology. Starting with an explainable model and subsequently integrating it with an ensemble approach enabled us to enhance performance while maintaining transparency, explainability, and intelligibility.
2025
10
1
1
9
Development of two machine learning models to predict conversion from primary HER2-0 breast cancer to HER2-low metastases: a proof-of-concept study / Miglietta, F.; Collesei, A.; Vernieri, C.; Giarratano, T.; Giorgi, C. A.; Girardi, F.; Griguolo, G.; Cacciatore, M.; Botticelli, A.; Vingiani, A.; Fotia, G.; Piacentini, F.; Massa, D.; Zanghì, F.; Marino, M.; Pruneri, G.; Fassan, M.; Dei Tos, A. P.; Dieci, M. V.; Guarneri, V.. - In: ESMO OPEN. - ISSN 2059-7029. - 10:1(2025), pp. 1-9. [10.1016/j.esmoop.2024.104087]
Miglietta, F.; Collesei, A.; Vernieri, C.; Giarratano, T.; Giorgi, C. A.; Girardi, F.; Griguolo, G.; Cacciatore, M.; Botticelli, A.; Vingiani, A.; Fot...espandi
File in questo prodotto:
File Dimensione Formato  
Development of two machine learning models to predict conversion from primary HER2-0 breast cancer to HER2-low metastases_ a proof-of-concept study.pdf

Open access

Tipologia: VOR - Versione pubblicata dall'editore
Dimensione 447.86 kB
Formato Adobe PDF
447.86 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1369068
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact