Large-scale benchmarks such as BraTS have driven progress in brain tumor segmentation, but they provide only masks with limited access to the clinical semantics found in radiology reports. We introduce ReportX, a paired resource of 257 clinical reports aligned to BraTS-GLI-2023 subjects, structured into a rich set of qualitative and quantitative attributes. Qualitative fields are curated by clinicians, while quantitative descriptors are automatically derived via atlas-based localization and geometric computations. We compare our annotation schema to existing report-augmented datasets and show that ReportX provides substantially broader coverage of clinically relevant factors. To exploit this supervision, we encode reports using biomedical language models and incorporate their embeddings as auxiliary semantic guidance for 3D tumor segmentation during training. Experimental results demonstrate that the proposed vision-text alignment improves segmentation performance on standard BraTS metrics, with clinically curated reports providing more consistent improvements than automatically generated or less-structured counterparts. We publicly release the dataset (https://ditto.ing.unimore.it/reportx) and the source code (https://github.com/AImageLab-zip/ReportX).

ReportX: The BraTS Clinical Report Dataset / Marchesini, K., Carpentiero, O., Del Gaudio, L., Farioli, F., Cucchiara, R., Grana, C., Cuculo, V., Bolelli, F.. - (2026). (29th International Conference on Medical Image Computing and Computer Assisted Intervention Strasbourg, France Sep 27-Oct 1).

ReportX: The BraTS Clinical Report Dataset

Marchesini, Kevin;Carpentiero, Omar;Del Gaudio, Livia;Farioli, Francesco;Cucchiara, Rita;Grana, Costantino;Cuculo, Vittorio;Bolelli, Federico
2026

Abstract

Large-scale benchmarks such as BraTS have driven progress in brain tumor segmentation, but they provide only masks with limited access to the clinical semantics found in radiology reports. We introduce ReportX, a paired resource of 257 clinical reports aligned to BraTS-GLI-2023 subjects, structured into a rich set of qualitative and quantitative attributes. Qualitative fields are curated by clinicians, while quantitative descriptors are automatically derived via atlas-based localization and geometric computations. We compare our annotation schema to existing report-augmented datasets and show that ReportX provides substantially broader coverage of clinically relevant factors. To exploit this supervision, we encode reports using biomedical language models and incorporate their embeddings as auxiliary semantic guidance for 3D tumor segmentation during training. Experimental results demonstrate that the proposed vision-text alignment improves segmentation performance on standard BraTS metrics, with clinically curated reports providing more consistent improvements than automatically generated or less-structured counterparts. We publicly release the dataset (https://ditto.ing.unimore.it/reportx) and the source code (https://github.com/AImageLab-zip/ReportX).
2026
23-giu-2026
29th International Conference on Medical Image Computing and Computer Assisted Intervention
Strasbourg, France
Sep 27-Oct 1
Marchesini, Kevin; Carpentiero, Omar; Del Gaudio, Livia; Farioli, Francesco; Cucchiara, Rita; Grana, Costantino; Cuculo, Vittorio; Bolelli, Federico...espandi
ReportX: The BraTS Clinical Report Dataset / Marchesini, K., Carpentiero, O., Del Gaudio, L., Farioli, F., Cucchiara, R., Grana, C., Cuculo, V., Bolelli, F.. - (2026). (29th International Conference on Medical Image Computing and Computer Assisted Intervention Strasbourg, France Sep 27-Oct 1).
File in questo prodotto:
File Dimensione Formato  
Paper-0852.pdf

embargo fino al 15/10/2027

Tipologia: AAM - Versione dell'autore revisionata e accettata per la pubblicazione
Dimensione 1.83 MB
Formato Adobe PDF
1.83 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1411368
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact