Prediction rigidities for data-driven chemistry

: The widespread application of machine learning (ML) to the chemical sciences is making it very important to understand how the ML models learn to correlate chemical structures with their properties, and what can be done to improve the training efficiency whilst guaranteeing interpretability and transferability. In this work, we demonstrate the wide utility of prediction rigidities, a family of metrics derived from the loss function, in understanding the robustness of ML model predictions. We show that the prediction rigidities allow the assessment of the model not only at the global level, but also on the local or the component-wise level at which the intermediate (e.g. atomic, body-ordered, or range-separated) predictions are made. We leverage these metrics to understand the learning behavior of different ML models, and to guide efficient dataset construction for model training. We finally implement the formalism for a ML model targeting a coarse-grained system to demonstrate the applicability of the prediction rigidities to an even broader class of atomistic modeling problems.

Prediction rigidities for data-driven chemistry / Chong, Sanggyu; Bigi, Filippo; Grasselli, Federico; Loche, Philip; Kellner, Matthias; Ceriotti, Michele. - In: FARADAY DISCUSSIONS. - ISSN 1359-6640. - 256:(2024), pp. 322-344. [10.1039/d4fd00101j]

Prediction rigidities for data-driven chemistry

Chong, Sanggyu;Bigi, Filippo;Grasselli, Federico;Loche, Philip;Kellner, Matthias;Ceriotti, Michele

2024

Abstract

: The widespread application of machine learning (ML) to the chemical sciences is making it very important to understand how the ML models learn to correlate chemical structures with their properties, and what can be done to improve the training efficiency whilst guaranteeing interpretability and transferability. In this work, we demonstrate the wide utility of prediction rigidities, a family of metrics derived from the loss function, in understanding the robustness of ML model predictions. We show that the prediction rigidities allow the assessment of the model not only at the global level, but also on the local or the component-wise level at which the intermediate (e.g. atomic, body-ordered, or range-separated) predictions are made. We leverage these metrics to understand the learning behavior of different ML models, and to guide efficient dataset construction for model training. We finally implement the formalism for a ML model targeting a coarse-grained system to demonstrate the applicability of the prediction rigidities to an even broader class of atomistic modeling problems.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2024
			
	Rivista
	
				FARADAY DISCUSSIONS
			
	N° del Volume
	
				256
			
	Pagina iniziale
	
				322
			
	Pagina finale
	
				344
			
	Codice DOI
	
				https://dx.doi.org/10.1039/d4fd00101j
			
	Codice WoS
	
				WOS:001320927000001
			
	Codice Scopus
	
				2-s2.0-85205905356
			
	Codice PubMed
	
				39319702
			
	Citazione
	
				Prediction rigidities for data-driven chemistry / Chong, Sanggyu; Bigi, Filippo; Grasselli, Federico; Loche, Philip; Kellner, Matthias; Ceriotti, Michele. - In: FARADAY DISCUSSIONS. - ISSN 1359-6640. - 256:(2024), pp. 322-344. [10.1039/d4fd00101j]
			
	Tutti gli autori
	
						Chong, Sanggyu; Bigi, Filippo; Grasselli, Federico; Loche, Philip; Kellner, Matthias; Ceriotti, Michele
					
	Tipologia
	
				Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
d4fd00101j.pdf Open access Tipologia: VOR - Versione pubblicata dall'editore Dimensione 2.06 MB Formato Adobe PDF Visualizza/Apri	2.06 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1362576

Citazioni

0

0

0

social impact