In many cases, the predictions of machine learning interatomic potentials (MLIPs) can be interpreted as a sum of body-ordered contributions, which is explicit when the model is directly built on neighbor density correlation descriptors and is implicit when the model captures the correlations through the non-linear functions of low body-order terms. In both cases, the "effective body-orderedness" of MLIPs remains largely unexplained: how do the models decompose the total energy into body-ordered contributions, and how does their body-orderedness affect the accuracy and learning behavior? In answering these questions, we first discuss the complexities in imposing the many-body expansion on ab initio calculations at the atomic limit. Next, we train a curated set of MLIPs on datasets of hydrogen clusters and reveal the inherent tendency of the ML models to deduce their own, effective body-order trends, which are dependent on the model type and dataset makeup. Finally, we present different trends in the convergence of the body-orders and generalizability of the models, providing useful insights into the development of future MLIPs.

Resolving the body-order paradox of machine learning interatomic potentials / Chong, S.; Jiang, T.; Domina, M.; Bigi, F.; Grasselli, F.; Lee, J.; Ceriotti, M.. - In: THE JOURNAL OF CHEMICAL PHYSICS. - ISSN 0021-9606. - 164:6(2026), pp. 1-12. [10.1063/5.0303302]

Resolving the body-order paradox of machine learning interatomic potentials

Grasselli F.;
2026

Abstract

In many cases, the predictions of machine learning interatomic potentials (MLIPs) can be interpreted as a sum of body-ordered contributions, which is explicit when the model is directly built on neighbor density correlation descriptors and is implicit when the model captures the correlations through the non-linear functions of low body-order terms. In both cases, the "effective body-orderedness" of MLIPs remains largely unexplained: how do the models decompose the total energy into body-ordered contributions, and how does their body-orderedness affect the accuracy and learning behavior? In answering these questions, we first discuss the complexities in imposing the many-body expansion on ab initio calculations at the atomic limit. Next, we train a curated set of MLIPs on datasets of hydrogen clusters and reveal the inherent tendency of the ML models to deduce their own, effective body-order trends, which are dependent on the model type and dataset makeup. Finally, we present different trends in the convergence of the body-orders and generalizability of the models, providing useful insights into the development of future MLIPs.
2026
164
6
1
12
Resolving the body-order paradox of machine learning interatomic potentials / Chong, S.; Jiang, T.; Domina, M.; Bigi, F.; Grasselli, F.; Lee, J.; Ceriotti, M.. - In: THE JOURNAL OF CHEMICAL PHYSICS. - ISSN 0021-9606. - 164:6(2026), pp. 1-12. [10.1063/5.0303302]
Chong, S.; Jiang, T.; Domina, M.; Bigi, F.; Grasselli, F.; Lee, J.; Ceriotti, M.
File in questo prodotto:
File Dimensione Formato  
064121_1_5.0303302.pdf

Open access

Tipologia: VOR - Versione pubblicata dall'editore
Dimensione 5.09 MB
Formato Adobe PDF
5.09 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1400628
Citazioni
  • ???jsp.display-item.citation.pmc??? 1
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact