Successfully training models which address the complex nature of real environments in Smart Cities and robotic networks is challenging, due to the vast amount of data needed. When employing Reinforcement Learning (RL) models, it is also impossible to let them explore all the actions in the real world, due to potential damages and inappropriate actions. The inherent trial-and-error nature of RL, especially in real-world applications like traffic management, makes an integration with Digital Twins (DTs) attractive: DTs provide a secure environment for iterative training and testing, ensuring the refinement of the models before testing. However, the use of DTs has some significant limitations, as the difference between the model and reality may cause significant risks. This study focuses on the adaptive and self-learning characteristics of this approach, considering a standard navigation task in a highway environment, and analyzes its advantages and potential pitfalls.
On the Limits of Digital Twins for Safe Deep Reinforcement Learning in Robotic Networks / Balghouthi, M. I.; Chiariotti, F.; Bedogni, L.. - (2024), pp. 1-6. (Intervento presentato al convegno 2024 IEEE Conference on Computer Communications Workshops, INFOCOM WKSHPS 2024 tenutosi a Vancouver, CANADA nel MAY 20-23, 2024) [10.1109/INFOCOMWKSHPS61880.2024.10620857].
On the Limits of Digital Twins for Safe Deep Reinforcement Learning in Robotic Networks
Bedogni L.
2024
Abstract
Successfully training models which address the complex nature of real environments in Smart Cities and robotic networks is challenging, due to the vast amount of data needed. When employing Reinforcement Learning (RL) models, it is also impossible to let them explore all the actions in the real world, due to potential damages and inappropriate actions. The inherent trial-and-error nature of RL, especially in real-world applications like traffic management, makes an integration with Digital Twins (DTs) attractive: DTs provide a secure environment for iterative training and testing, ensuring the refinement of the models before testing. However, the use of DTs has some significant limitations, as the difference between the model and reality may cause significant risks. This study focuses on the adaptive and self-learning characteristics of this approach, considering a standard navigation task in a highway environment, and analyzes its advantages and potential pitfalls.Pubblicazioni consigliate
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris