Deep learning-based method for vision-guided robotic grasping of unknown objects

Nowadays, robots are heavily used in factories for different tasks, most of them including grasping and manipulation of generic objects in unstructured scenarios. In order to better mimic a human operator involved in a grasping action, where he/she needs to identify the object and detect an optimal grasp by means of visual information, a widely adopted sensing solution is Artificial Vision. Nonetheless, state-of-art applications need long training and fine-tuning for manually build the object's model that is used at run-time during the normal operations, which reduce the overall operational throughput of the robotic system. To overcome such limits, the paper presents a framework based on Deep Convolutional Neural Networks (DCNN) to predict both single and multiple grasp poses for multiple objects all at once, using a single RGB image as input. Thanks to a novel loss function, our framework is trained in an end-to-end fashion and matches state-of-art accuracy with a substantially smaller architecture, which gives unprecedented real-time performances during experimental tests, and makes the application reliable for working on real robots. The system has been implemented using the ROS framework and tested on a Baxter collaborative robot.

Deep learning-based method for vision-guided robotic grasping of unknown objects / Bergamini, L.; Sposato, M.; Pellicciari, M.; Peruzzini, M.; Calderara, S.; Schmidt, J.. - In: ADVANCED ENGINEERING INFORMATICS. - ISSN 1474-0346. - 44:(2020), pp. 101052-101066. [10.1016/j.aei.2020.101052]

Deep learning-based method for vision-guided robotic grasping of unknown objects

Bergamini L.;Sposato M.;Pellicciari M.;Peruzzini M.;Calderara S.;Schmidt J.

2020

Abstract

Nowadays, robots are heavily used in factories for different tasks, most of them including grasping and manipulation of generic objects in unstructured scenarios. In order to better mimic a human operator involved in a grasping action, where he/she needs to identify the object and detect an optimal grasp by means of visual information, a widely adopted sensing solution is Artificial Vision. Nonetheless, state-of-art applications need long training and fine-tuning for manually build the object's model that is used at run-time during the normal operations, which reduce the overall operational throughput of the robotic system. To overcome such limits, the paper presents a framework based on Deep Convolutional Neural Networks (DCNN) to predict both single and multiple grasp poses for multiple objects all at once, using a single RGB image as input. Thanks to a novel loss function, our framework is trained in an end-to-end fashion and matches state-of-art accuracy with a substantially smaller architecture, which gives unprecedented real-time performances during experimental tests, and makes the application reliable for working on real robots. The system has been implemented using the ROS framework and tested on a Baxter collaborative robot.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2020
			
	Rivista
	
				ADVANCED ENGINEERING INFORMATICS
			
	N° del Volume
	
				44
			
	Pagina iniziale
	
				101052
			
	Pagina finale
	
				101066
			
	Codice DOI
	
				https://dx.doi.org/10.1016/j.aei.2020.101052
			
	Codice WoS
	
				WOS:000530699400008
			
	Codice Scopus
	
				2-s2.0-85079340469
			
	Citazione
	
				Deep learning-based method for vision-guided robotic grasping of unknown objects / Bergamini, L.; Sposato, M.; Pellicciari, M.; Peruzzini, M.; Calderara, S.; Schmidt, J.. - In: ADVANCED ENGINEERING INFORMATICS. - ISSN 1474-0346. - 44:(2020), pp. 101052-101066. [10.1016/j.aei.2020.101052]
			
	Tutti gli autori
	
						Bergamini, L.; Sposato, M.; Pellicciari, M.; Peruzzini, M.; Calderara, S.; Schmidt, J.
					
	Tipologia
	
				Articolo su rivista

File in questo prodotto:

Non ci sono file associati a questo prodotto.

Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1209700

Citazioni

ND

49

39

social impact