Deep learning has become a cornerstone in numerous applications, such as robotics, augmented reality, and autonomous systems, where accurate 6D pose estimation – determining an object’s position and orientation in 3D space – is critical. Despite its success, designing optimal neural architectures and tuning hyperparameters remain computationally expensive and challenging, especially in high-variance and data-intensive domains like 6D pose estimation. To address this, we investigate the application of a Neural Architecture Search (NAS) technique guided by classical Machine Learning-based performance predictors. As these predictors estimate final model performance using early-stage training data, we demonstrate that leveraging such a strategy allows for efficient exploration of the hyperparameter space, significantly reducing the computational burden of exhaustive search while still achieving improved performance. Building on an existing NAS method, we introduce a novel modification that enhances the efficiency of the search process, further accelerating convergence by nearly 71% without sacrificing accuracy. Through extensive experimentation on the LineMOD dataset, we demonstrate that our method consistently discovers high-performing configurations of an Augmented Autoencoder for 6D pose estimation, outperforming benchmark models by almost 15% in pose accuracy and 42% in reconstruction loss. These results underscore the potential of predictor-based NAS as a powerful and computationally efficient tool for neural architecture optimization in complex, real-world tasks.
Hyperparameter optimization of an augmented autoencoder for 6D object pose estimation via neural architecture search / Lombardi, M., Sapienza, D., Govi, E., Franchini, G.. - In: MATHEMATICS IN ENGINEERING. - ISSN 2640-3501. - 8:2(2026), pp. 150-180. [10.3934/mine.2026006]
Hyperparameter optimization of an augmented autoencoder for 6D object pose estimation via neural architecture search
Lombardi M.;Sapienza D.;Franchini G.
2026
Abstract
Deep learning has become a cornerstone in numerous applications, such as robotics, augmented reality, and autonomous systems, where accurate 6D pose estimation – determining an object’s position and orientation in 3D space – is critical. Despite its success, designing optimal neural architectures and tuning hyperparameters remain computationally expensive and challenging, especially in high-variance and data-intensive domains like 6D pose estimation. To address this, we investigate the application of a Neural Architecture Search (NAS) technique guided by classical Machine Learning-based performance predictors. As these predictors estimate final model performance using early-stage training data, we demonstrate that leveraging such a strategy allows for efficient exploration of the hyperparameter space, significantly reducing the computational burden of exhaustive search while still achieving improved performance. Building on an existing NAS method, we introduce a novel modification that enhances the efficiency of the search process, further accelerating convergence by nearly 71% without sacrificing accuracy. Through extensive experimentation on the LineMOD dataset, we demonstrate that our method consistently discovers high-performing configurations of an Augmented Autoencoder for 6D pose estimation, outperforming benchmark models by almost 15% in pose accuracy and 42% in reconstruction loss. These results underscore the potential of predictor-based NAS as a powerful and computationally efficient tool for neural architecture optimization in complex, real-world tasks.| File | Dimensione | Formato | |
|---|---|---|---|
|
10.3934_mine.2026006.pdf
Open access
Tipologia:
VOR - Versione pubblicata dall'editore
Licenza:
[IR] creative-commons
Dimensione
3.47 MB
Formato
Adobe PDF
|
3.47 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris




