Algorithms for 3D hand pose extraction: a novel implementation

In this work we construct a marker-less 3D hand pose ex- traction solution relying solely on a single RGB video source. We leverage available solutions for state-of-the-art computer vision models, coupled with a custom depth estimation algo- rithm to reconstruct full 3D space data. This system is pack- aged in a Python library capable of real time pose estimation, and is evaluated over two different datasets, one reflecting our intended task and a common academic dataset. The overall results shows limits in the accuracy of the model, measured on average above 10 cm, exceeding the desired precision of a few millimeters, but we acknowledge the relevancy of the results in the larger context of the field, and discuss potential avenues for future work.

Algorithms for 3D hand pose extraction: a novel implementation

TRAPANOTTO, MARTINO

2023/2024

Abstract

In this work we construct a marker-less 3D hand pose ex- traction solution relying solely on a single RGB video source. We leverage available solutions for state-of-the-art computer vision models, coupled with a custom depth estimation algo- rithm to reconstruct full 3D space data. This system is pack- aged in a Python library capable of real time pose estimation, and is evaluated over two different datasets, one reflecting our intended task and a common academic dataset. The overall results shows limits in the accuracy of the model, measured on average above 10 cm, exceeding the desired precision of a few millimeters, but we acknowledge the relevancy of the results in the larger context of the field, and discuss potential avenues for future work.

Scheda

Scheda DC

	Facoltà/Dipartimento
	
				Dipartimento di Ingegneria dell'Informazione - DEI
			
	Corso di studio
	
				COMPUTER ENGINEERING Laurea Magistrale (D.M. 270/2004)
			
	Anno Accademico
	
				2023
			
	Titolo inglese
	
				Algorithms for 3D hand pose extraction: a novel implementation
			
	Abstract in italiano
	
				In this work we construct a marker-less 3D hand pose ex-
traction solution relying solely on a single RGB video source.
We leverage available solutions for state-of-the-art computer
vision models, coupled with a custom depth estimation algo-
rithm to reconstruct full 3D space data. This system is pack-
aged in a Python library capable of real time pose estimation,
and is evaluated over two different datasets, one reflecting our
intended task and a common academic dataset. The overall
results shows limits in the accuracy of the model, measured
on average above 10 cm, exceeding the desired precision of
a few millimeters, but we acknowledge the relevancy of the
results in the larger context of the field, and discuss potential
avenues for future work.
			
	Parola chiave
	
				machine learning
computer vision
pose estimation
deep learning
			
	Relatore
	
				MENEGATTI, EMANUELE
			
	Appare nelle tipologie:
	
				Lauree magistrali

File in questo prodotto:

File	Dimensione	Formato
Trapanotto_Martino.pdf accesso aperto Dimensione 17.39 MB Formato Adobe PDF Visualizza/Apri	17.39 MB	Adobe PDF	Visualizza/Apri

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/66487