Preprint; Article; Version 1; Preserved in Portico; This version is not peer-reviewed

Comparative and Improvement Study of 3D Human Pose Estimation Algorithms using Monocular Cameras

Version 1: Received: 23 January 2024 / Approved: 23 January 2024 / Online: 23 January 2024 (08:40:08 CET)

How to cite: Choi, J.; Ha, E.; Kim, J. Comparative and Improvement Study of 3D Human Pose Estimation Algorithms using Monocular Cameras. Preprints 2024, 2024011668. https://doi.org/10.20944/preprints202401.1668.v1

Abstract

Human Pose Estimation (HPE) is a technique in computer vision and AI for detecting and tracking human body parts and poses from images or videos. Widely used in augmented reality, animation, fitness applications, and surveillance, HPE methods using monocular cameras are highly versatile due to their applicability in standard video and CCTV footage. These methods have evolved from 2D to 3D pose estimation. However, current 3D HPE methods trained on laboratory-based motion capture data encounter challenges such as limited training data, depth perception ambiguity, left/right switching, and issues with occlusions when applied in real-world environments. This study compares two representative 3D HPE methods by assessing their strengths and weaknesses with real-world videos. Then, we propose data processing techniques to eliminate and correct anomalies like left/right inversion and false detections of joint positions in daily life motions. Finally, we obtain joint angle trajectories using an optimization method based on a 3D humanoid simulator, taking as input the joint coordinate data corrected by applying the proposed human joint data processing technique. The efficacy of the proposed 3D HPE method is verified by applying it to three-dimensional freehand gymnastics exercises and comparing the joint angle trajectories during the motion.

Keywords

human pose estimation; monocular camera; MediaPipe Pose; HybrIK; outlier; optimization; humanoid model

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning
