Abstract

Accurately estimating 3D human poses and joint locations from only 2D keypoints, known as 3D human pose estimation (3D HPE), is challenging: the noise in the predictions produced by conventional 2D human pose estimators often degrades accuracy. In this paper, we present a diffusion-based model for 3D pose estimation, named Diff3DHPE, inspired by the noise-distillation abilities of diffusion models. The proposed model takes a temporal sequence of 2D keypoints as input to a GNN backbone and, during training, learns to extract the 3D pose from Gaussian noise through a diffusion process; the estimate is then refined through the reverse diffusion process. To overcome the over-smoothing issue in GNNs, Diff3DHPE is integrated with a discretized partial differential equation, which makes it a particular form of Graph Neural Diffusion (GRAND). Extensive experiments show that our model outperforms current state-of-the-art methods on two benchmark datasets, Human3.6M and MPI-INF-3DHP, achieving up to a 39.1% improvement in MPJPE on MPI-INF-3DHP. The code is available at https://github.com/socoolzjm/Diff3DHPE.
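To make the "discretized partial differential equation" idea concrete, below is a minimal, hypothetical sketch of a GRAND-style graph layer: an explicit-Euler discretization of the diffusion equation dX/dt = (A(X) - I)X over the skeleton graph, where A(X) is a learned, row-normalized attention matrix. The class name `GrandLayer`, the attention parameterization, and the step size are illustrative assumptions and not the authors' implementation.

```python
# Hypothetical sketch (not the authors' code): a GRAND-style graph diffusion layer,
# i.e. an explicit-Euler step of dX/dt = (A(X) - I) X over joint features.
import torch
import torch.nn as nn


class GrandLayer(nn.Module):
    def __init__(self, dim: int, step_size: float = 0.25):
        super().__init__()
        self.step_size = step_size          # tau in the Euler update (assumed value)
        self.key = nn.Linear(dim, dim)
        self.query = nn.Linear(dim, dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, num_joints, dim) node features for the skeleton graph
        attn = torch.softmax(
            self.query(x) @ self.key(x).transpose(-1, -2) / x.shape[-1] ** 0.5,
            dim=-1,
        )                                   # A(X), row-stochastic attention over joints
        # Explicit Euler step: X_{k+1} = X_k + tau * (A(X_k) - I) X_k
        return x + self.step_size * (attn @ x - x)


# Usage: reuse the same layer for several steps, integrating the PDE in time
# (weight sharing across steps mirrors the continuous-dynamics view of GRAND).
x = torch.randn(2, 17, 64)                  # e.g. 17 joints as in Human3.6M
layer = GrandLayer(64)
for _ in range(4):
    x = layer(x)
```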

Details