KSOF: Leveraging kinematics and spatio-temporal optimal fusion for human motion prediction

Citation Data: Pattern Recognition, ISSN: 0031-3203, Vol: 161, Page: 111206

Publication Year: 2025

Article Description

One obstacle to human motion prediction is that ignoring kinematic laws produces improbable or impractical predictions. Current methods attempt to tackle this problem by using simple kinematic information as auxiliary features. However, it remains challenging to exploit human prior knowledge more deeply, for example that the trajectory formed by the same joint should be smooth and continuous. In this paper, we advocate describing kinematic information explicitly via velocity and acceleration, and propose a novel loss called joint point smoothness (JPS) loss, which calculates the acceleration of joints to smooth sudden changes in joint velocity. Capturing spatio-temporal dependencies to make feature representations more informative is a second obstacle in this task. We therefore propose a dual-path network (KSOF) that models temporal and spatial dependencies with a kinematic temporal convolutional network (K-TCN) and spatial graph convolutional networks (S-GCN), respectively. Moreover, we propose a novel multi-scale fusion module named spatio-temporal optimal fusion (SOF) to enhance the extraction of essential correlations and important features at different scales from spatio-temporal coupling features. We evaluate our approach on three standard benchmark datasets: Human3.6M, CMU-Mocap, and 3DPW. For both short-term and long-term predictions, our method achieves outstanding performance on all these datasets. The code is available at https://github.com/qukehua/KSOF.
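
The abstract describes the JPS loss only in words: it computes joint acceleration in order to smooth sudden changes in joint velocity. A minimal sketch of that idea follows, assuming velocity and acceleration are approximated by first- and second-order finite differences over frames and that the penalty is the mean L2 norm of the per-joint acceleration. The names `finite_difference` and `smoothness_penalty` are hypothetical, and the exact formula in the paper and in the linked repository may differ.

```python
from typing import List

# A pose sequence: frames x joints x 3 coordinates (assumed layout).
Sequence3D = List[List[List[float]]]


def finite_difference(seq: Sequence3D) -> Sequence3D:
    """Frame-to-frame difference: out[t][j][c] = seq[t + 1][j][c] - seq[t][j][c]."""
    return [
        [[b - a for a, b in zip(joint_now, joint_next)]
         for joint_now, joint_next in zip(frame_now, frame_next)]
        for frame_now, frame_next in zip(seq, seq[1:])
    ]


def smoothness_penalty(poses: Sequence3D) -> float:
    """Mean L2 norm of per-joint acceleration over a predicted sequence.

    Hypothetical stand-in for a JPS-style term, not the paper's definition.
    """
    if len(poses) < 3:
        raise ValueError("at least 3 frames are needed to estimate acceleration")
    velocity = finite_difference(poses)         # first-order difference
    acceleration = finite_difference(velocity)  # second-order difference
    norms = [
        sum(c * c for c in joint) ** 0.5
        for frame in acceleration
        for joint in frame
    ]
    return sum(norms) / len(norms)


# One joint moving at constant velocity along x: no acceleration to penalize.
smooth = [[[float(t), 0.0, 0.0]] for t in range(5)]
# The same joint oscillating between x = 0 and x = 1: velocity flips every frame.
jerky = [[[float(t % 2), 0.0, 0.0]] for t in range(5)]

print(smoothness_penalty(smooth), smoothness_penalty(jerky))
```

Under these assumptions, a trajectory with constant velocity incurs no penalty, while one whose velocity reverses between frames is penalized in proportion to the size of the change. In a training setup, such a term would typically be added to the position error with a weighting factor, which is likewise an assumption here.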

Bibliographic Details

DOI: 10.1016/j.patcog.2024.111206

URL ID: http://www.sciencedirect.com/science/article/pii/S0031320324009579; http://dx.doi.org/10.1016/j.patcog.2024.111206; http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=85210637640&origin=inward; https://linkinghub.elsevier.com/retrieve/pii/S0031320324009579

AUTHOR(S)

Rui Ding; Ke Hua Qu; Jin Tang

PUBLISHER(S)

Elsevier BV

TAG(S)

Computer Science
