TY - GEN
T1 - Generative Adversarial Network with In-Betweening for Fine-Grained Skeleton-Based Action Generation
AU - Qi, Xiangyuan
AU - He, Zhen
AU - Wang, Qiang
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Human action generation has shown critical value in both industry application and academic research, especially fine-grained skeleton-based action generation. At present, there are several main ways to achieve fine-grained skeleton-based action generation. In this work, we utilize a generative adversarial network with in-betweening, in order to generate natural and continuous skeleton sequences and better capture complex motion details. In-betweening can be viewed as a prediction task based on the past and future context. Besides, it is convenient to control the result in a personalized way and combine the framework with other network structures to further improve the performance. In particular, two simple discriminators are applied on different timescales and LSTM serves as the foundation of the action generator because of its capability of effectively capturing long-term dependencies and processing sequence data of different lengths. The target noise added to the input of the generator can contribute to the robustness of the system and further improve the performance. Moreover, a novel Tai Chi dataset composed of fine-grained skeleton-based action data is created using high quality motion capture technology. The model is evaluated on two fine-grained skeleton-based datasets, LaFAN1 and Tai Chi, which contain common daily actions and professional sports actions respectively, and it achieves superior performance both qualitatively and quantitatively.
AB - Human action generation has shown critical value in both industry application and academic research, especially fine-grained skeleton-based action generation. At present, there are several main ways to achieve fine-grained skeleton-based action generation. In this work, we utilize a generative adversarial network with in-betweening, in order to generate natural and continuous skeleton sequences and better capture complex motion details. In-betweening can be viewed as a prediction task based on the past and future context. Besides, it is convenient to control the result in a personalized way and combine the framework with other network structures to further improve the performance. In particular, two simple discriminators are applied on different timescales and LSTM serves as the foundation of the action generator because of its capability of effectively capturing long-term dependencies and processing sequence data of different lengths. The target noise added to the input of the generator can contribute to the robustness of the system and further improve the performance. Moreover, a novel Tai Chi dataset composed of fine-grained skeleton-based action data is created using high quality motion capture technology. The model is evaluated on two fine-grained skeleton-based datasets, LaFAN1 and Tai Chi, which contain common daily actions and professional sports actions respectively, and it achieves superior performance both qualitatively and quantitatively.
KW - Fine-grained skeleton-based action generation
KW - Generative adversarial network
KW - Tai Chi dataset
UR - https://www.scopus.com/pages/publications/105000892207
U2 - 10.1109/IECON55916.2024.10905961
DO - 10.1109/IECON55916.2024.10905961
M3 - 会议稿件
AN - SCOPUS:105000892207
T3 - IECON Proceedings (Industrial Electronics Conference)
BT - IECON 2024 - 50th Annual Conference of the IEEE Industrial Electronics Society, Proceedings
PB - IEEE Computer Society
T2 - 50th Annual Conference of the IEEE Industrial Electronics Society, IECON 2024
Y2 - 3 November 2024 through 6 November 2024
ER -