BaptisteChopin-2021-HumanMotionPredictionUsingManifoleAwareWasserteinGAN-IEEE

# Human Motion Prediction Using Manifold-Aware Wasserstein GAN #paper

1. paper-info

1.1 Metadata

Author:: [[Baptiste Chopin]], [[Naima Otberdout]], [[Mohamed Daoudi]], [[Angela Bartolo]]
作者机构::
Keywords:: #HMP , #GAN
Journal:: #IEEE
Date:: [[2021-07-16]]
状态:: #Done
链接::http://arxiv.org/abs/2105.08715
修改时间:: 2022.11.22

1.2. Abstract

Human motion prediction aims to forecast future human poses given a prior pose sequence. The discontinuity of the predicted motion and the performance deterioration in long-term horizons are still the main challenges encountered in current literature. In this work, we tackle these issues by using a compact manifold-valued representation of human motion. Specifically, we model the temporal evolution of the 3D human poses as trajectory, what allows us to map human motions to single points on a sphere manifold. To learn these non-Euclidean representations, we build a manifold-aware Wasserstein generative adversarial model that captures the temporal and spatial dependencies of human motion through different losses. Extensive experiments show that our approach outperforms the state-of-the-art on CMU MoCap and Human 3.6M datasets. Our qualitative results show the smoothness of the predicted motions.

2. Introduction

领域：human motion prediction
- 问题：时间维度，如何使得预测的动作序列更加顺滑？空间维度，预测的动作更加真实？
之前的方法：RNN-based、CNN-based、GNN-based
作者的方法：
1. 将人体动作当做轨迹，并且将这些轨迹映射到流行的单个紧凑点。能够使动作序列更加smoothness。
2. 由于传统的GNN无法处理流式数据，于是作者提出了a manifold-aware Wasserstein Generative Adversaial Network(WGAN)
3. 提出了多种损失函数来优化该网络，因为GAN很难train。

3. Methods

人体结构表示: 3D坐标
\(P_t=[x_1(t),y_1(t),z_1(t),…,x_k(t),y_k(t),z_k(t)]\): 某一时刻的姿势表示。
\(k\): 关节点数。
\(t\): 时刻
作者沿用[2]，将人体骨架序列看做每个关节点的轨迹，从而将人体动作预测问题看做：给定一定序列的连续帧，去预测下一帧的点对应的曲线。将曲线描述为\(\alpha(t)\) ，问题变为：给定\(\alpha(t)_{t=1,,,\tau}\),预测\(\alpha(t)_{t=\tau+1,,,T}\)
为了能够更好的学习和建模\(\alpha(t)\)，作者采用了square-root velocity function[1]，该方法能够将曲线映射到一个超球面，得到一个超平面内的点(不同行为标签对应的轨迹产生的点之间的距离是不同的)。该方法的定义如下：

\[q(t)=\frac{\dot{\alpha}(t)}{\sqrt{\|\dot{\alpha}(t)\|}} \]

\(||.||\)操作表示为欧几里得2范数。
该方法在已知\(\alpha(0)\)的情况下，可以通过以下function还原\(\alpha(t)\)：

\[\alpha(t)=\int_{0}^{t}\|q(s)\| q(s) d s+\alpha(0) \]

为了更好的处理，将这个超球面的半径设置为1.
\(\mathcal{C}=\{q: \mathcal{I}->\mathcal{R}^n| ||q||=1\} \subset \mathcal{L}^2(\mathcal{I}, \mathcal{R}^n)\) ：表示一个动作在超球面的映射。

由于该超空间不同于欧式空间，传统的GNN方法不能够处理该类数据，于是作者提出了新的GNN–PredictiveMA-WGAN

3.1 Network Architecture

Fig.1. 网络结构

Source:

PredictiveMA-WGAN由一个predictor\(\mathcal{G}\) 和discriminator\(\mathcal{D}\) 组成，对抗式学习。

During the training of these networks, we iteratively map the SRVF data back and forth to the tangent space using the exponential and the logarithm maps, defined in a particular point on the hypersphere.

在训练阶段，使用在超球面上的特定点中定义的指数和对数映射，以迭代的方式将SRVF数据来回映射到切线空间，然后输入到网络中。
\(\mathcal{G}\) ：
36864输出通道的全连接层。
5个上采样块(512， 256，128，64，1)，上采样块由nearest-neighbor upsampling跟着一个卷积块(3×3, stride 1)和ReLU function 构成。
\(\mathcal{D}\)：
3个下采样块(64,32,16输出通道)
Conv layer(3×3 stride 1)
batch normalization
ReLU

3.2 Loss Function

Adverasrial loss

\[\begin{aligned} \mathcal{L}_{a}=& \mathbb{E}_{q_{T} \sim \mathbb{P}_{q_{T}}}\left[\mathcal{D}\left(\log _{\mu}\left(q_{T}\right)\right)\right] \\ &-\mathbb{E}_{\mathcal{G}\left(\log _{\mu}\left(q_{\tau}\right)\right) \sim \mathbb{P}_{\boldsymbol{q}_{T}}}\left[\mathcal{D}\left(\log _{\mu}\left(\exp _{\mu}\left(\mathcal{G}\left(\log _{\mu}\left(q_{\tau}\right)\right)\right)\right)\right]\right.\\ &+\lambda \mathbb{E}_{\widetilde{q} \sim \mathbb{P}_{\widetilde{q}}}\left[\left(\left\|\nabla_{\widetilde{q}} \mathcal{D}(\widetilde{q})\right\|-1\right)^{2}\right], \end{aligned}\]

Reconstruction loss

\[\mathcal{L}_{r}=\left\|\log _{\mu}\left(\exp _{\mu}\left(\mathcal{G}\left(\log _{\mu}\left(q_{\tau}\right)\right)\right)\right)-\log _{\mu}\left(q_{T}\right)\right\|_{1} \]

Skeleton integrity loss

\[\mathcal{L}_{s}=\frac{1}{m} \frac{1}{\tau} \sum_{i=1}^{m} \sum_{t=1}^{\tau} \Delta\left(P_{i, t}, \hat{P}_{i, t}\right) \]

\[\Delta\left(G_{i}, G_{j}\right)=\operatorname{tr}\left(G_{i}\right)+\operatorname{tr}\left(G_{j}\right)-2 \sum_{i=1}^{3} \sigma_{i} \]

这里解释没有看懂，具体看论文。

Bone length loss

\[\mathcal{L}_{b}=\frac{1}{m} \frac{1}{\tau} \frac{1}{B} \sum_{i}^{m} \sum_{t=1}^{\tau} \sum_{j}^{B}\left\|b_{i, j, t}-\hat{b}_{i, j, t}\right\| \]

Bone Length loss

\[\mathcal{L}_{b}=\frac{1}{m} \frac{1}{\tau} \frac{1}{B} \sum_{i}^{m} \sum_{t=1}^{\tau} \sum_{j}^{B}\left\|b_{i, j, t}-\hat{b}_{i, j, t}\right\| \]

Global loss

\[\mathcal{L}=\beta_{1} \mathcal{L}_{a}+\beta_{2} \mathcal{L}_{r}+\beta_{3} \mathcal{L}_{s}+\beta_{4} \mathcal{L}_{b} \]

4. Experiments

dataset
- Human3.6m
- CMU Motion Capture
metric
- MPJPE

\[\operatorname{Err}=\sqrt{\frac{1}{\Delta t} \frac{1}{k} \sum_{t=\tau+1}^{\tau+\Delta t} \sum_{j=1}^{k}\left\|p_{t, j}-\hat{p}_{t, j}\right\|^{2}} \]

总结

与[2]的思路大致一致，将人体动作序列看过关节点的轨迹，在[2]中，利用DCT编码处理得到轨迹特征，然后利用GCN处理空间信息，在本文中，作者利用Square Root Velocity Function[1]将轨迹mapping 到一个超球体内，一条轨迹对应一点，然后将这些点投影到超球体的切线空间中，得到GAN网络模型的输入，最后训练GAN得到输出。

reference

[1] A. Srivastava, E. Klassen, S. H. Joshi and I. H. Jermyn, “Shape Analysis of Elastic Curves in Euclidean Spaces,” in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 33, no. 7, pp. 1415-1428, July 2011, doi: 10.1109/TPAMI.2010.184.
[2]Mao W, Liu M, Salzmann M, et al. Learning trajectory dependencies for human motion prediction[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019: 9489-9497.

原文地址：http://www.cnblogs.com/guixu/p/16922209.html

1. 本站所有资源来源于用户上传和网络，如有侵权请邮件联系站长！ 2. 分享目的仅供大家学习和交流，请务用于商业用途! 3. 如果你也有好源码或者教程，可以到用户中心发布，分享有积分奖励和额外收入！ 4. 本站提供的源码、模板、插件等等其他资源，都不包含技术服务请大家谅解！ 5. 如有链接无法下载、失效或广告，请联系管理员处理！ 6. 本站资源售价只是赞助，收取费用仅维持本站的日常运营所需！ 7. 如遇到加密压缩包，默认解压密码为"gltf",如遇到无法解压的请联系管理员！ 8. 因为资源和程序源码均为可复制品，所以不支持任何理由的退款兑现，请斟酌后支付下载声明：如果标题没有注明"已测试"或者"测试可用"等字样的资源源码均未经过站长测试.特别注意没有标注的源码不保证任何可用性

BaptisteChopin-2021-HumanMotionPredictionUsingManifoleAwareWasserteinGAN-IEEE

# Human Motion Prediction Using Manifold-Aware Wasserstein GAN #paper

1. paper-info

1.1 Metadata

1.2. Abstract

2. Introduction

3. Methods

3.1 Network Architecture

3.2 Loss Function

4. Experiments

总结

reference

排行榜展示

3D打印机glb模型下载-机械glb模型

树glb模型下载-树2

水稻glb模型下载-水稻1

变电箱1glb模型下载-机械glb模型

树glb模型下载-树3

模型

变电箱1glb模型下载-机械glb模型

树glb模型下载-树3

树glb模型下载-树2

水稻glb模型下载-水稻1

树glb模型下载-树1

3D打印机glb模型下载-机械glb模型