论文标题
学习复杂的3D人类自我接触
Learning Complex 3D Human Self-Contact
论文作者
论文摘要
三维人类自我接触的单眼估计对于详细的场景分析至关重要,包括肢体语言理解和行为建模。现有的3D重建方法不关注自我接触中的身体区域,因此恢复了彼此远离的配置或何时触摸的情况。这会导致感知上错误的估计值和限制在那些非常细粒度的分析域中的影响,在这些分析域中,详细的3D模型有望发挥重要作用。为了应对此类挑战,我们发现自我接触和设计3D损失以明确执行它。具体而言,我们开发了一个自我接触预测(SCP)的模型,该模型估计了自我接触的身体表面特征,在训练和推理过程中利用了自我接触的定位。我们收集两个大型数据集来支持学习和评估:(1)HumanSC3D,一个准确的3D运动捕获存储库,其中包含$ 1,032 $序列,$ 5,058 $接触事件和$ 1,246,487 $ 1,246,487地面真相3D在多个视图中收集的图像和(2)$ 3. $ 3.3d,repository $ 3.3d,3d $ 3.带有带注释的图像空间支撑的表面对应。我们还说明了如何在自我接触签名限制下恢复更具表现力的3D重建,并将面部接触的单眼检测作为通过更准确的自我接触模型所能成为可能的多个应用之一。
Monocular estimation of three dimensional human self-contact is fundamental for detailed scene analysis including body language understanding and behaviour modeling. Existing 3d reconstruction methods do not focus on body regions in self-contact and consequently recover configurations that are either far from each other or self-intersecting, when they should just touch. This leads to perceptually incorrect estimates and limits impact in those very fine-grained analysis domains where detailed 3d models are expected to play an important role. To address such challenges we detect self-contact and design 3d losses to explicitly enforce it. Specifically, we develop a model for Self-Contact Prediction (SCP), that estimates the body surface signature of self-contact, leveraging the localization of self-contact in the image, during both training and inference. We collect two large datasets to support learning and evaluation: (1) HumanSC3D, an accurate 3d motion capture repository containing $1,032$ sequences with $5,058$ contact events and $1,246,487$ ground truth 3d poses synchronized with images collected from multiple views, and (2) FlickrSC3D, a repository of $3,969$ images, containing $25,297$ surface-to-surface correspondences with annotated image spatial support. We also illustrate how more expressive 3d reconstructions can be recovered under self-contact signature constraints and present monocular detection of face-touch as one of the multiple applications made possible by more accurate self-contact models.