Microsoft Research Asia
Email: chunyu.wangdlut[at]gmail[dot]com
[Bio] [Publication]
[Google Scholar] [GitHub]
I am a researcher in Microsoft Research Asia. I work on computer vision problems with special focus on single and multiple camera 3D human pose estimation and tracking, 3D clothed human reconstruction and action analysis.
[06/2023] I will serve as Area Chair for CVPR 2024.
[03/2023] Two papers accepted by CVPR 2023.
[03/2023] I will serve as Area Chair for Neurips 2023.
[09/2022] I will serve as Area Chair for CVPR 2023.
[09/2022] One paper accepted by Neurips 2022.
[07/2022] Four papers accepted by ECCV 2022.
[03/2022] VoxelTrack (multi-camera 3D pose tracking) accepted by T-PAMI 2022.
[03/2022] One paper accepted by CVPR 2022.
[01/2022] Invited as an Area Chair for ICPR 2022.
[11/2021] One paper on attention for video understanding accepted by NeurIPS 2021.
[10/2021] One paper on semi-superivsed 2D pose estimation accepted by ICCV 2021.
[08/2021] FairMOT accepted by IJCV 2021.
Multiple View Geometry Transformers for 3D Human Pose Estimation,
Ziwei Liao, Jialiang Zhu, Chunyu Wang, Han Hu, Steven L. Waslander.
Arxiv, 2023.
[PDF]
[Code]
V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection,
Yichao Shen, Zigang Geng, Yuhui Yuan, Yutong Lin, Ze Liu, Chunyu Wang, Han Hu, Nanning Zheng, Baining Guo.
Arxiv, 2023.
[PDF]
[Code]
Unsupervised Hierarchical Grouping for Graphic Design Layout with Bootstrapped Transformers,
Jialiang Zhu, Danqing Huang, Chunyu Wang, Mingxi Cheng, Ji Li, Han Hu, Xin Geng, Baining Guo.
WACV, 2024.
[PDF]
[Code]
All in Tokens: Unifying Output Space of Visual Tasks via Soft Token,
Jia Ning, Chen Li, Zheng Zhang, Chunyu Wang, Zigang Geng, Qi Dai, Kun He, Han Hu.
ICCV, 2023.
[PDF]
[Project]
Category-Specific Prompts for Animal Action Recognition with Pretrained Vision-Language Models,
Yinuo Jing, Chunyu Wang, Ruxu Zhang, Kongming Liang, Zhanyu Ma.
ACM MM, 2023.
[PDF]
[Project]
Human Pose as Compositional Tokens,
Zigang Geng, Chunyu Wang, Xiyuan Wei, Ze Liu, Houqiang Li, Han Hu.
CVPR, 2023.
[PDF]
[Project]
[Blog (in Chinese)]
3D Human Mesh Estimation from Virtual Markers,
Xiaoxuan Ma, Jiajun Su, Chunyu Wang, Wentao Zhu, Yizhou Wang.
CVPR, 2023.
[PDF]
[Code]
MMPTRACK: Large-scale Densely Annotated Multi-camera Multiple People Tracking Benchmark,
Xiaotian Han, Quanzeng You, Chunyu Wang, Zhizheng Zhang, Peng Chu, Houdong Hu, Jiang Wang, Zicheng Liu.
WACV, 2023.
[PDF]
You Never Stop Dancing: Non-freezing Dance Generation via Bank-constrained Manifold Projection,
Jiangxin Sun^, Chunyu Wang, Huang Hu, Hanjiang Lai, Zhi Jin, Jianfang Hu.
Neurips, 2022.
[PDF]
[CODE]
Virtual Pose: Learning Generalizable 3D Human Pose Models from Virtual Data,
Jiajun Su^, Chunyu Wang, Xiaoxuan Ma^, Wenjun Zeng, Yizhou Wang.
ECCV, 2022.
[PDF]
[CODE]
Faster VoxelPose: Real-time 3D Human Pose Estimation by Orthographic Projection,
Hang Ye^, Wentao Zhu^, Chunyu Wang, Rujie Wu, Yizhou Wang.
ECCV, 2022.
[PDF]
[CODE]
Robust Multi-Object Tracking by Marginal Inference,
Yifu Zhang^, Chunyu Wang, Xinggang Wang, Wenjun Zeng, Wenyu Liu.
ECCV, 2022.
[PDF]
[CODE]
One-Shot Medical Landmark Localization by Edge-Guided Transform and Noisy Landmark Refinement,
Zihao Yin^, Ping Gong, Chunyu Wang, Yizhou Yu, Yizhou Wang.
ECCV, 2022.
[PDF]
[CODE]
VoxelTrack: Multi-Person 3D Human Pose Estimation and Tracking in the Wild,
Yifu Zhang^, Chunyu Wang, Xinggang Wang, Wenyu Liu, Wenjun Zeng.
T-PAMI, 2022.
[PDF]
Correlation-Aware Deep Tracking,
Fei Xie^, Chunyu Wang, Guangting Wang, Yue Cao, Wankou Yang, Wenjun Zeng.
CVPR, 2022.
[PDF]
Relational Self-Attention: What’s Missing in Attention for Video Understanding,
Manjin Kim^, Heeseung Kwon, Chunyu Wang, Suha Kwak, Minsu Cho.
NeurIPS, 2021.
[PDF]
[Project Page]
FairMOT: On the Fairness of Detection and Re-Identification in Multiple Object Tracking,
Yifu Zhang^, Chunyu Wang, Xinggang Wang, Wenjun Zeng, Wenyu Liu.
IJCV, 2021.
[PDF]
[CODE]
Learning Tracking Representations via Dual-Branch Fully Transformer Networks,
Fei Xie^, Chunyu Wang, Guangting Wang, Wankou Yang, Wenjun Zeng.
ICCVw, 2021.
[PDF]
An Empirical Study of the Collapsing Problem in Semi-Supervised 2D Human Pose Estimation,
Rongchang Xie^, Chunyu Wang, Wenjun Zeng, Yizhou Wang.
ICCV, 2021.
[PDF]
[CODE]
Context Modeling in 3D Human Pose Estimation: A Unified Perspective,
Xiaoxuan Ma^, Jiajun Su^, Chunyu Wang, Hai Ci, Yizhou Wang.
CVPR, 2021.
[PDF]
[CODE]
Neighborhood Geometric Structure Preserving Variational Auto-Encoder for Smooth and Bounded Data Sources,
Xingyu Chen^, Chunyu Wang, Xuguang Lan, Nanning Zheng, Wenjun Zeng.
TNNLS, 2021.
[PDF]
VoxelPose: Towards Multi-Camera 3D Human Pose Estimation in Wild Environment,
Hanyue Tu^, Chunyu Wang, Wenjun Zeng.
ECCV, 2020.
[PDF]
[CODE]
[MMPose Implementation]
AdaFuse: Adaptive Multiview Fusion for Accurate Human Pose Estimation in the Wild,
Zhe Zhang^, Chunyu Wang, Weichao Qiu, Wenhu Qin, Wenjun Zeng.
IJCV, 2020.
[PDF]
[CODE]
Locally Connected Network for Monocular 3D Human Pose Estimation,
Hai Ci^, Xiaoxuan Ma^, Chunyu Wang, Yizhou Wang.
T-PAMI, 2020.
[PDF]
[CODE]
Fusing Wearable IMUs with Multi-View Images for Human Pose Estimation: A Geometric Approach,
Zhe Zhang^, Chunyu Wang, Wenhu Qin, Wenjun Zeng.
CVPR, 2020.
[PDF]
[CODE]
MetaFuse: A Pre-trained Fusion Model for Human Pose Estimation,
Rongchang Xie^, Chunyu Wang, Yizhou Wang.
CVPR, 2020.
paper.
[PDF]
[CODE]
Semantic Image Segmentation by Scale-Adaptive Networks,
Zilong Huang^, Chunyu Wang, Xinggang Wang, Wenyu Liu, Jingdong Wang.
TIP, 2019.
[PDF]
[CODE]
Optimizing Network Structure for 3D Human Pose Estimation,
Hai Ci^, Chunyu Wang, Xiaoxuan Ma, Yizhou Wang.
ICCV, 2019.
[PDF]
[CODE]
Cross View Fusion for 3D Human Pose Estimation,
Haibo Qiu^, Chunyu Wang, Jingdong Wang, Naiyan Wang, Wenjun Zeng.
ICCV, 2019.
[PDF]
[CODE]
Object detection in videos by high quality object linking,
Peng Tang^, Chunyu Wang, Xinggang Wang, Wenyu Liu, Wenjun Zeng, Jingdong Wang.
T-PAMI, 2019.
[PDF]
Learning Basis Representation to Refine 3D Human Pose Estimations,
Chunyu Wang, Haibo Qiu, Alan L. Yuille, Wenjun Zeng.
AAAI, 2019.
[PDF]
Robust 3d human pose estimation from single images or video sequences,
Chunyu Wang, Yizhou Wang, Zhouchen Lin, Alan L. Yuille.
T-PAMI, 2018.
[PDF]
Online Dictionary Learning for Approximate Archetypal Analysis,
Jieru Mei^, Chunyu Wang, Wenjun Zeng.
ECCV, 2018.
[PDF]
Video object segmentation by learning location-sensitive embeddings,
Hai Ci^, Chunyu Wang, Yizhou Wang.
ECCV, 2018.
[PDF]
Learning discriminative activated simplices for action recognition,
Chenxu Luo, Chang Ma, Chunyu Wang, Yizhou Wang.
AAAI, 2017.
[PDF]
Mining 3D Key-Pose-Motifs for Action Recognition,
Chunyu Wang, Yizhou Wang, Alan L. Yuille.
CVPR, 2016.
[PDF]
Recognizing actions in 3d using action-snippets and activated simplices,
Chunyu Wang, John Flynn, Yizhou Wang, Alan L. Yuille.
AAAI, 2016.
[PDF]
Robust Estimation of 3D Human Poses from a Single Image,
Chunyu Wang, Yizhou Wang, Zhouchen Lin, Alan L. Yuille, Wen Gao.
CVPR, 2014.
[PDF]
An approach to pose-based action recognition,
Chunyu Wang, Yizhou Wang, Alan L. Yuille.
CVPR, 2013.
[PDF]
(^ students mentored by me at Microsoft.)