Hi, this is Chuxin Wang.
Biography
Hi! I am a researcher at the Wan team, Tongyi Lab, Alibaba Group. My research primarily focuses on multimodal large models, video generation, and image generation.
Previously, I received my PhD degree from the Department of Automation at University of Science and Technology of China, advised by Prof. Weiren Wu and Tianzhu Zhang. During my PhD, my research covered spatial intelligence and foundation models.
I interned with the Visual Computing Group at Microsoft Research Asia (2020-2021) and Deep Space Exploration Laboratory (2024-2025).
News 🔥
- Feb. 2026: GeoGuide: Hierarchical Geometric Guidance for Open-Vocabulary 3D Semantic Segmentation was accepted by CVPR 2026.
- Feb. 2026: ComPose: A Unified Completion-Pose Framework with Geometric Relation Modeling for Category-Level Object Pose Estimation was accepted by CVPR 2026.
- Jan. 2026: ScaleDepth: Decomposing Metric Depth Estimation into Scale Prediction and Relative Depth Estimation was accepted by TCSVT 2026.
- Dec. 2025: PointChain: Learning Generalizable Point Cloud Representations via Structural Chain Modeling was accepted by AAAI 2026.
- Dec. 2025: State Space Models for Long-Term Temporal Context in 3D Single Object Tracking was accepted by TCSVT 2025.
- Aug. 2025: ER-Depth: Enhancing the Robustness of Self-Supervised Monocular Depth Estimation in Challenging Scenes was accepted by TOMM 2025.
- Jul. 2025: SA3Det++: Side-Aware Quality Estimation for Semi-Supervised 3D Object Detection was accepted by TPAMI 2025.
- Jun. 2025: StruMamba3D: Exploring Structural Mamba for Self-supervised Point Cloud Representation Learning was accepted by ICCV 2025.
- Jun. 2025: Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis was accepted by ICCV 2025.
- May 2025: Exploring Vision Semantic Prompt for Efficient Point Cloud Understanding was accepted by ICML 2025.
- May 2025: Exploring Semantic Masked Autoencoder for Self-supervised Point Cloud Understanding was accepted by IJCAI 2025.
- Jan. 2025: State Space Model Meets Transformer: A New Paradigm for 3D Object Detection was accepted by ICLR 2025.
- Dec. 2024: RMAE3D: Rethinking Masked Representation Learning for 3D Point Cloud Understanding was accepted by TIP 2024.
- Aug. 2024: Pamba: Enhancing Global Interaction in Point Clouds via State Space Model was accepted by AAAI 2025.
- Oct. 2023: Not Every Side Is Equal: Localization Uncertainty Estimation for Semi-Supervised 3D Object Detection was accepted by ICCV 2023.
- Oct. 2023: Query Refinement Transformer for 3D Instance Segmentation was accepted by ICCV 2023.
- Apr. 2023: SE-ORNet: Self-Ensembling Orientation-aware Network for Unsupervised Point Cloud Shape Correspondence was accepted by CVPR 2023.
- May. 2023: Long-Short Range Adaptive Transformer With Dynamic Sampling for 3D Object Detection was accepted by TCSVT 2023.
- Mar. 2021: Style-based Point Generator with Adversarial Rendering for Point Cloud Completion was accepted by CVPR 2021.
Experiences 📖
Publications
* denotes Equal Contribution. + denotes First Student Author. More publications can be found in Google Scholar.
![]() | StruMamba3D: Exploring Structural Mamba for Self-supervised Point Cloud Representation Learning Chuxin Wang, Yixin Zha, Wenfei Yang, Tianzhu Zhang ICCV 2025 |
![]() | State Space Model Meets Transformer: A New Paradigm for 3D Object Detection Chuxin Wang, Wenfei Yang, Xiang Liu, Tianzhu Zhang ICLR 2025 |
![]() | Rethinking Masked Representation Learning for 3D Point Cloud Understanding Chuxin Wang, Yixin Zha, Jianfeng He, Wenfei Yang, Tianzhu Zhang TIP 2024 |
![]() | Not Every Side Is Equal: Localization Uncertainty Estimation for Semi-Supervised 3D Object Detection Chuxin Wang, Wenfei Yang, Tianzhu Zhang ICCV 2023 |
![]() | Long-short Range Adaptive Transformer with Dynamic Sampling for 3D Object Detection Chuxin Wang, Jiacheng Deng, Jianfeng He, Tianzhu Zhang, Zhe Zhang, Yongdong Zhang TCSVT 2023 |
![]() | PointChain: Learning Generalizable Point Cloud Representations via Structural Chain Modeling Luyang Wang*, Chuxin Wang*, Qiao Li, Tianzhu Zhang AAAI 2026 |
![]() | Exploring Semantic Masked Autoencoder for Self-supervised Point Cloud Understanding Yixin Zha*, Chuxin Wang*, Wenfei Yang, Tianzhu Zhang IJCAI 2025 |
![]() | SE-ORNet: Self-Ensembling Orientation-aware Network for Unsupervised Point Cloud Shape Correspondence Jiacheng Deng*, Chuxin Wang*, Jiahao Lu, Jianfeng He, Tianzhu Zhang, Jiyang Yu, Zhe Zhang CVPR 2023 |
![]() | Style-based Point Generator with Adversarial Rendering for Point Cloud Completion Chulin Xie*, Chuxin Wang*, Bo Zhang, Hao Yang, Dong Chen, Fang Wen CVPR 2021 |
![]() | Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis Bowen Zhang, Sicheng Xu, Chuxin Wang, Jiaolong Yang, Feng Zhao, Dong Chen, Baining Guo ICCV 2025 |
![]() | SA3Det++: Side-Aware Quality Estimation for Semi-Supervised 3D Object Detection Wenfei Yang, Chuxin Wang+, Tianzhu Zhang, Yongdong Zhang, Feng Wu TPAMI 2025 |
![]() | GeoGuide: Hierarchical Geometric Guidance for Open-Vocabulary 3D Semantic Segmentation Xujing Tao, Chuxin Wang, Yubo Ai, Zhixin Cheng, Zhuoyuan Li, Yujia Chen, Xinjun Li, Qiao Li, Wenfei Yang, Tianzhu Zhang CVPR 2026 |
![]() | ComPose: A Unified Completion-Pose Framework with Geometric Relation Modeling for Category-Level Object Pose Estimation Huan Ren, Yihan Chen, Chuxin Wang, Nailong Liu, Wenfei Yang, Tianzhu Zhang CVPR 2026 |
![]() | Exploring Vision Semantic Prompt for Efficient Point Cloud Understanding Yixin Zha, Chuxin Wang, Wenfei Yang, Xiang Liu, Tianzhu Zhang, Feng Wu ICML 2025 |
![]() | QRT3D: Query Refinement Transformer for 3D Instance Segmentation Jiahao Lu, Jiacheng Deng, Chuxin Wang, Jianfeng He, Tianzhu Zhang ICCV 2023 |
![]() | Pamba: Enhancing Global Interaction in Point Clouds via State Space Model Zhuoyuan Li, Yubo Ai, Jiahao Lu, Chuxin Wang, Jiacheng Deng, Hanzhi Chang, Yanzhe Liang, Wenfei Yang, Shifeng Zhang, Tianzhu Zhang AAAI 2025 |
![]() | State Space Models for Long-Term Temporal Context in 3D Single Object Tracking Jie Xiao, Yinchao Ma, Yuyang Tang, Chuxin Wang, Tianzhu Zhang TCSVT 2025 |
![]() | ScaleDepth: Decomposing Metric Depth Estimation into Scale Prediction and Relative Depth Estimation Ruijie Zhu, Chuxin Wang, Ziyang Song, Li Liu, Tianzhu Zhang, Yongdong Zhang TCSVT 2026 |
![]() | ER-Depth: Enhancing the Robustness of Self-Supervised Monocular Depth Estimation in Challenging Scenes. Ziyang Song*, Ruijie Zhu*, Chuxin Wang, Jiacheng Deng, Jianfeng He, Tianzhu Zhang TOMM 2025 |
Academic Services
Conference Reviewer
- IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- International Conference on Computer Vision (ICCV)
- European Conference on Computer Vision (ECCV)
- Conference on Neural Information Processing Systems (NeurIPS)
- Association for the Advancement of Artificial Intelligence (AAAI)
- International Conference on Learning Representations (ICLR)
- International Conference on Machine Learning (ICML)
- International Conference on Artificial Intelligence and Statistics (AISTATS)
Journal Reviewer
- IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
- IEEE Transactions on Image Processing (TIP)
- IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)

















