Sixiao Zheng (郑思晓)

Generative AI Researcher

Fudan University & Shanghai Innovation Institute

I am currently a third-year Ph.D. student in the joint training program between Fudan University and Shanghai Innovation Institute, advised by Prof. Yanwei Fu. Previously, I received my Master's degree from Fudan University in 2021, co-advised by Prof. Jianfeng Feng and Prof. Yanwei Fu. I obtained my Bachelor's degree from South China Normal University in 2018, under the supervision of Prof. Jiajia Chen. From 2021 to 2023, I worked as a Researcher at Tencent Hunyuan. Currently, I am a Research Intern (Tencent Project Up) at Tencent ARC Lab, working with Dr. Wenbo Hu. My research interests include Image/Video Generation, Object Detection, and Image Segmentation. Currently, I am focusing on Generative World Models.

Sixiao Zheng

News

2026.01

Released new paper "VerseCrafter" on arXiv.

2025.07

Released new paper "TriVLA" on arXiv.

2025.07

One paper accepted to ACM MM 2025.

2025.05

Joined Tencent ARC Lab as Research Intern.

2025.02

One paper accepted to CVPR 2025.

2025.02

Released new paper "VidCRAFT3" on arXiv.

2024.12

One paper accepted to AAAI 2025.

2024.09

Started Joint-Training at Shanghai Innovation Institute.

2024.07

One paper accepted to IJCV 2024.

2024.06

Joined Huawei Noah's Ark Lab as Research Intern.

2024.02

Released new paper "Intelligent Director" on arXiv.

2022.04

Released new paper "HunYuan_tvr" on arXiv.

2022.02

One paper accepted to IEEE TAI 2022.

2021.07

Joined Tencent Hunyuan as Researcher.

2021.06

Obtained Master's degree from Fudan University.

2021.04

One paper accepted to ICMR 2021.

2021.03

One paper accepted to CVPR 2021.

2020.10

One paper accepted to ICPR 2020.

2020.06

Joined Tencent YouTu Lab as Research Intern.

Education

Fudan Logo

Fudan University

2023 - 2027 (Expected)
Ph.D. in Statistics
Shanghai Innovation Institute Logo

Shanghai Innovation Institute

2024 - 2027 (Expected)
Doctoral Joint-Training Program
Fudan Logo

Fudan University

2018 - 2021
M.S. in Computer Application
SCNU Logo

South China Normal University

2014 - 2018
B.E. in Communication Engineering

Selected Publications

For the full list, please visit my Google Scholar.
arXiv 2026

VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control

Sixiao Zheng, Minghao Yin, Wenbo Hu, Xiaoyu Li, Ying Shan, Yanwei Fu
arXiv preprint, 2026
arXiv 2025
TriVLA Paper

TriVLA: A Triple-System-Based Unified Vision-Language-Action Model with Episodic World Modeling for General Robot Control

Zhenyang Liu, Yongchong Gu, Sixiao Zheng, Yanwei Fu, Xiangyang Xue, Yu-Gang Jiang
arXiv preprint, 2025
ACM MM 2025
SpatialReasoner Paper

A Neural Representation Framework with LLM-Driven Spatial Reasoning for Open-Vocabulary 3D Visual Grounding

Zhenyang Liu, Sixiao Zheng, Siyu Chen, Cairong Zhao, Longfei Liang, Xiangyang Xue, Yanwei Fu
ACM MM 2025
arXiv 2025

VidCRAFT3: Camera, Object, and Lighting Control for Image-to-Video Generation

Sixiao Zheng, Zimian Peng, Yanpeng Zhou, Yi Zhu, Hang Xu, Xiangru Huang, Yanwei Fu
arXiv preprint, 2025
CVPR 2025
ReasonGrounder Paper

ReasonGrounder: LVLM-Guided Hierarchical Feature Splatting for Open-Vocabulary 3D Visual Grounding and Reasoning

Zhenyang Liu, Yikai Wang, Sixiao Zheng, Tongying Pan, Longfei Liang, Yanwei Fu, Xiangyang Xue
CVPR 2025
AAAI 2025
ContextualStory Paper

ContextualStory: Consistent Visual Storytelling with Spatially-Enhanced and Storyline Context

Sixiao Zheng, Yanwei Fu
AAAI 2025
arXiv 2024
ID Paper

Intelligent Director: An Automatic Framework for Dynamic Visual Composition using ChatGPT

Sixiao Zheng, Jingyang Huo, Yu Wang, Yanwei Fu
arXiv preprint, 2024
IJCV 2024
HLG Paper

Vision Transformers: From Semantic Segmentation to Dense Prediction

Li Zhang*, Jiachen Lu*, Sixiao Zheng*, Xinxuan Zhao, Xiatian Zhu, Yanwei Fu, Tao Xiang, Jianfeng Feng, Philip H.S. Torr
IJCV 2024
arXiv 2022
HunYuan_tvr Paper

HunYuan_tvr for Text-Video Retrieval

Shaobo Min, Weijie Kong, Rong-Cheng Tu, Dihong Gong, Chengfei Cai, Wenzhe Zhao, Chenyang Liu, Sixiao Zheng, Hongfa Wang, Zhifeng Li, Wei Liu
arXiv preprint, 2022
IEEE TAI 2022
EVT-kmeans Paper

Clustering by the Probability Distributions from Extreme Value Theory

Sixiao Zheng, Ke Fan, Yanxi Hou, Jianfeng Feng, Yanwei Fu
IEEE Transactions on Artificial Intelligence 2022
CVPR 2021
SETR Paper

Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers

Sixiao Zheng, Jiachen Lu, Hengshuang Zhao, Xiatian Zhu, Zekun Luo, Yabiao Wang, Yanwei Fu, Jianfeng Feng, Tao Xiang, Philip H.S. Torr, Li Zhang
CVPR 2021 4600+ Citations
ICMR 2021
NMS-Loss Paper

NMS-Loss: Learning with Non-Maximum Suppression for Crowded Pedestrian Detection

Zekun Luo, Zheng Fang, Sixiao Zheng, Yabiao Wang, and Yanwei Fu
ICMR 2021
ICPR 2020
IZSD Paper

Incrementally Zero-Shot Detection by an Extreme Value Analyzer

Sixiao Zheng, Yanwei Fu, and Yanxi Hou
ICPR 2020 (Oral)

Experience

Tencent ARC Lab Logo

Tencent ARC Lab

2025.05 – Present
Research Intern (Tencent Project Up)
Huawei Noah's Ark Lab Logo

Huawei Noah's Ark Lab

2024.06 – 2025.04
Research Intern
Tencent Hunyuan Logo

Tencent Hunyuan

2021.07 – 2023.08
Researcher
Tencent YouTu Lab Logo

Tencent YouTu Lab

2020.06 – 2021.01
Research Intern
Duoyi Network Logo

Duoyi Network

2018.06 – 2018.08
Research Intern

Honors & Awards

2024 Fudan University Outstanding Student
2022 Tencent Business Breakthrough Award
2022 Tencent TEG SEVP Award for Technical Breakthrough
2018 Outstanding Graduate, School of Physics and Telecommunication Engineering, SCNU
2017 Mathematical Contest in Modeling (MCM) 2nd Prize
2017 5th Teddy Cup Data Mining Challenge 2nd Prize
2017 8th Blue Bridge Cup Guangdong Java 2nd Prize
2016 Mathematical Contest in Modeling (MCM) 2nd Prize