Biography

I am a second year Ph.D. Candidate at the School of Computer Science, Fudan University, where I work at Vision and Learning Lab(FVL) under the supervision of Prof. Yu-Gang Jiang(IEEE Fellow) and Prof. Zuxuan Wu. Before this, I received my BS degree from TianJin University.

My research interests lie broadly in computer vision and deep learning. I mainly focus on video understanding, especially video generation, editing and recognition. I am also open and willing to explore other vision tasks, e.g., AIGC, 3D understanding. If you are interested in my topics, please do not hestitae to reach out. See details about me in CV.

News

  • [Feb’2024] Invited talk at Openmmlab about Video Generation Models, [slides].
  • [Feb’2024] SimDA is accepted by CVPR 2024.
  • [Dec’2023] Serve as a reviewer for ICML 2024.
  • [Dec’2023] Invited talk at Kunlun Research, “A Survey on Video Diffusion Models”.
  • [Aug’2023] Awarded a certificate of “Star of Tomorrow” at MSRA.
  • [May’2023] One paper is accepted by ACL 2023.
  • [Feb’2023] Two papers are accepted by CVPR 2023.
  • [July’2022] Three papers are accepted by ECCV 2022.
  • [June’2022] One paper is accepted by ACM’MM 2022.
  • [Apr’2022] One paper is accepted to ACM ICMR 2022.
  • [Mar’2022] Start my internship at MicroSoft Research Asia(MSRA).
  • [Jan’2022] One paper is accepted by DASFAA 2022.

Selected Publications:

A Survey on Video Diffusion Models
Zhen Xing, Qijun Feng, Haoran Chen, Qi Dai, Han Hu, Hang Xu, Zuxuan Wu, Yu-Gang Jiang
Under Review of ACM Computing Survey (CSUR), 2023
[Paper][HomePage][Zhihu]
Surveying 100+ recent literatures on video generation and editing with diffusion models.
SimDA: A Simple Diffusion Adapter for Efficient Video Generation
Zhen Xing, Qi Dai, Han Hu, Zuxuan Wu, Yu-Gang Jiang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[Paper][HomePage]
VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models
Zhen Xing, Qi Dai, Zihao Zhang, Hui Zhang, Han Hu, Zuxuan Wu, Yu-Gang Jiang
arXiv preprint, 2023
[Paper][HomePage][Zhihu]
SVFormer: Semi-supervised Video Transformer for Action Recognition
Zhen Xing, Qi Dai, Han Hu, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
[Paper][Code]
PanoSwin: a Pano-style Swin Transformer for Panorama Understanding
Zhixin Ling, Zhen Xing, Manliang Cao, Xiangdong Zhou
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
[Paper][Code]
Semi-supervised Single-view 3D Reconstruction via Prototype Shape Priors
Zhen Xing, Hengduo Li, Zuxuan Wu, Yu-Gang Jiang
European Conference on Computer Vision (ECCV), 2022
[Paper][Code]
Few-shot Single-view 3D Reconstruction with Memory Prior Contrastive Network
Zhen Xing, Yijiang Chen, Zhixin Ling, Xiangdong Zhou, Yu Xiang
European Conference on Computer Vision (ECCV), 2022
[Paper][Code]
Conditional Stroke Recovery for Fine-Grained Sketch-Based Image Retrieval
Zhixin Ling, Zhen Xing,Jian Zhou, Xiangdong Zhou
European Conference on Computer Vision (ECCV), 2022
[Paper][Code]