Shoufa Chen

I am a Ph.D. student at the Department of Computer Science, The University of Hong Kong (HKU), advised by Prof. Ping Luo.

Before that, I obtained my bachelor degree from Huazhong University of Science and Technology (HUST), advised by Prof. Xinggang Wang.

Email  /  Google Scholar  /  Github


Research

GenTron: Diffusion Transformers for Image and Video Generation
Shoufa Chen*, Mengmeng Xu*, Jiawei Ren, Yuren Cong, Sen He, Yanping Xie, Animesh Sinha, Ping Luo, Tao Xiang, Juan-Manuel Perez-Rua
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
Paper / Website

FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Yuren Cong, Mengmeng Xu, Christian Simon, Shoufa Chen, Jiawei Ren, Yanping Xie, Juan-Manuel Perez-Rua, Bodo Rosenhahn, Tao Xiang, Sen He
International Conference on Learning Representations (ICLR) , 2024
Paper / Code / Website

DiffusionDet: Diffusion Model for Object Detection
Shoufa Chen, Peize Sun, Yibing Song, Ping Luo
International Conference on Computer Vision (ICCV), 2023 ( oral )
Best Paper Initial List (17/8260, 0.2%)
Paper / Code

VLPart: Going Denser with Open-Vocabulary Part Segmentation
Peize Sun, Shoufa Chen, Chenchen Zhu, Fanyi Xiao, Ping Luo, Saining Xie, Zhicheng Yan
International Conference on Computer Vision (ICCV), 2023
Paper / Code

CycleMLP: A MLP-like Architecture for Dense Visual Predictions
Shoufa Chen, Enze Xie, Chongjian Ge, Runjian Chen, Ding Liang, Ping Luo
IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI), 2023
Paper / Code

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
Shilong Zhang*, Peize Sun*, Shoufa Chen*, Min Xiao, Wenqi Shao ,Wenwei Zhang, Kai Chen, Ping Luo
arxiv preprint, July, 2023 (* denotes equal contribution)
Paper / Code / Online Demo

InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Zhaoyang Liu*, Yinan He*, Wenhai Wang*, Weiyun Wang*, Yi Wang*, Shoufa Chen*, Qinglong Zhang*, Zeqiang Lai*, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, Limin Wang, Ping Luo, Jifeng Dai, Yu Qiao
arxiv preprint, May, 2023 (* denotes equal contribution)
Paper / Code / Online Demo

Soft Neighbors Are Positive Supporters in Contrastive Visual Representation Learning
Chongjian Ge, Jiangliu Wang, Zhan Tong, Shoufa Chen, Yibing Song, and Ping Luo
International Conference on Learning Representations (ICLR), 2023
Paper / Code

AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
Shoufa Chen*, Chongjian Ge*, Zhan Tong, Jiangliu Wang, Yibing Song, Jue Wang, Ping Luo
Neural Information Processing Systems (NeurIPS), 2022 (* denotes equal contribution)
Paper / Project / Code

CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer
Yao Mu, Shoufa Chen, Mingyu Ding, Jianyu Chen, Runjian Chen, Ping Luo
International Conference on Machine Learning (ICML), 2022
Paper / Project / Code

CycleMLP: A MLP-like Architecture for Dense Prediction
Shoufa Chen, Enze Xie, Chongjian Ge, Runjian Chen, Ding Liang, Ping Luo
International Conference on Learning Representations (ICLR), 2022 ( oral )
Paper / Code

Watch Only Once: An End-to-End Video Action Detection Framework
Shoufa Chen, Peize Sun, Enze Xie, Chongjian Ge, Jiannan Wu, Lan Ma, Jiajun Shen, Ping Luo
International Conference on Computer Vision (ICCV), 2021
Paper / Code


Honors and Awards
  • Hong Kong PhD Fellowship Scheme, 2021-2025
  • China National Scholarship, 2017, 2018

Academic Service
    Conference Reviewer for CVPR, ICCV, NeurIPS, ICLR, ICML