Xiefan Guo

Ph.D. Student @ Beihang University (BUAA)
Research Intern @ Alibaba TongYi Lab
Email: guoxiefan@gmail.com

I am a Ph.D. student at the School of Computer Science and Engineering, Beihang University (BUAA), supervised by Prof. Di Huang. My research interests include generative AI, image and video synthesis, and diffusion models.

Before that, I received my master's degree from the School of Computer Science and Engineering, Beihang University (BUAA), China, in Jan. 2023 and my B.E. degree from the College of Intelligence and Computing, Tianjin University (TJU), China, in Jul. 2020. I have interned at Alibaba DAMO Academy and Alibaba TongYi Lab.

News
  • 2024.06: We release the official PyTorch implementation of I4VGen.
  • 2024.05: Invited talk at Alibaba Cloud on InitNO, CVPR 2024.
  • 2024.02: Two papers, on diffusion models (InitNO) and scene graph generation (DRM), accepted to CVPR 2024 (Seattle, USA).
  • 2023.09: I started my internship at Alibaba TongYi Lab.
  • 2022.03: One paper on photo retouching (ABPN) accepted to CVPR 2022 (New Orleans, USA).
  • 2021.09: I was awarded the China National Scholarship.
  • 2021.07: One paper on image inpainting (CTSDG) accepted to ICCV 2021 (Montreal, Canada).
Recent Publications

See the full list on my Publications page and on Google Scholar.

  • I4VGen: Image as Free Stepping Stone for Text-to-Video Generation
    Xiefan Guo, Jinlin Liu, Miaomiao Cui, Liefeng Bo, Di Huang
    In: arXiv:2406.02230, Technical Report, 2024
    [arXiv:2406.02230] [Code] [Project]
  • Disentangling Foreground and Background Motion for Enhanced Realism in Human Video Generation
    Jinlin Liu, Kai Yu, Mengyang Feng, Xiefan Guo, Miaomiao Cui
    In: arXiv:2405.16393, Technical Report, 2024
    [arXiv:2405.16393] [Code] [Project]
  • InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
    Xiefan Guo, Jinlin Liu, Miaomiao Cui, Jiankai Li, Hongyu Yang, Di Huang
    In: The 42nd IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 17-21, 2024, Seattle, USA
    [Paper] [arXiv:2404.04650] [Code] [Project]
  • Leveraging Predicate and Triplet Learning for Scene Graph Generation
    Jiankai Li, Yunhong Wang, Xiefan Guo, Ruijie Yang, Weixin Li
    In: The 42nd IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 17-21, 2024, Seattle, USA
    [Paper] [arXiv:2406.02038] [Code]
  • DreaMoving: A Human Video Generation Framework based on Diffusion Models
    Mengyang Feng, Jinlin Liu, Kai Yu, Yuan Yao, Zheng Hui, Xiefan Guo, Xianhui Lin, Haolan Xue, Chen Shi, Xiaowen Li, Aojie Li, Xiaoyang Kang, Biwen Lei, Miaomiao Cui, Peiran Ren, Xuansong Xie
    In: arXiv:2312.05107, Technical Report, 2023
    [arXiv:2312.05107] [Code] [Project] [ModelScope] [Hugging Face]