Xiefan Guo

Ph.D. Student
Beihang University (BUAA)
Email: guoxiefan@gmail.com

I am a Ph.D. student at the School of Computer Science and Engineering, Beihang University (BUAA), supervised by Prof. Di Huang. My research interests focus on Generative AI, Image and Video Synthesis and Diffusion models.

Before that, I received my master degree from the School of Computer Science and Engineering, Beihang University (BUAA), China, in Jan. 2023 and B.E degree from the College of Inteligence and Computing, Tianjin University (TJU), China, in Jul. 2020. I interned at Alibaba DAMO Academy and Alibaba TongYi Lab.

News
  • 2025.02: One paper (Diffusion-4K) is accepted by CVPR 2025.
  • 2024.06: We release the official PyTorch implementation of I4VGen.
  • 2024.05: Invited talk at Alibaba Cloud on InitNO, CVPR 2024.
  • 2024.02: Two papers (InitNO and DRM) are accepted by CVPR 2024.
  • 2023.09: I start my intern at Alibaba TongYi Lab.
  • 2022.03: One paper (ABPN) is accepted by CVPR 2022.
  • 2021.09: I obtain China National Scholarship.
  • 2021.07: One paper (CTSDG) is accepted by ICCV 2021.
Publications

See the full list on my Publications and Google Scholar.

  • Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models
    Jinjin Zhang, Qiuyu Huang, Junjie Liu, Xiefan Guo, Di Huang
    The 43nd IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 11-25, 2025, Nashville, USA
    [Paper] [Code] [Dataset]
  • I4VGen: Image as Free Stepping Stone for Text-to-Video Generation
    Xiefan Guo, Jinlin Liu, Miaomiao Cui, Liefeng Bo, Di Huang
    arXiv:2406.02230, Technical Report, 2024
    [Paper] [Code] [Project]
  • Disentangling Foreground and Background Motion for Enhanced Realism in Human Video Generation
    Jinlin Liu, Kai Yu, Mengyang Feng, Xiefan Guo, Miaomiao Cui
    arXiv:2405.16393, Technical Report, 2024
    [Paper] [Code] [Project]
  • InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
    Xiefan Guo, Jinlin Liu, Miaomiao Cui, Jiankai Li, Hongyu Yang, Di Huang
    The 42nd IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 17-21, 2024, Seattle, USA
    [Paper] [arXiv] [Code] [Project] [Slide] [Poster]
  • Leveraging Predicate and Triplet Learning for Scene Graph Generation
    Jiankai Li, Yunhong Wang, Xiefan Guo, Ruijie Yang, Weixin Li
    The 42nd IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 17-21, 2024, Seattle, USA
    [Paper] [arXiv] [Code]
  • DreaMoving: A Human Video Generation Framework based on Diffusion Models
    Mengyang Feng, Jinlin Liu, Kai Yu, Yuan Yao, Zheng Hui, Xiefan Guo, Xianhui Lin, Haolan Xue, Chen Shi, Xiaowen Li, Aojie Li, Xiaoyang Kang, Biwen Lei, Miaomiao Cui, Peiran Ren, Xuansong Xie
    arXiv:2312.05107, Technical Report, 2023
    [Paper] [Code] [Project] [ModelScope] [Hugging Face]
  • ABPN: Adaptive Blend Pyramid Network for Real-Time Local Retouching of Ultra High-Resolution Photo
    Biwen Lei, Xiefan Guo, Hongyu Yang, Miaomiao Cui, Xuansong Xie, Di Huang
    The 40th IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 19-24, 2022, New Orleans, USA
    [Paper] [ModelScope] [Dataset]
  • Image Inpainting via Conditional Texture and Structure Dual Generation
    Xiefan Guo, Hongyu Yang, Di Huang
    The 18th IEEE International Conference on Computer Vision (ICCV), Oct. 11-17, 2021, Montreal, Canada (Virtual)
    [Paper] [arXiv] [Code] [Slide] [Poster]