Publications

See the full list on my Google Scholar profile.

2025

  • Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models
    Jinjin Zhang, Qiuyu Huang, Junjie Liu, Xiefan Guo, Di Huang
    In: The 43rd IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 11-15, 2025, Nashville, USA
    [Paper] [Code] [Dataset]

2024

  • I4VGen: Image as Free Stepping Stone for Text-to-Video Generation
    Xiefan Guo, Jinlin Liu, Miaomiao Cui, Liefeng Bo, Di Huang
    In: arXiv:2406.02230, Technical Report, 2024
    [Paper] [Code] [Project]
  • Disentangling Foreground and Background Motion for Enhanced Realism in Human Video Generation
    Jinlin Liu, Kai Yu, Mengyang Feng, Xiefan Guo, Miaomiao Cui
    In: arXiv:2405.16393, Technical Report, 2024
    [Paper] [Code] [Project]
  • InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
    Xiefan Guo, Jinlin Liu, Miaomiao Cui, Jiankai Li, Hongyu Yang, Di Huang
    In: The 42nd IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 17-21, 2024, Seattle, USA
    [Paper] [arXiv] [Code] [Project] [Slide] [Poster]
  • Leveraging Predicate and Triplet Learning for Scene Graph Generation
    Jiankai Li, Yunhong Wang, Xiefan Guo, Ruijie Yang, Weixin Li
    In: The 42nd IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 17-21, 2024, Seattle, USA
    [Paper] [arXiv] [Code]

2023

  • DreaMoving: A Human Video Generation Framework based on Diffusion Models
    Mengyang Feng, Jinlin Liu, Kai Yu, Yuan Yao, Zheng Hui, Xiefan Guo, Xianhui Lin, Haolan Xue, Chen Shi, Xiaowen Li, Aojie Li, Xiaoyang Kang, Biwen Lei, Miaomiao Cui, Peiran Ren, Xuansong Xie
    In: arXiv:2312.05107, Technical Report, 2023
    [Paper] [Code] [Project] [ModelScope] [Hugging Face]

2022

  • ABPN: Adaptive Blend Pyramid Network for Real-Time Local Retouching of Ultra High-Resolution Photo
    Biwen Lei, Xiefan Guo, Hongyu Yang, Miaomiao Cui, Xuansong Xie, Di Huang
    In: The 40th IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 19-24, 2022, New Orleans, USA
    [Paper] [ModelScope] [Dataset]

2021

  • Image Inpainting via Conditional Texture and Structure Dual Generation
    Xiefan Guo, Hongyu Yang, Di Huang
    In: The 18th IEEE International Conference on Computer Vision (ICCV), Oct. 11-17, 2021, Montreal, Canada (Virtual)
    [Paper] [arXiv] [Code] [Slide] [Poster]