ICCV 2025 Workshop (HiGen), Oct 19
The rapid evolution of generative AI has reshaped content creation across images, video, and 3D/4D visuals. This workshop focuses on cutting-edge methodologies, practical applications, and open challenges in image/video/3D/4D generation and related editing tasks with an emphasis on flexible and friendly human interactions and multi-modal control signals. This workshop will serve as a platform for researchers and practitioners to discuss key topics related to visual content creation and editing with versatile interactions.
| Time | Event | Speaker |
|---|---|---|
| 09:00-09:10 | Introduction and Opening Remarks | Organizers |
| 09:10-09:40 | Invited Talk 1: Efficient Image and Video Generation with Diffusion Models and Acceleration | Enze Xie |
| 09:40-10:10 | Invited Talk 2: The Full Stack Creator: A Unified Stack for Editable and Dynamic 3D Worlds | Yanpei Cao |
| 10:10-10:25 | Coffee Break | - |
| 10:25-10:55 | Invited Talk 3: Multimodal and Interactive Visual Generation towards World Models | Xihui Liu |
| 10:55-11:25 | Invited Talk 4: Towards Unified Generative Models for Image and Video Editing | Soo Ye Kim |
| 11:25-11:55 | Invited Talk 5: LongLive: Real-time Interactive Long Video Generation | Yukang Chen |
| 11:55-12:15 | Poster Session | - |
| 12:15-13:00 | Lunch Break | - |
| 13:00-13:30 | Best Paper Talk | TBD |
| 13:30-14:00 | Invited Talk 6: From controlling, prompting to real-time interactive video generation | Jimei Yang |
| 14:00-14:30 | Invited Talk 7: Customizing Text-to-Image Diffusion Models | Nupur Kumari |
| 14:30-14:40 | Coffee Break | - |
| 14:40-15:15 | Oral Paper Presentations | - |
| 15:15-15:45 | Invited Talk 8: Diffusion Transformers with Representation Autoencoders | Saining Xie |
| 15:45-16:15 | Invited Talk 9: Generative Reconstruction and Distillation of Human-Object Interactions | Jiajun Wu |
| 16:15-16:45 | Invited Talk 10: BAGEL: The Open-Source Unified Multimodal Model | Haoqi Fan |
| 16:45-17:15 | Invited Talk 11: Audio-visual generation fueled by video learning | Kristen Grauman |
| 17:15-17:30 | Closing Remarks | Organizers |
Topics of interest include, but are not limited to:
Submissions are expected to present original, unpublished work and should be more than 4 pages long (up to 8 pages, including figures and tables but excluding references). Accepted papers will be published in the official ICCV workshop proceedings.
Important Dates (All deadlines are 23:59 UTC)
The Non-Proceedings Track offers a more flexible avenue for presenting a wider range of contributions without page limits. Accepted papers will NOT be published in the official ICCV workshop proceedings, but they can be presented and promoted as oral or poster presentations at our workshop. Submissions include but are not limited to:
Important Dates (All deadlines are 23:59 UTC)
Contact us at: xichen.csai@gmail.com