Skip to main navigation Skip to search Skip to main content

Invert Your Prompt: Editing-Aware Diffusion Inversion

  • Yangyang Xu*
  • , Wenqi Shao
  • , Yong Du
  • , Haiming Zhu
  • , Yang Zhou
  • , Jiayuan Xie
  • , Ping Luo
  • , Shengfeng He*
  • *Corresponding author for this work
  • Harbin Institute of Technology Shenzhen
  • Shanghai Artificial Intelligence Laboratory
  • Ocean University of China
  • Singapore Management University
  • South China University of China
  • Hong Kong Polytechnic University
  • The University of Hong Kong

Research output: Contribution to journalArticlepeer-review

Abstract

Recent advancements in text-guided diffusion models have enabled powerful image manipulation capabilities. However, balancing reconstruction fidelity and editability for real images remains a significant challenge. In this work, we introduce Editing Inversion (EditInv), a novel framework that inverts and edits real images for specific editing tasks by optimizing specific prompt embeddings within the extended P∗ space. By leveraging distinct embeddings across different U-Net layers and time steps, EditInv seamlessly integrates inversion and editing through reciprocal optimization, ensuring both high fidelity and precise editability. This hierarchical editing mechanism classifies tasks into structure, appearance, and global edits, optimizing only those embeddings that are unaffected by the current editing task. Extensive experiments on benchmark datasets demonstrate EditInv’s superior performance over existing methods, delivering both quantitative and qualitative improvements while showcasing its versatility with a few-step diffusion model.

Original languageEnglish
Article number163
JournalInternational Journal of Computer Vision
Volume134
Issue number4
DOIs
StatePublished - Apr 2026
Externally publishedYes

Keywords

  • Diffusion Inversion
  • Disentanglement
  • Image Editing

Fingerprint

Dive into the research topics of 'Invert Your Prompt: Editing-Aware Diffusion Inversion'. Together they form a unique fingerprint.

Cite this