Edit Everything: A Text-Guided Generative System for Images Editing

Defeng Xie, Ruichen Wang, Jian Ma, Chen Chen, Haonan Lu, Dong Yang, Fobo Shi, Xiaodong Lin
2023-04-27
Abstract: We introduce a new generative system called Edit Everything, which can take image and text inputs and produce image outputs. Edit Everything allows users to edit images using simple text instructions. Our system designs prompts to guide the visual module in generating requested images. Experiments demonstrate that Edit Everything facilitates the implementation of the visual aspects of Stable Diffusion with the use of the Segment Anything model and CLIP. Our system is publicly available at https://github.com/DefengXie/Edit_Everything.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper targets four issues in text-guided image editing:

1. **Improving the quality of image generation**: Existing image generation systems are limited in how realistic their outputs are. The paper proposes a new generative system, Edit Everything, which combines text guidance with existing visual models (Stable Diffusion, the Segment Anything Model (SAM), and CLIP) with the aim of producing higher-quality images.
2. **Making image editing efficient**: Traditional image editing often requires professional skills and considerable time. Edit Everything lets users edit images through simple text instructions, which makes editing faster and more convenient.
3. **Supporting complex editing tasks**: Existing systems may struggle to produce satisfactory results for complex edits. Edit Everything handles such tasks by breaking them into several small steps and replacing target objects one step at a time.
4. **Supporting Chinese scenarios**: The system is tuned for Chinese use cases: its CLIP and Stable Diffusion models are specially trained to understand Chinese text prompts and to generate images that fit the Chinese context.

Together, these changes are intended to improve generation quality and to give researchers and practitioners a practical editing tool.
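The step-by-step replacement of target objects described above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that each image segment (as a SAM-like model might produce) carries an embedding, that a CLIP-like text encoder maps a prompt into the same space, and that the segment most similar to the source prompt is the one handed to the generator. The names `best_segment`, `edit_in_steps`, `embed_text`, and `generate` are hypothetical, and the three-dimensional vectors are toy stand-ins for real embeddings.

```python
import math
from typing import Callable, Dict, List, Sequence, Tuple

Vector = Sequence[float]
Segment = Dict[str, object]  # hypothetical shape: {"label": str, "vec": Vector}


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def best_segment(segment_vecs: List[Vector], prompt_vec: Vector) -> int:
    """Index of the segment whose embedding best matches the prompt embedding."""
    scores = [cosine(v, prompt_vec) for v in segment_vecs]
    return max(range(len(scores)), key=lambda i: scores[i])


def edit_in_steps(
    segments: List[Segment],
    steps: List[Tuple[str, str]],
    embed_text: Callable[[str], Vector],
    generate: Callable[[Segment, str], Segment],
) -> List[Segment]:
    """Apply (source_prompt, target_prompt) edits one at a time.

    Each step selects the segment closest to the source prompt and replaces
    it with whatever `generate` returns for the target prompt.
    """
    edited = [dict(s) for s in segments]  # leave the caller's list untouched
    for source_prompt, target_prompt in steps:
        idx = best_segment([s["vec"] for s in edited], embed_text(source_prompt))
        edited[idx] = generate(edited[idx], target_prompt)
    return edited


# Toy stand-ins (assumptions): a real system would use model outputs here.
TOY_TEXT_VECS = {"dog": (1.0, 0.0, 0.0), "sky": (0.0, 1.0, 0.0)}
TOY_SEGMENTS = [
    {"label": "dog", "vec": (0.9, 0.1, 0.0)},
    {"label": "sky", "vec": (0.1, 0.9, 0.0)},
    {"label": "grass", "vec": (0.0, 0.1, 0.9)},
]


def toy_embed_text(prompt: str) -> Vector:
    return TOY_TEXT_VECS[prompt]


def toy_generate(segment: Segment, target_prompt: str) -> Segment:
    # Placeholder for an inpainting call: only the label changes.
    return {"label": target_prompt, "vec": segment["vec"]}


result = edit_in_steps(
    TOY_SEGMENTS,
    [("dog", "cat"), ("sky", "sunset")],
    toy_embed_text,
    toy_generate,
)
print([s["label"] for s in result])  # ['cat', 'sunset', 'grass']
```

Splitting a complex request into (source, target) pairs keeps each step a single-object replacement, so one poor result can be retried without redoing the other edits.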