CSGO: Content-Style Composition in Text-to-Image Generation
Official 🤗 Gradio demo for CSGO: Content-Style Composition in Text-to-Image Generation.
How to use:
- Upload a content image if you want to use image-driven style transfer.
- Upload a style image.
- Sets the type of task to perform, by default image-driven style transfer is performed. Options are Image-driven style transfer, Text-driven style synthesis, and Text editing-driven style synthesis.
- If you choose a text-driven task, enter your desired prompt.
- If you don't provide a prompt, the default is to use the BLIP model to generate the caption. We suggest that by providing detailed prompts for Content images, CSGO is able to effectively guarantee content.
- Click the Submit button to begin customization.
- Share your stylized photo with your friends and enjoy! 😊
Advanced usage:
- Click advanced options.
- Choose different guidance and steps.
0 2
0 2
1 15
1 4
5 100
-1000000 2147483647
Examples
| Content Image (optional) | Style Image | task | Prompt | Content Scale | Style Scale | guidance scale | Seed Value |
|---|
📝 Tips In CSGO, the more accurate the text prompts for content images, the better the content retention. Text-driven style synthesis and text-edit-driven style synthesis are expected to be more stable in the next release.
📝 Citation
If our work is helpful for your research or applications, please cite us via:
@article{xing2024csgo,
title={CSGO: Content-Style Composition in Text-to-Image Generation},
author={Peng Xing and Haofan Wang and Yanpeng Sun and Qixun Wang and Xu Bai and Hao Ai and Renyuan Huang and Zechao Li},
year={2024},
journal = {arXiv 2408.16766},
}
📧 Contact
If you have any questions, please feel free to open an issue or directly reach us out at xingp_ng@njust.edu.cn.