A collection of awesome text-to-image generation studies.
-
Updated
Dec 25, 2025 - TeX
A collection of awesome text-to-image generation studies.
[CVPR 2021] Multi-Modal-CelebA-HQ: A Large-Scale Text-Driven Face Generation and Understanding Dataset
Official PyTorch codes for the paper: "ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation"
[NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models
Official Pytorch repo of CVPR'23 and NeurIPS'23 papers on understanding replication in diffusion models.
Code for ACM MM'23 paper: LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation
[CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models
[ECCV'24] T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models
[ECCV 2024] Official repository of ECCV 2024 paper: Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models
[ECCV 2024 - Oral] Official PyTorch Implementation of "Adversarial Robustification via Text-to-Image Diffusion Models"
Codebase for the paper ImPoster: Text and Frequency Guidance for Subject Driven Action Personalization using Diffusion Models
DreamBooth Text to Image AI Stable Diffusion
An empirical study investigating whether Stable Diffusion XL produces genuinely novel visual concepts — or merely recombines patterns from its training data.
🤗 Generate images with diffusion models.
Text to Image app with Stable Diffusion Pipeline and tkinter as its UI
Smart and Simple Flux for GPU-poor
A curated list of AI image generation APIs, SDKs, and tools including text-to-image, image editing, diffusion models, generative art systems, and multimodal AI platforms. Covers commercial services, open source models with APIs, and scalable infrastructure for developers building visual applications.
Personalized & controllable image generation with your favorite characters and style!
Curate and access a comprehensive list of AI image generation APIs, SDKs, and tools designed for developer integration and production use.
🚀 Simplify FLUX access with 4-bit models and CPU offloading, enabling powerful GPU utilization and prompt generation from keywords with DeepSeek-R1 AI.
Add a description, image, and links to the text-to-image-diffusion topic page so that developers can more easily learn about it.
To associate your repository with the text-to-image-diffusion topic, visit your repo's landing page and select "manage topics."