FreeFuse enables seamless multi-subject composition by fusing multiple LoRAs without additional training or user-defined masks.
- Release basic inference code
- Release inference code for controlnet, ipadapter and redux
- Supports ComfyUI
- Supports Z-Image-turbo in diffusers
- Supports Z-Image-turbo in ComfyUI
- Supports FLUX.2-klein-4B/9B in diffusers
- Supports FLUX.2-klein-4B/9B in ComfyUI
- Support ControlNet+FreeFuse in ComfyUI
- Exploring FreeFuse in text2video model
This paper proposes FreeFuse, a training-free framework for multi-subject text-to-image generation through automatic fusion of multiple subject LoRAs. In contrast to prior studies that focus on retraining LoRA to alleviate feature conflicts, our analysis reveals that simply spatially confining the subject LoRA's output to its target region and preventing other LoRAs from directly intruding into this area is sufficient for effective mitigation. Accordingly, we implement Adaptive Token-Level Routing during the inference phase. We introduce FreeFuseAttn, a mechanism that exploits the flow matching model's intrinsic semantic alignment to dynamically match subject-specific tokens to their corresponding spatial regions at early denoising timesteps, thereby bypassing the need for external segmentors. FreeFuse distinguishes itself through high practicality: it necessitates no additional training, model modifications, or user-defined masks spatial conditions. Users need only provide subject activation words to achieve seamless integration into standard workflows. Extensive experiments validate that FreeFuse outperforms existing approaches in both identity preservation and compositional fidelity.
Qualitative results on Flux.dev. FreeFuse achieves superior identity preservation and compositional fidelity.
Additional comparisons demonstrating FreeFuse's effectiveness on SDXL.
Overview of FreeFuse architecture. FreeFuseAttn dynamically routes subject-specific LoRAs to their corresponding spatial regions via adaptive token-level routing, enabling training-free multi-subject LoRA fusion.
# use comfy-cli
pip install comfy-cli
comfy node install freefuse
# or manual installation
cd <your ComfyUI path>/custom_nodes
git clone --depth 1 --branch comfyui https://github.com/yaoliliu/FreeFuse.git freefuse_comfyuiYou can find workflows for flux.dev and SDXL in the freefuse_comfyui/workflows folder. (^▽^)
You just need clone this repo and use uv to install dependencies.
apt-get update && apt-get install -y git-lfs && git lfs install
git clone https://github.com/yaoliliu/FreeFuse.git
git lfs pull
# or using huggingface if lfs budget is used up
hf download lsmpp/freefuse_example_loras --local-dir=./loras
uv venv
source .venv/bin/activate
uv syncwithout background identification
python main_freefuse_flux.pywith background identification and style LoRA
python main_freefuse_flux_bg_except.pywith redux
python redux_freefuse_flux_bg_except.pywith ipadapter
python ipadapter_freefuse_flux_bg_except.pywith controlnet
python controlnet_freefuse_flux_bg_except.pyIf you running out of your vram, you can set the quantization variable in the script to 'True' to enable quantization, which will reduce the vram usage.
python main_freefuse_sdxl.pypython main_freefuse_z_image.pyThanks for Tekemo and jrewingwannabe947's ZIT LoRAs, which are used in our Z-Image-turbo examples.
FreeFuse solves feature conflicts between multiple subject LoRAs without additional training in flux1.dev, sdxl, z-image and more.




