Skip to content

trinity-graphics/dream-in-style

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Dream-in-Style: Text-to-3D Generation using Stylized Score Distillation

     

Hubert Kompanowski, Binh-Son Hua
International Conference on 3D Vision (3DV) 2025


Dream-in-Style Teaser

Installation

(tested on Linux)

Prerequisites

  1. CUDA 12.1

    Our code is tested on CUDA 12.1. Follow the official installation guide.

  2. Conda

    If you don't have conda installed, follow the official instructions.

Setup Steps

  1. Clone Threestudio

    git clone https://github.com/threestudio-project/threestudio.git
  2. Clone Dream-in-Style to "custom" directory

    cd threestudio/custom
    git clone https://github.com/hubert-kompanowski/dream-in-style.git
  3. Create Conda environment and install dependencies

    cd dream-in-style
    conda create --name "threestudio" python=3.11 -y
    conda activate threestudio
    
    # Install PyTorch with CUDA support
    pip install torch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 xformers --index-url https://download.pytorch.org/whl/cu121
    
    # Install remaining dependencies
    pip install ninja
    pip install -r requirements.txt

Usage Guide

Running Stylized 3D Generation

We provide example configuration files that can be modified to suit your specific needs.

From the threestudio root directory, run:

python launch.py --config custom/dream-in-style/configs/toy-car-fire-style-nfsd.yaml --train --gpu 0

Configuration Options

The configuration files provide extensive customization options for both 3D generation and style application:

  • Content Customization

    • prompt_processor.prompt: Change this to generate different 3D objects (e.g., "a toy car", "a vase", "a chair")
    • guidance.style_image_path: Path to your reference style image (e.g., "custom/dream-in-style/images/a_fire_on_a_black_background.png")
  • Style Control Parameters

    • guidance.style_ratio_scheduler_type: Type of style strength scheduling ("constant", "linear", "sqrt", "quadratic")
    • guidance.style_ratio_start_scale: Initial style strength (0.0-1.0)
    • guidance.style_ratio_end_scale: Final style strength (0.0-1.0)

Note: The method was tested on Nvidia RTX 4090 GPU with 24GB VRAM. If you encounter OOM issues, consider lowering data.width, data.height, or reducing num_samples_per_ray in the config file.

Acknowledgement

This work was conducted with the financial support of the Research Ireland Centre for Research Training in Digitally Enhanced Reality (d-real) under Grant No. 18/CRT/6224. For the purpose of Open Access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission.

This project is supported by Research Ireland under the Research Ireland Frontiers for the Future Programme, award number 22/FFP-P/11522.

Our implementation builds upon threestudio and the Visual-Style-Prompting framework from NAVER AI. We thank the authors for making their code publicly available.

Citation

Please consider citing our paper if our code is useful:

@inproceedings{kompanowski2024dreaminstyle,
    author    = {Hubert Kompanowski and Binh-Son Hua},
    title     = {Dream-in-Style: Text-to-3D Generation using Stylized Score Distillation},
    booktitle = {International Conference on 3D Vision},
    year      = {2025},
}

About

Dream-in-Style: Text-to-3D Generation using Stylized Score Distillation (3DV 2025)

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages