Structured 3D Latents
for Scalable and Versatile 3D Generation

Installation Steps

Clone the repo:

git clone --recurse-submodules https://github.com/MackinationsAi/Window_Trellis.git
cd Window_Trellis

Install the dependencies:

Simply run the oc_install.bat file

oc_install.bat

Running Gradio WebUI

After installing the dependencies, you can run the app by simply running the boot_window_trellis.bat file:

boot_window_trellis.bat

Structured 3D Latents
for Scalable and Versatile 3D Generation

TRELLIS is a large 3D asset generation model. It takes in text or image prompts and generates high-quality 3D assets in various formats, such as Radiance Fields, 3D Gaussians, and meshes. The cornerstone of TRELLIS is a unified Structured LATent (SLAT) representation that allows decoding to different output formats and Rectified Flow Transformers tailored for SLAT as the powerful backbones. We provide large-scale pre-trained models with up to 2 billion parameters on a large 3D asset dataset of 500K diverse objects. TRELLIS significantly surpasses existing methods, including recent ones at similar scales, and showcases flexible output format selection and local 3D editing capabilities which were not offered by previous models.

Check out our Project Page for more videos and interactive demos!

🌟 Features

High Quality: It produces diverse 3D assets at high quality with intricate shape and texture details.
Versatility: It takes text or image prompts and can generate various final 3D representations including but not limited to Radiance Fields, 3D Gaussians, and meshes, accommodating diverse downstream requirements.
Flexible Editing: It allows for easy editings of generated 3D assets, such as generating variants of the same object or local editing of the 3D asset.

⏩ Updates

02/01/2025

Simple Windows installation & updated Gradio UI.

12/26/2024

Release TRELLIS-500K dataset and toolkits for data preparation.

12/18/2024

Implementation of multi-image conditioning for TRELLIS-image model. (#7). This is based on tuning-free algorithm without training a specialized model, so it may not give the best results for all input images.
Add Gaussian export in app.py and example.py. (#40)

🚧 TODO List

Release inference code and TRELLIS-image-large model
Release dataset and dataset toolkits
Release TRELLIS-text model series
Release training code

📦 Installation

Prerequisites

System: For windows setup.
Hardware: An NVIDIA GPU with at least 16GB of memory is necessary. The code has been verified on NVIDIA A100 and A6000 GPUs.
Software:
- The CUDA Toolkit is needed to compile certain submodules. The code has been tested with CUDA versions 11.8 and 12.2.
- Conda is recommended for managing dependencies.
- Python version 3.8 or higher is required.

🤖 Pretrained Models

We provide the following pretrained models:

Model	Description	#Params	Download
TRELLIS-image-large	Large image-to-3D model	1.2B	Download
TRELLIS-text-base	Base text-to-3D model	342M	Coming Soon
TRELLIS-text-large	Large text-to-3D model	1.1B	Coming Soon
TRELLIS-text-xlarge	Extra-large text-to-3D model	2.0B	Coming Soon

The models are hosted on Hugging Face. You can directly load the models with their repository names in the code:

TrellisImageTo3DPipeline.from_pretrained("JeffreyXiang/TRELLIS-image-large")

If you prefer loading the model from local, you can download the model files from the links above and load the model with the folder path (folder structure should be maintained):

TrellisImageTo3DPipeline.from_pretrained("/path/to/TRELLIS-image-large")

💡 Usage

Minimal Example

Here is an example of how to use the pretrained models for 3D asset generation.

import os
# os.environ['ATTN_BACKEND'] = 'xformers'   # Can be 'flash-attn' or 'xformers', default is 'flash-attn'
os.environ['SPCONV_ALGO'] = 'native'        # Can be 'native' or 'auto', default is 'auto'.
                                            # 'auto' is faster but will do benchmarking at the beginning.
                                            # Recommended to set to 'native' if run only once.

import imageio
from PIL import Image
from trellis.pipelines import TrellisImageTo3DPipeline
from trellis.utils import render_utils, postprocessing_utils

# Load a pipeline from a model folder or a Hugging Face model hub.
pipeline = TrellisImageTo3DPipeline.from_pretrained("JeffreyXiang/TRELLIS-image-large")
pipeline.cuda()

# Load an image
image = Image.open("assets/example_image/T.png")

# Run the pipeline
outputs = pipeline.run(
    image,
    seed=1,
    # Optional parameters
    # sparse_structure_sampler_params={
    #     "steps": 12,
    #     "cfg_strength": 7.5,
    # },
    # slat_sampler_params={
    #     "steps": 12,
    #     "cfg_strength": 3,
    # },
)
# outputs is a dictionary containing generated 3D assets in different formats:
# - outputs['gaussian']: a list of 3D Gaussians
# - outputs['radiance_field']: a list of radiance fields
# - outputs['mesh']: a list of meshes

# Render the outputs
video = render_utils.render_video(outputs['gaussian'][0])['color']
imageio.mimsave("sample_gs.mp4", video, fps=30)
video = render_utils.render_video(outputs['radiance_field'][0])['color']
imageio.mimsave("sample_rf.mp4", video, fps=30)
video = render_utils.render_video(outputs['mesh'][0])['normal']
imageio.mimsave("sample_mesh.mp4", video, fps=30)

# GLB files can be extracted from the outputs
glb = postprocessing_utils.to_glb(
    outputs['gaussian'][0],
    outputs['mesh'][0],
    # Optional parameters
    simplify=0.95,          # Ratio of triangles to remove in the simplification process
    texture_size=1024,      # Size of the texture used for the GLB
)
glb.export("sample.glb")

# Save Gaussians as PLY files
outputs['gaussian'][0].save_ply("sample.ply")

After running the code, you will get the following files:

sample_gs.mp4: a video showing the 3D Gaussian representation
sample_rf.mp4: a video showing the Radiance Field representation
sample_mesh.mp4: a video showing the mesh representation
sample.glb: a GLB file containing the extracted textured mesh
sample.ply: a PLY file containing the 3D Gaussian representation

📚 Dataset

We provide TRELLIS-500K, a large-scale dataset containing 500K 3D assets curated from Objaverse(XL), ABO, 3D-FUTURE, HSSD, and Toys4k, filtered based on aesthetic scores. Please refer to the dataset README for more details.

⚖️ License

TRELLIS models and the majority of the code are licensed under the MIT License. The following submodules may have different licenses:

diffoctreerast: We developed a CUDA-based real-time differentiable octree renderer for rendering radiance fields as part of this project. This renderer is derived from the diff-gaussian-rasterization project and is available under the LICENSE.
Modified Flexicubes: In this project, we used a modified version of Flexicubes to support vertex attributes. This modified version is licensed under the LICENSE.

📜 Citation

If you find this work helpful, please consider citing our paper:

@article{xiang2024structured,
    title   = {Structured 3D Latents for Scalable and Versatile 3D Generation},
    author  = {Xiang, Jianfeng and Lv, Zelong and Xu, Sicheng and Deng, Yu and Wang, Ruicheng and Zhang, Bowen and Chen, Dong and Tong, Xin and Yang, Jiaolong},
    journal = {arXiv preprint arXiv:2412.01506},
    year    = {2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
assets		assets
dataset_toolkits		dataset_toolkits
extensions/vox2seq		extensions/vox2seq
trellis		trellis
utils		utils
.gitignore		.gitignore
.gitmodules		.gitmodules
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
DATASET.md		DATASET.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
SUPPORT.md		SUPPORT.md
app.py		app.py
boot_window_trellis.bat		boot_window_trellis.bat
example.py		example.py
example_multi_image.py		example_multi_image.py
oc_install.bat		oc_install.bat
setup.sh		setup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Installation Steps

Running Gradio WebUI

Structured 3D Latents
for Scalable and Versatile 3D Generation

🌟 Features

⏩ Updates

🚧 TODO List

📦 Installation

Prerequisites

🤖 Pretrained Models

💡 Usage

Minimal Example

📚 Dataset

⚖️ License

📜 Citation

About

Uh oh!

Releases 1

Packages

Languages

License

MackinationsAi/Window_Trellis

Folders and files

Latest commit

History

Repository files navigation

Installation Steps

Running Gradio WebUI

Structured 3D Latentsfor Scalable and Versatile 3D Generation

🌟 Features

⏩ Updates

🚧 TODO List

📦 Installation

Prerequisites

🤖 Pretrained Models

💡 Usage

Minimal Example

📚 Dataset

⚖️ License

📜 Citation

About

Resources

License

Code of conduct

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Structured 3D Latents
for Scalable and Versatile 3D Generation

Packages