Merged
11 changes: 11 additions & 0 deletions README.md
@@ -24,6 +24,7 @@ This is an active repository, you can watch for following the latest advances. I
## Table of Contents

- [Awesome-LLM-3D](#awesome-llm-3D)
- [3D Unified Understanding and Generation (LLM)](#3d-unified-understanding-and-generation-via-llm)
- [3D Understanding (LLM)](#3d-understanding-via-llm)
- [3D Understanding (other Foundation Models)](#3d-understanding-via-other-foundation-models)
- [3D Reasoning](#3d-reasoning)
@@ -33,12 +34,20 @@ This is an active repository, you can watch for following the latest advances. I
- [Contributing](#contributing)


## 3D Unified Understanding and Generation via LLM

| Date | Keywords | Institute (first) | Paper | Publication | Others |
| :-----: | :------------------: | :--------------: | :---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | :---------: | :---------:
| 2025-11-07 | Omni-View | PKU | [Omni-View: Unlocking How Generation Facilitates Understanding in Unified 3D Model based on Multiview images](https://arxiv.org/abs/2511.07222) | ICLR 2026 | [github](https://github.com/AIDC-AI/Omni-View) |
| 2025-08-16 | UniUGG | FDU | [UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding](https://arxiv.org/abs/2508.11952) | ICLR 2026 | [github](https://github.com/fudan-zvg/UniUGG) |

## 3D Understanding via LLM

| Date | Keywords | Institute (first) | Paper | Publication | Others |
| :-----: | :------------------: | :--------------: | :---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | :---------: | :---------:
| 2025-11-27 | G<sup>2</sup>VLM | Shanghai AI Lab | [G<sup>2</sup>VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning](https://arxiv.org/pdf/2511.21688) | Arxiv | [github](https://github.com/InternRobotics/G2VLM) |
| 2025-11-07 | Omni-View | PKU | [Omni-View: Unlocking How Generation Facilitates Understanding in Unified 3D Model based on Multiview images](https://arxiv.org/abs/2511.07222) | ICLR 2026 | [github](https://github.com/AIDC-AI/Omni-View) |
| 2025-08-16 | UniUGG | FDU | [UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding](https://arxiv.org/abs/2508.11952) | ICLR 2026 | [github](https://github.com/fudan-zvg/UniUGG) |
| 2025-07-31 | 3D-R1 | PKU | [3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding](https://arxiv.org/abs/2507.23478) | Arxiv | [project](https://aigeeksgroup.github.io/3D-R1/) |
| 2025-06-11 | LEO-VL | BIGAI | [LEO-VL: Efficient Scene Representation for Scalable 3D Vision-Language Learning](https://arxiv.org/abs/2506.09935) | Arxiv | [project](https://leo-vl.github.io/) |
| 2025-06-02 | 3DRS | HKU | [MLLMs Need 3D-Aware Representation Supervision for Scene Understanding](https://arxiv.org/html/2506.01946v1) | Arxiv | [project](https://visual-ai.github.io/3drs/) |
@@ -143,6 +152,8 @@ This is an active repository, you can watch for following the latest advances. I
## 3D Generation
| Date | Keywords | Institute | Paper | Publication | Others |
| :-----: | :------------------: | :--------------: | :---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | :---------: | :---------:
| 2025-11-07 | Omni-View | PKU | [Omni-View: Unlocking How Generation Facilitates Understanding in Unified 3D Model based on Multiview images](https://arxiv.org/abs/2511.07222) | ICLR 2026 | [github](https://github.com/AIDC-AI/Omni-View) |
| 2025-08-16 | UniUGG | FDU | [UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding](https://arxiv.org/abs/2508.11952) | ICLR 2026 | [github](https://github.com/fudan-zvg/UniUGG) |
| 2024-11-14 | LLaMA-Mesh | THU | [LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models](https://arxiv.org/html/2411.09595v1) | Arxiv | [project](https://research.nvidia.com/labs/toronto-ai/LLaMA-Mesh/) |
| 2023-11-29 | ShapeGPT | Fudan University | [ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model](https://arxiv.org/pdf/2311.17618.pdf) | Arxiv | [github](https://github.com/OpenShapeLab/ShapeGPT) |
| 2023-11-27 | MeshGPT | TUM | [MeshGPT: Generating Triangle Meshes with Decoder-Only Transformers](https://arxiv.org/pdf/2311.15475.pdf) | Arxiv | [project](https://nihalsid.github.io/mesh-gpt/) |