Pull requests: mlc-ai/mlc-llm

Pull requests list

[Embedding][Serve] Phase 3: first-class encoder embedding path
#3481 opened Apr 13, 2026 by xthomaswang Contributor
Hybrid model upgrade
#3480 opened Apr 12, 2026 by babusid Contributor
[Feature] Add Qwen3.5 vision model support
#3474 opened Apr 2, 2026 by gnguralnick Contributor
2 tasks done
Add qwen3vl support
#3463 opened Mar 26, 2026 by yatish04
[Embedding][Serve] Phase 2: TVM-native runtime and single-task serving
#3461 opened Mar 24, 2026 by xthomaswang Contributor
9 tasks done
[Refactor] Unify the CausalLM classes
#3447 opened Mar 6, 2026 by babusid Contributor
[Model] Support Gemma 3 Vision
#3429 opened Feb 23, 2026 by gnguralnick Contributor
Add GLM-4.5-Air MoE support
#3388 opened Nov 29, 2025 by otarkhan
NUMA-aware tensor parallelism for CPU inference
#3320 opened Aug 30, 2025 by MagellaX Contributor
Add sequence padding to BeginForward
#3314 opened Aug 25, 2025 by joshua-j-hong Contributor
Add ArceeForCausalLM support
#3294 opened Jul 27, 2025 by bartowski1182
Add Comprehensive QAT Training Framework for MLC-LLM
#3258 opened Jun 23, 2025 by alohachen
7 of 9 tasks
[CPP_CLI] MLC Cli App over JSONEngine interface
#3114 opened Jan 30, 2025 by srkreddy1238 Contributor
[Model] Add use_qk_norm option for Cohere model
#2877 opened Sep 2, 2024 by tlopex Member
[Serving] PagedKVCache Quantization
#2663 opened Jul 16, 2024 by davidpissarra Member
Add docker container support
#1271 opened Nov 15, 2023 by Sing-Li Contributor