-
Notifications
You must be signed in to change notification settings - Fork 2k
Pull requests: mlc-ai/mlc-llm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[CI] Migrate lint from Jenkins to GitHub Actions, switch to ruff
#3486
opened Apr 20, 2026 by
MasterJH5574
Member
Loading…
[Embedding][Serve] Phase 3: first-class encoder embedding path
#3481
opened Apr 13, 2026 by
xthomaswang
Contributor
Loading…
[Feature] Add Qwen3.5 vision model support
#3474
opened Apr 2, 2026 by
gnguralnick
Contributor
Loading…
2 tasks done
Add binary group quantization and WebGPU sampler fixes for Bonsai/WebLLM
#3472
opened Apr 2, 2026 by
HMDRAMS-DEV
Loading…
Add NemotronH hybrid Mamba2-Attention model (Nemotron-3-Nano-4B)
#3464
opened Mar 27, 2026 by
OmarAzizi
Loading…
[Embedding][Serve] Phase 2: TVM-native runtime and single-task serving
#3461
opened Mar 24, 2026 by
xthomaswang
Contributor
Loading…
9 tasks done
feat: add Llama 3.2 chat template with Cutting Knowledge Date
#3455
opened Mar 16, 2026 by
gururajkosuru
Contributor
Loading…
NUMA-aware tensor parallelism for CPU inference
#3320
opened Aug 30, 2025 by
MagellaX
Contributor
Loading…
Add sequence padding to BeginForward
#3314
opened Aug 25, 2025 by
joshua-j-hong
Contributor
Loading…
Add Comprehensive QAT Training Framework for MLC-LLM
#3258
opened Jun 23, 2025 by
alohachen
Loading…
7 of 9 tasks
Perf: load weights, create KV cache, initialize tokenizer in parallel
#3215
opened Apr 27, 2025 by
Bekaboo
Loading…
[Serving] Support tool function calls under strict format constraints
#3190
opened Mar 26, 2025 by
Irfnfnkemed
Loading…
[CPP_CLI] MLC Cli App over JSONEngine interface
#3114
opened Jan 30, 2025 by
srkreddy1238
Contributor
Loading…
[SERVE][CPP][Android] add native executable program to benchmark models
#2987
opened Oct 18, 2024 by
pfk-beta
Loading…
ProTip!
Mix and match filters to narrow down what you’re looking for.