[feature] sglang support #955
Merged
Conversation
wheresmyhair (Collaborator) commented on Nov 23, 2025
- Supports the sglang inference backend
- Bumps the lmflow version to 1.1.0
research4pan (Contributor) requested changes on Nov 24, 2025
A unit test is needed.
Main feature
- Update `datasets` to 3.6.0
- `setup.py`: add `sglang` as an optional dependency
- `--inference_engine` is for choosing `vllm`
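A minimal sketch of what the new engine selector could look like as a dataclass argument. The field name `inference_engine` comes from the PR description, but the class name, default value, and allowed choices here are assumptions for illustration, not copied from `src/lmflow/args.py`:

```python
from dataclasses import dataclass, field


@dataclass
class InferencerArguments:
    """Sketch of the argument container; not the actual LMFlow class."""

    # Selects the inference backend; choices and default are assumptions.
    inference_engine: str = field(
        default="vllm",
        metadata={"help": "Inference backend, e.g. 'vllm' or 'sglang'."},
    )

    def __post_init__(self):
        allowed = {"vllm", "sglang"}
        if self.inference_engine not in allowed:
            raise ValueError(f"inference_engine must be one of {allowed}")
```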
Details
- `src/lmflow/args.py`: `inference_tensor_parallel_size` and `inference_gpu_memory_utilization` are the latest arguments for both `vllm` and `sglang`; `vllm_tensor_parallel_size` and `vllm_gpu_memory_utilization` will soon be deprecated (but are still usable in this version, with higher priority)
- `src/lmflow/models/hf_decoder_model.py`:
  - Add `@deprecated_args` to translate deprecated args to the latest args automatically (improves backward compatibility; also in `src/lmflow/utils/deprecated.py`)
  - Add the corresponding logic for sglang
  - line 539: `bos_token` -> `""` for `sglang`, following sglang's logic
- Add
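The deprecated-argument translation described above could work roughly as follows. This is a hedged sketch, not the actual implementation in `src/lmflow/utils/deprecated.py`; the keyword-mapping interface and the example function `start_engine` are assumptions. It does preserve the behavior stated in the PR: the deprecated name still works and takes higher priority when both names are supplied:

```python
import functools
import warnings


def deprecated_args(**alias_map):
    """Map deprecated keyword args to their replacements.

    alias_map: {deprecated_name: new_name}. Sketch only; the real
    decorator lives in src/lmflow/utils/deprecated.py.
    """
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            for old, new in alias_map.items():
                if old in kwargs:
                    warnings.warn(
                        f"'{old}' is deprecated; use '{new}' instead.",
                        DeprecationWarning,
                        stacklevel=2,
                    )
                    # Deprecated arg wins over the new one ("higher priority").
                    kwargs[new] = kwargs.pop(old)
            return func(*args, **kwargs)
        return wrapper
    return decorator


@deprecated_args(vllm_tensor_parallel_size="inference_tensor_parallel_size")
def start_engine(inference_tensor_parallel_size=1):
    # Hypothetical consumer of the translated argument.
    return inference_tensor_parallel_size
```

Calling `start_engine(vllm_tensor_parallel_size=4)` emits a `DeprecationWarning` and forwards the value to `inference_tensor_parallel_size`.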
Question
- `src/lmflow/models/hf_decoder_model.py`, line 539: `bos_token` -> `""`. This may shift the prompt distribution; please confirm its correctness.
- `src/lmflow/models/hf_model_mixin.py`, lines 555-559: can be removed if not necessary.
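The line-539 behavior under discussion can be sketched as a backend-conditional prompt builder. This is an assumption based on the review note, not the actual code: the idea would be that the BOS string is dropped for `sglang` because that backend handles tokenization itself, which is exactly why the reviewer asks whether the prompt distribution changes:

```python
def build_prompt(text: str, bos_token: str, backend: str) -> str:
    """Hypothetical illustration of the bos_token -> "" behavior.

    For sglang the BOS string is replaced with "" (per the PR); other
    backends keep the tokenizer's BOS token prepended.
    """
    bos = "" if backend == "sglang" else bos_token
    return f"{bos}{text}"
```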
Suggestions
- Add instructions to `README.md` on how to use the sglang inferencer
- Add unit tests for vllm and SGLang inference:
  - compare against the original vllm and SGLang with a small example
  - with a small example, compare against Hugging Face generation and check the inference probabilities
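The suggested consistency test could be shaped like the following sketch. Everything here is hypothetical scaffolding, not LMFlow API: `generate_with` is a stand-in for deterministically (greedy) decoding a tiny prompt with each backend, and the hard-coded outputs only exist so the sketch is self-contained:

```python
import unittest


class TestBackendConsistency(unittest.TestCase):
    """Sketch of the suggested test: backends should agree on a tiny example."""

    PROMPT = "The capital of France is"

    def generate_with(self, engine: str) -> str:
        # Placeholder: a real test would run greedy decoding on PROMPT
        # with the chosen backend (hf / vllm / sglang) and the same model.
        return {"hf": " Paris", "vllm": " Paris", "sglang": " Paris"}[engine]

    def test_backends_match_hf(self):
        # Hugging Face generation is the reference output.
        hf_out = self.generate_with("hf")
        self.assertEqual(self.generate_with("vllm"), hf_out)
        self.assertEqual(self.generate_with("sglang"), hf_out)
```

A real version would additionally compare per-token log-probabilities within a tolerance, per the reviewer's "check inference probability" suggestion.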
research4pan (Contributor) approved these changes on Nov 25, 2025
Unit tests added. LGTM
