Releases: janelu9/EasyLLM

v5.0.6.rc1 (Pre-release)

12 Feb 07:10

qwen3-vl supported.

v5.0.5

04 Sep 03:04

qwen3-4b with tp>1 is now supported.
RLHF:
Skip loading weights from local files when starting up vLLM engines.
LLM was replaced by AsyncLLM, so manual batching was replaced by automatic continuous batching; training wait time is therefore shortened to the time of the first response.
max_num_seqs for training is no longer needed. Adjust vLLM's max_num_seqs and num_vllm_engines to balance training speed against the inference speed of one batch of samples for the best performance, as sketched below.
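A minimal sketch of that balancing advice, assuming you have measured how long training and generation take per batch; only max_num_seqs and num_vllm_engines come from this release note, while the helper name and timing numbers are hypothetical:

```python
# Rough tuning aid: training one batch of samples should take roughly as
# long as the vLLM engines need to generate it, otherwise one side idles.
def is_balanced(train_secs_per_batch: float,
                infer_secs_per_batch: float,
                tolerance: float = 0.2) -> bool:
    """True if the measured training and generation times are within tolerance."""
    gap = abs(train_secs_per_batch - infer_secs_per_batch)
    return gap / max(train_secs_per_batch, infer_secs_per_batch) < tolerance

max_num_seqs, num_vllm_engines = 8, 2
batch_size = max_num_seqs * num_vllm_engines    # samples produced per inference round
# Example measurements: 0.8 s of training vs 2.4 s of generation per batch.
print(batch_size, is_balanced(0.8, 2.4))        # 16 False -> knobs still need adjusting
```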

v5.0.4.post1

18 Aug 08:45

RLHF:
More vLLM engines corresponding to one training replica can be started; max_num_seqs*num_vllm_engines responses can be generated per inference request.
Adjust the arguments max_num_seqs and num_vllm_engines so that the training speed on max_num_seqs*num_vllm_engines*num_generations samples approximately equals the inference speed of one request, which yields the best performance.
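As an illustration with made-up numbers: with max_num_seqs = 8, num_vllm_engines = 4 and num_generations = 8, one inference request returns 8 × 4 = 32 responses, which correspond to 8 × 4 × 8 = 256 training samples; the goal is to tune the two arguments until training those 256 samples takes about as long as serving that single request.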

v5.0.4

14 Aug 08:38

Enjoy efficient RL.

v5.0.4.rc2

05 Aug 12:18

Support GRPO on deepseek-685b.

v5.0.4.rc1

24 Jul 02:41

Cyclically make vLLM inference and jllm training asynchronous during RL.
Support zero loss of RL.
Support training a micro batch of the generated samples belonging to one group per micro step during RL (see the sketch below).
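A minimal sketch of that micro-batching scheme, assuming a group is simply the list of samples generated for one prompt; the function and variable names below are hypothetical, not EasyLLM's API:

```python
# Split the samples generated for one group into micro batches and consume
# one micro batch per micro step; gradients would be accumulated across the
# micro steps and applied once the whole group has been seen.
def micro_batches(group_samples, micro_batch_size):
    """Yield consecutive micro batches from one group's generated samples."""
    for start in range(0, len(group_samples), micro_batch_size):
        yield group_samples[start:start + micro_batch_size]

group = [f"sample_{i}" for i in range(8)]        # one group of 8 generations
for micro_step, batch in enumerate(micro_batches(group, micro_batch_size=2)):
    print(micro_step, batch)                     # 4 micro steps of 2 samples each
```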

v5.0.3

11 Jul 14:35

MoE:
Experts in the tp-ep group will be recombined every several steps for load balancing if swap_experts_per_steps > 0 is set (see the sketch after this list).
RLHF:
Multiple vLLM engines can be started.
Generated completions of one group can be distributed across different data-parallel ranks when sequence parallelism is on.
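The expert recombination mentioned under MoE can be pictured as a periodic re-assignment of experts to ranks of the tp-ep group; the sketch below is only a conceptual illustration (the random shuffling criterion and every name other than swap_experts_per_steps are assumptions, not the repository's actual logic):

```python
import random

# Every `swap_experts_per_steps` steps, re-assign experts to ranks so that
# heavily used experts do not stay pinned to the same rank.
swap_experts_per_steps = 10
num_experts, ep_size = 8, 4
expert_to_rank = [e % ep_size for e in range(num_experts)]

for step in range(1, 31):
    # ... forward/backward/optimizer step would run here ...
    if swap_experts_per_steps > 0 and step % swap_experts_per_steps == 0:
        random.shuffle(expert_to_rank)           # recombine experts across ranks
        print(f"step {step}: expert placement -> {expert_to_rank}")
```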

v5.0.3.rc1

04 Jul 06:20

qwen3/qwen3-moe supported.

5.0.2.post1

25 Jun 09:42

qwen2-vl/qwen2.5-vl supported on NPUs.

v5.0.2

10 Jun 05:06

More efficient MoE training with large EP.