-
Notifications
You must be signed in to change notification settings - Fork 302
Pull requests: NovaSky-AI/SkyRL
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Modify SkyRL Generator to Append Router Indices in Multi-Turn
#1530
opened Apr 17, 2026 by
devpatelio
Collaborator
Loading…
Add FoldGRPO advantage estimator and process_rewards pipeline
#1514
opened Apr 15, 2026 by
sumi-fleet-hub
Loading…
SFT loss aggregation consistent with RL path
#1513
opened Apr 15, 2026 by
agolajko
Contributor
Loading…
[WIP][ci] Add megatron/vllm test suite and supported models page to docs
#1508
opened Apr 14, 2026 by
erictang000
Collaborator
•
Draft
fix(docker): optimize Dockerfile.megatron to reduce image size by 1.36 GB
run_train_megatron_gpu_ci
#1499
opened Apr 11, 2026 by
dinhxuanvu
Loading…
[tinker] Fix single request batching in TinkerEngine
#1489
opened Apr 10, 2026 by
pcmoritz
Collaborator
Loading…
[skyrl][tinker] Multi-modal Tinker Sampling
#1484
opened Apr 9, 2026 by
nithinvc
Contributor
Loading…
3 tasks done
Add prefix-aware merging for step-wise training
#1479
opened Apr 8, 2026 by
CharlieFRuan
Member
Loading…
3 tasks done
feat: add max_tokens_per_microbatch config for token-based micro-batching
#1477
opened Apr 8, 2026 by
erictang000
Collaborator
Loading…
feat: native Atropos-SHM integration and modular ingestion layer
#1473
opened Apr 7, 2026 by
RUFFY-369
Loading…
[train] Enable expandable_segments to reduce GPU memory fragmentation
run_train_gpu_ci
#1470
opened Apr 7, 2026 by
CharlieFRuan
Member
•
Draft
5 tasks done
[tinker] Support prompt_logprobs in SkyRLTrainBackend sample() path
#1461
opened Apr 6, 2026 by
pbokc
Contributor
Loading…
[tinker] Support KL loss in SkyRLTrainBackend
#1460
opened Apr 5, 2026 by
pbokc
Contributor
Loading…
feat: LLM-synthesized hints for failed trajectories
#1456
opened Apr 4, 2026 by
dzorlu
Loading…
4 tasks
[skyrl-train] feat: add native GMPO policy loss with validation and tests
#1449
opened Apr 2, 2026 by
taivu1998
Loading…
Fix event-loop blocking in one-step-off async save/export paths
#1446
opened Apr 2, 2026 by
taivu1998
Loading…
Change default KL estimator from k3 to k2 for loss-based KL
#1445
opened Apr 2, 2026 by
taivu1998
Loading…
[skyrl-train] Add trainer-side max_response_length for Dr. GRPO normalization and DAPO overlong handling
#1440
opened Apr 2, 2026 by
taivu1998
Loading…
[tx] Add initial implementation of RayJaxBackend
#1418
opened Mar 31, 2026 by
andrewsykim
Contributor
Loading…
[train] Add Virtual Pipeline Parallelism support to Megatron
#1400
opened Mar 27, 2026 by
tamoghnokandar
Contributor
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-03-17.