Commit dc0d886
docs: clarify --ctx-size and --parallel interaction in arg.cpp
When using --parallel N, the --ctx-size value is the TOTAL context
divided among all slots, not the per-slot context. This is a common
source of confusion (see #11681, #5732).
Examples:
- --ctx-size 4096 --parallel 4 → each slot gets 1024 tokens
- To get 4096 tokens per slot with 4 parallel slots, use --ctx-size 16384
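
The arithmetic in the examples above can be checked with a small sketch. The helper names per_slot_ctx and required_ctx_size are hypothetical and do not exist in arg.cpp; the sketch assumes the total context is split evenly between slots using integer division, and does not model any remainder or rounding the server may apply.

```cpp
#include <cstdint>

// Hypothetical helpers (not part of arg.cpp): they only restate the
// arithmetic from the examples in this commit message.

// Context available to each slot when the total --ctx-size is split evenly.
// Assumption: plain integer division; a remainder is silently dropped here.
constexpr std::uint32_t per_slot_ctx(std::uint32_t ctx_size, std::uint32_t n_parallel) {
    return n_parallel == 0 ? 0 : ctx_size / n_parallel;
}

// Total --ctx-size needed so that each of n_parallel slots gets per_slot tokens.
constexpr std::uint32_t required_ctx_size(std::uint32_t per_slot, std::uint32_t n_parallel) {
    return per_slot * n_parallel;
}

// Compile-time checks matching the two examples above.
static_assert(per_slot_ctx(4096, 4) == 1024, "4096 total / 4 slots");
static_assert(required_ctx_size(4096, 4) == 16384, "4096 per slot * 4 slots");
```

Both checks run at compile time, so the file compiling at all confirms the two examples are consistent with each other.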
Updated the help text in arg.cpp (the source for auto-generated docs)
for both --ctx-size and --parallel flags to clarify this behavior.
Fixes #11681
1 parent c42712b commit dc0d886
1 file changed, +5 -2 lines changed
- Hunk 1: original lines 920-926 → new lines 920-928 (1 line removed, 3 lines added)
- Hunk 2: original lines 1756-1762 → new lines 1758-1765 (1 line removed, 2 lines added)