Skip to content

Commit a10784a

Browse files
Merge pull request #11 from XyLearningProgramming/bugfix/oom
hard limited n ctx to 2048
2 parents 98dde6f + e31a035 commit a10784a

1 file changed

Lines changed: 1 addition & 1 deletion

File tree

slm_server/config.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -62,7 +62,7 @@ class Settings(BaseSettings):
6262
description="Owner label for /models list. Set SLM_MODEL_OWNER to override.",
6363
)
6464
n_ctx: int = Field(
65-
8192, description="Maximum context window (input + generated tokens)."
65+
2048, description="Maximum context window (input + generated tokens)."
6666
)
6767
n_threads: int = Field(
6868
2, description="Number of OpenMP threads llama‑cpp will spawn."

0 commit comments

Comments
 (0)