Fix multi-sequence embeddings by iamlemec · Pull Request #2058 · abetlen/llama-cpp-python

iamlemec · 2025-08-19T19:23:50Z

Fixes multi-sequence (batch) embeddings by handling n_seq_max and kv_unified flags. See discussion in #2051.

LimePencil · 2025-09-15T14:22:08Z

@abetlen any updates yet?

freckletonj · 2025-12-13T07:56:56Z

confirming this is still an issue

mlisovyi · 2026-05-05T09:45:32Z

Shouldn't n_seq_max also be used in Llama.embed()? One should add `or p_batch == n_seq_max` to the batch-evaluation condition in the loop here. Otherwise there is a danger of collecting a batch that contains more sequences than the configured maximum (if individual inputs are short or the configured n_seq_max is small), and this will lead to the same `llama_decode returned -1` error.
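To make the suggested condition concrete, here is a minimal standalone sketch of the batching rule. It is not the library's code: `plan_batches` and `t_batch` are hypothetical names for illustration (only `p_batch` and `n_seq_max` come from the comment above), and the real Llama.embed() loop decodes each batch rather than returning a plan. The idea is simply that a batch is flushed either when adding the next input would exceed the token budget or when the batch already holds `n_seq_max` sequences.

```python
from typing import List


def plan_batches(
    token_lists: List[List[int]], n_batch: int, n_seq_max: int
) -> List[List[List[int]]]:
    """Group tokenized inputs into batches (hypothetical helper, illustration only).

    A batch is closed when adding the next input would exceed n_batch tokens
    OR when it already contains n_seq_max sequences.
    """
    if n_batch < 1 or n_seq_max < 1:
        raise ValueError("n_batch and n_seq_max must be positive")

    batches: List[List[List[int]]] = []
    current: List[List[int]] = []
    t_batch = 0  # tokens in the current batch
    p_batch = 0  # sequences in the current batch

    for tokens in token_lists:
        n_tokens = len(tokens)
        if n_tokens > n_batch:
            raise ValueError("a single input is longer than n_batch tokens")

        # Flush on either limit; the second clause is the proposed addition.
        if current and (t_batch + n_tokens > n_batch or p_batch == n_seq_max):
            batches.append(current)
            current, t_batch, p_batch = [], 0, 0

        current.append(tokens)
        t_batch += n_tokens
        p_batch += 1

    # Flush whatever is left after the last input.
    if current:
        batches.append(current)
    return batches
```

With five one-token inputs, a large token budget, and `n_seq_max=2`, the token check alone would never fire, so without the sequence-count clause all five sequences would land in a single batch.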

mlisovyi · 2026-05-05T10:13:12Z

Also, would it make sense to expose those parameters in ModelSettings with some meaningful defaults, so that they can be set when running the server?

abetlen · 2026-06-01T02:01:44Z

@iamlemec thank you, this should have been fixed in v0.3.23

add n_seq_max and kv_unified options; fix batch embedding

a8f1233

astrowonk mentioned this pull request Sep 8, 2025

Can't compute multiple embeddings in a single call #2051

Open

4 tasks

atharva-again mentioned this pull request Oct 8, 2025

feat: Support For Multi-Sequence Embedding / Batch Processing inference-sh/llama-cpp-python#11

Open

kavorite mentioned this pull request Oct 12, 2025

support batch embeddings and zero-copy numpy returns #2077

Closed

abetlen closed this Jun 1, 2026

iamlemec wants to merge 1 commit into abetlen:main from iamlemec:fix-batch-embed

5 participants
