@cmunley1
Contributor

Draft

Needs more testing, plus support for response token IDs, either by patching vLLM or by calling /tokenize.

Signed-off-by: Christian Munley <cmunley@nvidia.com>
@cmunley1 requested a review from bxyu-nvidia on January 28, 2026 02:39

copy-pr-bot bot commented Jan 28, 2026

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.
