Enhance multimodal support and speculative decoding in atomic-llama-c… by Ooooze · Pull Request #14 · AtomicBot-ai/atomic-llama-cpp-turboquant

Ooooze · 2026-05-13T17:26:52Z

…pp-turboquant

Updated NEXTN.md to document the integration of --mmproj with speculative decoding types mtp, nextn, and eagle3, allowing coexistence on a single slot.
Revised README.md to reflect the new multimodal capabilities and their implications for text and image processing.
Added functions in common/speculative.cpp and common/speculative.h to check compatibility of speculative types with multimodal settings.
Enhanced server context handling to manage multimodal prompts and ensure correct behavior during speculative decoding.
Introduced a new script for running Gemma 4 with multimodal projector support, detailing expected behavior for text and image turns.
Updated documentation in docs/speculative.md to clarify per-turn behavior and future roadmap for draft acceleration on vision turns.

Overview

Additional information

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure:

…pp-turboquant - Updated NEXTN.md to document the integration of `--mmproj` with speculative decoding types `mtp`, `nextn`, and `eagle3`, allowing coexistence on a single slot. - Revised README.md to reflect the new multimodal capabilities and their implications for text and image processing. - Added functions in `common/speculative.cpp` and `common/speculative.h` to check compatibility of speculative types with multimodal settings. - Enhanced server context handling to manage multimodal prompts and ensure correct behavior during speculative decoding. - Introduced a new script for running Gemma 4 with multimodal projector support, detailing expected behavior for text and image turns. - Updated documentation in `docs/speculative.md` to clarify per-turn behavior and future roadmap for draft acceleration on vision turns.

Ooooze merged commit 0a635dc into feature/turboquant-kv-cache May 13, 2026

github-actions Bot added documentation Improvements or additions to documentation examples server script labels May 13, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enhance multimodal support and speculative decoding in atomic-llama-c…#14

Enhance multimodal support and speculative decoding in atomic-llama-c…#14
Ooooze merged 1 commit into
feature/turboquant-kv-cachefrom
b1-mtp-qwen-rebase

Ooooze commented May 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Ooooze commented May 13, 2026

Overview

Additional information

Requirements

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant