B1 mtp qwen rebase by Ooooze · Pull Request #13 · AtomicBot-ai/atomic-llama-cpp-turboquant

Ooooze · 2026-05-13T14:37:08Z

Overview

Additional information

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure:

… Qwen 3.6 NextN - Added detailed descriptions of AtomicChat `UDT` quantization process in NEXTN.md, including tensor-type file overrides and build entrypoints. - Updated README.md to include optional UDT quant information and links to relevant documentation. - Modified bench-matrix script to support combined GGUF benchmarking and added filtering options for benchmark modes. - Improved summary output in the benchmarking script to include optional markdown headings and better formatting.

- Introduced a new environment variable `QWEN_UDT_ABLATION_AUTO` to control filtering for benchmark modes based on model versions. - Refactored the `bench-qwen-udt-matrix-local.sh` script to improve clarity and structure, ensuring proper handling of model types and filtering. - Updated `bench-qwen-udt-quality.sh` to support an optional second pass on chat-style text files, with a default sample chat calibration file included. - Improved error handling in `get-wikitext-2.sh` for downloading and unzipping files. - Added a new sample chat calibration file to enhance benchmarking capabilities.

…Qwen 3.6 NextN enhancements - Revised NEXTN.md to highlight the new AtomicChat UDT collection, detailing the combined `_MTP.gguf` quants and their benefits for NextN processing. - Updated README.md to reflect changes in recommended sources for Qwen 3.6 models, emphasizing the AtomicChat UDT collection and its features. - Enhanced quantization scripts to support improved file handling and added compatibility for new tensor types. - Introduced a new script for running perplexity benchmarks on UDT quant models, generating detailed performance logs. - Improved error handling and user feedback in various scripts to streamline the quantization and benchmarking processes.

Ooooze added 3 commits May 13, 2026 12:31

Ooooze merged commit 8893692 into feature/turboquant-kv-cache May 13, 2026
1 check passed

github-actions Bot added documentation Improvements or additions to documentation script labels May 13, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

B1 mtp qwen rebase#13

B1 mtp qwen rebase#13
Ooooze merged 3 commits into
feature/turboquant-kv-cachefrom
b1-mtp-qwen-rebase

Ooooze commented May 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Ooooze commented May 13, 2026

Overview

Additional information

Requirements

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant