B1 mtp qwen rebase #13

Merged
Ooooze merged 3 commits into feature/turboquant-kv-cache from b1-mtp-qwen-rebase
May 13, 2026

Ooooze commented May 13, 2026

Overview

Additional information

Requirements

Ooooze added 3 commits May 13, 2026 12:31
… Qwen 3.6 NextN

- Added a detailed description of the AtomicChat `UDT` quantization process to NEXTN.md, including tensor-type file overrides and build entrypoints.
- Updated README.md to include optional UDT quant information and links to relevant documentation.
- Modified bench-matrix script to support combined GGUF benchmarking and added filtering options for benchmark modes.
- Improved summary output in the benchmarking script to include optional markdown headings and better formatting.
- Introduced a new environment variable `QWEN_UDT_ABLATION_AUTO` to control filtering for benchmark modes based on model versions.
- Refactored the `bench-qwen-udt-matrix-local.sh` script to improve clarity and structure, ensuring proper handling of model types and filtering.
- Updated `bench-qwen-udt-quality.sh` to support an optional second pass on chat-style text files, with a default sample chat calibration file included.
- Improved error handling in `get-wikitext-2.sh` for downloading and unzipping files.
- Added a new sample chat calibration file to enhance benchmarking capabilities.
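
The download and unzip error handling mentioned for `get-wikitext-2.sh` could look roughly like the sketch below. This is not the script's actual code: the function name `fetch_and_unzip` and its arguments are hypothetical, and it assumes `curl` and `unzip` are available.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not taken from get-wikitext-2.sh): download an archive
# and unpack it, stopping with a clear message at the first failing step.
fetch_and_unzip() {
    local url="$1" archive="$2" dest="$3"

    # -f makes curl return non-zero on HTTP errors instead of saving an error page.
    if ! curl -fL --retry 3 -o "$archive" "$url"; then
        echo "error: download failed: $url" >&2
        rm -f "$archive"    # do not leave a partial file behind
        return 1
    fi

    # Test archive integrity before extracting anything.
    if ! unzip -tq "$archive" >/dev/null; then
        echo "error: archive is corrupt: $archive" >&2
        rm -f "$archive"
        return 1
    fi

    mkdir -p "$dest"
    if ! unzip -oq "$archive" -d "$dest"; then
        echo "error: could not extract $archive into $dest" >&2
        return 1
    fi
}

# Example call (placeholder URL, shown for shape only):
# fetch_and_unzip "https://example.com/data.zip" "data.zip" "data"
```

Checking the archive before extraction keeps a truncated download from leaving a half-populated dataset directory that a later benchmark run would silently use.
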
…Qwen 3.6 NextN enhancements

- Revised NEXTN.md to highlight the new AtomicChat UDT collection, detailing the combined `_MTP.gguf` quants and their benefits for NextN processing.
- Updated README.md to reflect changes in recommended sources for Qwen 3.6 models, emphasizing the AtomicChat UDT collection and its features.
- Enhanced quantization scripts to support improved file handling and added compatibility for new tensor types.
- Introduced a new script for running perplexity benchmarks on UDT quant models, generating detailed performance logs.
- Improved error handling and user feedback in various scripts to streamline the quantization and benchmarking processes.
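
As an illustration of a perplexity pass that writes one log per model, here is a minimal sketch. It is not the script added in this PR: the function name `run_ppl_matrix`, the `PPL_BIN` variable, the directory layout, and the log naming are all assumptions, and it presumes a llama.cpp-style `llama-perplexity` binary that takes `-m` for the model and `-f` for the evaluation text.

```shell
#!/usr/bin/env bash
# Hypothetical sketch: run a perplexity pass over every .gguf file in a
# directory and keep one log file per model.
# PPL_BIN (assumed name) overrides the binary; it defaults to llama-perplexity.
run_ppl_matrix() {
    local model_dir="$1" text_file="$2" log_dir="$3"
    local ppl_bin="${PPL_BIN:-llama-perplexity}"
    local model name failed=0

    if [ ! -f "$text_file" ]; then
        echo "error: evaluation text not found: $text_file" >&2
        return 1
    fi
    mkdir -p "$log_dir"

    for model in "$model_dir"/*.gguf; do
        # If the glob matched nothing, the literal pattern is left in place.
        if [ ! -e "$model" ]; then
            echo "error: no .gguf files in $model_dir" >&2
            return 1
        fi
        name="$(basename "$model" .gguf)"
        echo "==> $name"
        # Capture stdout and stderr together so the log holds the full run.
        if ! "$ppl_bin" -m "$model" -f "$text_file" >"$log_dir/$name.log" 2>&1; then
            echo "warning: run failed for $name (see $log_dir/$name.log)" >&2
            failed=1
        fi
    done
    return "$failed"
}

# Example call (paths are placeholders):
# run_ppl_matrix ./models ./wikitext-2-raw/wiki.test.raw ./ppl-logs
```

The loop keeps going after a failed model so that one bad quant does not hide the results for the rest, and the non-zero return value still signals the failure to the caller.
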
Ooooze merged commit 8893692 into feature/turboquant-kv-cache May 13, 2026
1 check passed
github-actions bot added the documentation and script labels May 13, 2026
Labels

documentation, script