References: - https://github.com/ggml-org/llama.cpp/issues/20977 - https://github.com/TheTom/turboquant_plus - https://github.com/ggml-org/llama.cpp/pull/21089
References: