kvcache.ai
Repositories
- Mooncake (Public)
  Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
- evalscope (Public, forked from modelscope/evalscope)
  A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
- ktransformers (Public)
  A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
- accelerate (Public, forked from huggingface/accelerate)
  🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
- transformers (Public, forked from huggingface/transformers)
  🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
- sglang (Public, forked from sgl-project/sglang)
  SGLang is a fast serving framework for large language models and vision language models.
- sglang_awq (Public, forked from sgl-project/sglang)
  SGLang is a fast serving framework for large language models and vision language models.
- kvcache-blog (Public)
- vllm (Public, forked from vllm-project/vllm)
  A high-throughput and memory-efficient inference and serving engine for LLMs
- DeepEP_fault_tolerance (Public, forked from deepseek-ai/DeepEP)
  DeepEP: an efficient expert-parallel communication library that supports fault tolerance