On-device shell command generator for macOS Tahoe. Uses Apple's 3B model with dynamic few-shot retrieval from 21k tldr examples.
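Dynamic few-shot retrieval of this kind typically scores a corpus of (description, command) pairs against the user's request and prepends the best matches to the model's prompt. A minimal sketch of the idea using generic token-overlap scoring (the function names and tiny corpus are illustrative, not this project's actual code or data):

```python
def overlap(query_tokens, example_tokens):
    """Jaccard overlap between two token sets."""
    a, b = set(query_tokens), set(example_tokens)
    return len(a & b) / len(a | b) if a | b else 0.0

def build_prompt(query, corpus, k=3):
    """Pick the k most similar (description, command) pairs as few-shot examples."""
    ranked = sorted(
        corpus,
        key=lambda ex: overlap(query.lower().split(), ex[0].lower().split()),
        reverse=True,
    )
    shots = "\n".join(f"Q: {desc}\nA: {cmd}" for desc, cmd in ranked[:k])
    return f"{shots}\nQ: {query}\nA:"

# Tiny illustrative corpus standing in for the 21k tldr examples.
corpus = [
    ("list files sorted by size", "ls -lS"),
    ("find large files recursively", "du -ah . | sort -rh | head"),
    ("show disk usage", "df -h"),
]
print(build_prompt("list all files by size", corpus, k=2))
```

A real implementation would likely use a stronger scorer (BM25 or embeddings), but the prompt-assembly shape is the same.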
Agentic Android Open Source Project (AAOSP) — Android fork with native LLM system service, MCP-aware apps, and an agent-driven launcher. On-device Qwen 2.5 via llama.cpp. Apps declare tools in their manifest. The OS runs the model.
KnoLo Core is a local-first knowledge base engine built for small language models (SLMs). It packages your documents into a compact .knolo file and enables fully deterministic querying — no embeddings, no vector databases, no cloud services required. Designed for on-device and edge LLM deployments.
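Embedding-free, deterministic querying can be approximated with a plain inverted index: the same input always produces the same ranking, with no model or vector store involved. A rough sketch of that idea (not KnoLo's actual file format or API):

```python
from collections import defaultdict

def build_index(docs):
    """Map each lowercase token to the ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in enumerate(docs):
        for tok in text.lower().split():
            index[tok].add(doc_id)
    return index

def query(index, docs, q):
    """Rank documents by matched query tokens; ties broken by id, so results are deterministic."""
    counts = defaultdict(int)
    for tok in q.lower().split():
        for doc_id in index.get(tok, ()):
            counts[doc_id] += 1
    ranked = sorted(counts, key=lambda d: (-counts[d], d))
    return [docs[d] for d in ranked]

docs = [
    "llama.cpp runs gguf models",
    "vector databases store embeddings",
    "on-device models save privacy",
]
idx = build_index(docs)
print(query(idx, docs, "gguf models"))
```

The trade-off versus embeddings is exact token matching only: no semantic similarity, but perfectly reproducible results on-device.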
High-performance Android SDK for on-device LLM inference (GGUF). Privacy-focused, offline-first, and powered by llama.cpp with a clean Kotlin Coroutines API.
Documentation for MobileTransformers - a lightweight, modular framework based on ONNX Runtime for running and adapting large language models (LLMs) directly on mobile and edge devices. It supports on-device fine-tuning (PEFT), efficient inference, quantization, weight merging, and direct inference from merged models.
llama.cpp packaged for AOSP — Android.bp build rules, JNI bridge, and Qwen 2.5 model download scripts for on-device inference
📱 A panoramic knowledge base for mobile AI operating systems — 334+ in-depth pages covering on-device LLMs, AI agents, chip adaptation, and inference optimization | auto-updated