cpu-only
Here are 25 public repositories matching this topic...
🦙 chat-o-llama: A lightweight, modern web interface for AI conversations with support for both Ollama and llama.cpp backends. Features persistent conversation management, real-time backend switching, intelligent context compression, and a clean responsive UI.
-
Updated
Dec 10, 2025 - Python
A high-performance Python library for extracting structured content from PDF documents with layout-aware text extraction. pdf_to_json preserves document structure including headings (H1-H6) and body text, outputting clean JSON format.
-
Updated
Jan 6, 2026 - Python
An LLM-based content moderator. Firefox extension to block webpages unrelated to work, based on page title and URL. Local LLMs with Ollama and Langchain to ensure your browsing history never leaves your device, for complete privacy. Google Gemini also supported.
-
Updated
Dec 12, 2024 - Python
CPU-only RAG stack: PDFs→Docling→Ollama→pgvector. Windows/macOS/Linux. Docker Compose. Graph-aware code search + scanned PDF OCR.
-
Updated
Mar 21, 2026 - Python
-
Updated
Mar 10, 2026 - Shell
LISA: Train 32B-120B language models on limited RAM (8GB). Layer-by-layer processing + LoRA adapters = 97% memory reduction.
-
Updated
Apr 3, 2026 - Python
Cloud transcription stores your audio. Local alternatives need a GPU. Whiscribe runs on CPU, in your browser, with one Python script.
-
Updated
Mar 23, 2026 - Python
Image Classification with On-Device Inference, built with Flutter, AI model runs on mobile cpu
-
Updated
Jan 29, 2025 - Dart
Face locking system built on ArcFace (ONNX) and 5-point alignment that recognizes a selected identity, locks onto it, tracks facial actions, and records behavior over time.
-
Updated
Feb 6, 2026 - Python
Public distributed LLM across any platform
-
Updated
Apr 2, 2026
A new one shot face swap approach for image and video domains - version tailored to work on CPU
-
Updated
Aug 20, 2024 - Python
Pre-built Llama-CPP Wheel for HF Spaces (Python 3.13)
-
Updated
Mar 10, 2026
Semantic plagiarism & content originality detector using sentence-transformer embeddings + FAISS. Catches paraphrasing, not just copy-paste. 3 severity levels, originality scoring, HTML/JSON/terminal reports. 100% local, no API keys, CPU-only.
-
Updated
Mar 30, 2026 - Python
Probabilistic Signed Distance Fusion with View Planning on CPU
-
Updated
Mar 6, 2026 - Python
One-line TTS for Python. Real speech on any CPU. No GPU, no cloud, no API keys.
-
Updated
Mar 20, 2026 - Python
Real-time sign language detection & translation using MediaPipe, LSTM, and Gemini 2.5 Flash — with WebSocket streaming, TTS audio output, and a Next.js frontend. CPU-only, low-spec friendly.
-
Updated
Apr 1, 2026 - Python
CPU-optimized RAG pipeline reducing latency 2.7× (247ms → 92ms). Implements caching, filtering, quantization for production. Complete with FastAPI, Docker, benchmarks, investor materials. The engineering showcase that sells itself.
-
Updated
Mar 31, 2026 - Python
Improve this page
Add a description, image, and links to the cpu-only topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the cpu-only topic, visit your repo's landing page and select "manage topics."