jarek108/jarvis

Jarvis Assistant: High-Performance STS & VLM Infrastructure

Jarvis is a low-latency Speech-to-Speech (STS) and Vision-Language (VLM) assistant framework optimized for the NVIDIA RTX 5090 (Blackwell). It integrates existing speech-to-text, language, vision-language, and text-to-speech models into a single pipeline for real-time voice interaction and visual analysis.

🚀 Key Features

  • Multi-Modal Brain: Support for Large Language Models (LLM) and Vision-Language Models (VLM) via Ollama and vLLM.
  • Fast Transcription: Optimized Speech-to-Text (STT) powered by faster-whisper.
  • Natural Voice: High-quality Text-to-Speech (TTS) using the Chatterbox engine.
  • Hierarchical Dashboard: Real-time TUI dashboard for monitoring benchmarks, logs, and VRAM usage.
  • Benchmarking Suite: Comprehensive test runner with automated Google Drive reporting and session-based artifact persistence.
  • Refactor Guard: A high-fidelity "Plumbing Mode" to verify code integrity without requiring GPU resources.
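The way the STT, LLM, and TTS components listed above chain into a speech-to-speech pipeline can be sketched as follows. This is a minimal illustration, not Jarvis's actual API: `speech_to_speech` and the three stub functions are hypothetical names. The stubs return canned values, which is roughly the idea behind checking the wiring without GPU resources.

```python
from typing import Callable

# Each stage is a plain callable so a stub can stand in for a GPU-backed model.
Transcribe = Callable[[bytes], str]
Generate = Callable[[str], str]
Synthesize = Callable[[str], bytes]


def speech_to_speech(audio: bytes, stt: Transcribe, llm: Generate, tts: Synthesize) -> bytes:
    """Chain STT -> LLM -> TTS and return the synthesized reply audio."""
    text = stt(audio)
    reply = llm(text)
    return tts(reply)


# Hypothetical stubs: they return canned values and need no GPU.
def stub_stt(audio: bytes) -> str:
    return "summarize this"


def stub_llm(prompt: str) -> str:
    return f"echo: {prompt}"


def stub_tts(text: str) -> bytes:
    return text.encode("utf-8")
```

Passing real model wrappers in place of the stubs would leave the pipeline function unchanged, which is why the stages are modeled as interchangeable callables here.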

🚀 Core Capabilities Matrix

Jarvis interacts with your environment through prioritized data channels, focusing on tactile productivity first.

Channel     Type  Priority  Usage Example
Microphone  aud   P0        Capture "Summarize this" voice command.
Selection   txt   P0        Read highlighted code for refactoring.
Clipboard   txt   P0        Paste result directly back into a draft.
Chat UI     txt   P1        Monitor real-time logs and history.
Speaker     aud   P1        Verbal responses for hands-free mode.
Screenshot  img   P2        Analyze a static error or UI element.
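The channel matrix above could be represented in code roughly like this. It is a sketch only: the `Channel` dataclass, the `Priority` enum, and both helper functions are hypothetical, and the assumption that P0 is the highest priority is mine rather than something the table states.

```python
from dataclasses import dataclass
from enum import IntEnum


class Priority(IntEnum):
    P0 = 0  # assumed to be the highest priority
    P1 = 1
    P2 = 2


@dataclass(frozen=True)
class Channel:
    name: str
    kind: str  # "aud", "txt", or "img", as in the table
    priority: Priority


CHANNELS = [
    Channel("Microphone", "aud", Priority.P0),
    Channel("Selection", "txt", Priority.P0),
    Channel("Clipboard", "txt", Priority.P0),
    Channel("Chat UI", "txt", Priority.P1),
    Channel("Speaker", "aud", Priority.P1),
    Channel("Screenshot", "img", Priority.P2),
]


def channels_at(priority: Priority) -> list[str]:
    """Return the names of all channels in the given priority tier."""
    return [c.name for c in CHANNELS if c.priority == priority]


def by_priority(channels: list[Channel]) -> list[Channel]:
    """Sort channels so P0 comes first; sorted() is stable within a tier."""
    return sorted(channels, key=lambda c: c.priority)
```

For example, `channels_at(Priority.P0)` lists the three channels the table marks as P0.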

📂 Project Structure

  • /servers: Individual component servers (STT, TTS, STS).
  • /utils: Core system utilities (Config, Infra, VRAM, Hardware).
  • /tests: Benchmarking logic, test plans, and domain suites.
  • /loadouts: Production-ready model configurations.
  • /docs: Detailed architectural and procedural documentation.

🛠️ Quick Start

1. Installation

Jarvis requires a specialized environment for NVIDIA Blackwell hardware.

# Run the automated bootstrap script
python setup/setup_env.py

For manual installation or troubleshooting, see docs/TUTORIAL_QUICKSTART.md.

2. Infrastructure Management

Use the loadout manager to start or stop the Jarvis cluster.

# Apply a specific model preset
python manage_loadout.py --apply base-qwen30-multi

# Check cluster health
python manage_loadout.py --status
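If the loadout manager needs to be driven from another Python script, a thin wrapper along these lines may help. It is a sketch under assumptions: `loadout_command` and `run_loadout` are hypothetical helpers, the script is assumed to be in the current working directory, and only the two flags shown above are used. Whether `--apply` and `--status` can be combined in one call is not confirmed here, so pass one at a time.

```python
import subprocess
import sys
from typing import Optional


def loadout_command(apply: Optional[str] = None, status: bool = False) -> list[str]:
    """Build the argv for manage_loadout.py using only the two flags shown above."""
    cmd = [sys.executable, "manage_loadout.py"]
    if apply is not None:
        cmd += ["--apply", apply]
    if status:
        cmd.append("--status")
    return cmd


def run_loadout(apply: Optional[str] = None, status: bool = False) -> int:
    """Run the command and return its exit code.

    Assumes manage_loadout.py is in the current working directory; what the
    exit code means is up to the script and is not documented here.
    """
    return subprocess.run(loadout_command(apply, status)).returncode
```

Calling `run_loadout(apply="base-qwen30-multi")` would then be equivalent to the first command above.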

📖 Documentation Index (Diátaxis)

🎓 Tutorials (Learning)

🛠️ How-to Guides (Tasks)

📚 Concepts (Understanding)

📖 Reference (Information)


For AI-assisted development instructions, see GEMINI.MD.
