Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
29 changes: 29 additions & 0 deletions .github/workflows/pr-target-check.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,29 @@
name: PR Target Check

on:
pull_request:
types: [opened, reopened, edited, synchronize]

jobs:
check-target:
runs-on: ubuntu-latest
steps:
- name: Enforce branch strategy
run: |
BASE="${{ github.event.pull_request.base.ref }}"
HEAD="${{ github.event.pull_request.head.ref }}"

echo "PR: $HEAD → $BASE"

if [ "$BASE" = "master" ] && [ "$HEAD" != "develop" ]; then
echo "❌ Only the 'develop' branch can open PRs to 'master'."
echo " Please target 'develop' instead."
exit 1
fi

if [ "$BASE" = "develop" ] && [ "$HEAD" = "master" ]; then
echo "❌ Do not merge 'master' back into 'develop'."
exit 1
fi

echo "✅ PR target is valid: $HEAD → $BASE"
41 changes: 35 additions & 6 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -27,33 +27,62 @@

<div align="center">

### 🛡️ Introducing [SharpAI Aegis](https://www.sharpai.org) — Desktop App for DeepCamera
### 🚀 Introducing [Aegis AI](https://www.sharpai.org) — The World's First AI Home Security Agent

**Use DeepCamera's AI skills through a desktop app with LLM-powered setup, agent chat, and smart alerts — connected to your mobile via Discord / Telegram / Slack.**
**GPT + Local VLM powered security agent for your Mac and PC. Always connected to your mobile via Discord / Telegram / Slack.**

[SharpAI Aegis](https://www.sharpai.org) is the desktop companion for DeepCamera. It uses LLM to automatically set up your environment, configure camera skills, and manage the full AI pipeline — no manual Docker or CLI required. It also adds an intelligent agent layer: persistent memory, agentic chat with your cameras, AI video generation, voice (TTS), and conversational messaging via Discord / Telegram / Slack.
**Aegis AI** is the next-generation product from SharpAI — a desktop security agent that connects to any camera, understands scenes using Vision Language Models, and talks to your family through the apps they already use. DeepCamera will be integrated into Aegis AI as its detection and model training engine.

[**📦 Download SharpAI Aegis**](https://www.sharpai.org)
[**🌐 Visit www.sharpai.org**](https://www.sharpai.org) · [**📦 Download Aegis AI**](https://github.com/SharpAI/DeepCamera/releases/tag/Aegis)

</div>

<table>
<tr>
<td width="50%">
<p align="center"><b>Browse & Run VLMs Locally from HuggingFace</b></p>
<img src="screenshots/aegis-vlm-browser.png" alt="SharpAI Aegis — Browse and download vision language models from HuggingFace" width="100%">
<img src="screenshots/aegis-vlm-browser.png" alt="Aegis AI — Browse and download vision language models from HuggingFace" width="100%">
<p align="center"><em>Download and run SmolVLM2, Qwen-VL, LFM2.5, LLaVA, MiniCPM-V directly on your machine. Even a Mac M1 Mini 8GB works.</em></p>
</td>
<td width="50%">
<p align="center"><b>Chat with your AI Security Agent</b></p>
<img src="screenshots/aegis-chat-agent.png" alt="SharpAI Aegis — Ask your agent what happened and get real answers" width="100%">
<img src="screenshots/aegis-chat-agent.png" alt="Aegis AI — Ask your agent what happened and get real answers" width="100%">
<p align="center"><em>Ask "anyone entering the room?" — Aegis searches your footage and gives you a real answer with timestamps and clips.</em></p>
</td>
</tr>
</table>

---

### 🏗️ The SharpAI Ecosystem

| | **DeepCamera** (This Repo) | **Aegis AI** |
|---|---|---|
| **Role** | Open-source AI/ML engine | AI Security Agent |
| **Frontend UI** | No (CLI + Docker) | Full desktop app |
| **What it does** | Object detection, person REID, model training, feature extraction | VLM scene understanding, smart alerts, chat interface, unified timeline |
| **Cameras** | Via Home-Assistant (RTSP, Blink, Nest) | Direct: Ring, Blink, RTSP/ONVIF, webcam, iPhone, Android |
| **AI Models** | YOLOv7, FastReID, custom training | GPT Vision, local VLMs via llama-server, HuggingFace models |
| **LLM / VLM** | No | Yes — local + cloud |
| **Runs on** | Docker on edge devices & desktops | Native desktop app (Mac, Windows, Linux) |
| **Alerts** | No | Slack, Discord, Telegram |
| **Installation** | Docker + CLI setup | One-click download |

> **DeepCamera** is being integrated into Aegis AI as the detection and model training layer. DeepCamera will remain fully open-source — handling object detection, person re-identification, and custom model training — while Aegis AI provides the agent layer: VLM-powered scene understanding, camera connections, agentic memory, and smart notifications.

### 🗺️ DeepCamera Roadmap

DeepCamera is evolving into a full AI skill platform. Planned features:

- [ ] **Upgrade object detection** to 2026 state-of-the-art YOLO models
- [ ] **VLM analysis backend** — offline scene understanding of recorded clips using vision language models
- [ ] **AI Studios backend** — SAM2 interactive segmentation, DINOv3 grounding, depth estimation, feature extraction
- [ ] **Direct camera provider plugins** — Blink, Ring, Eufy, Reolink, Tapo, RTSP/ONVIF (beyond Home Assistant)
- [ ] **Messaging channel plugins** — Telegram, Discord, Slack, WhatsApp
- [ ] **Automation triggers** — MQTT, webhooks, Home Assistant events
- [ ] **go2rtc streaming** — RTSP to WebRTC live view
- [x] **Skill architecture** — pluggable `SKILL.md` interface for all capabilities

---

## 🎯 Overview
Expand Down
Loading