FastAPI backend blueprint for AI agent applications.
DDD domains · SQLAlchemy/Alembic · Taskiq workers · admin UI · RAG infrastructure · Claude/Codex collaboration harness.
60s Quickstart · Why · AI Collaboration · Comparison · Architecture · 한국어
No Docker, no PostgreSQL, no cloud credentials — SQLite + in-memory broker.
git clone https://github.com/Mr-DooSun/fastapi-agent-blueprint.git
cd fastapi-agent-blueprint
make setup # one-time: venv + deps via uv
make quickstart # FastAPI on :8001, SQLite schema auto-createdIn a second terminal, make demo exercises the auth + user domains
(JWT register → CRUD → refresh → logout) and make demo-rag exercises the
docs domain (end-to-end RAG: upload → chunk → embed → retrieve → answer
with citations, zero credentials):
→ Health check
{ "status": "ok" }
→ Register
{ "success": true, "data": { "accessToken": "...", "refreshToken": "..." } }
→ Create a user
{ "success": true, "data": { "id": 2, "username": "bob", ... } }
→ List users (page=1, pageSize=10)
{ "data": [ { "id": 1, "username": "alice" }, { "id": 2, "username": "bob" } ],
"pagination": { "currentPage": 1, "totalItems": 2, "hasNext": false } }
→ Update the user → Delete the user
→ Refresh token → Logout
→ Done. API docs: http://127.0.0.1:8001/docs
- API docs: http://127.0.0.1:8001/docs (Stoplight Elements & Scalar recommended; spec download + frontend handoff link on the same page)
- Admin UI: http://127.0.0.1:8001/admin (
admin/admin) - Full walkthrough:
docs/quickstart.md - Frontend handoff:
docs/frontend-handoff.md - Real dev stack (PostgreSQL + migrations):
docs/reference.md
Clone → quickstart → CRUD → JWT auth → background worker → RAG query:
make quickstart && make demo && make demo-ragFull integration walkthrough (auth · RBAC · worker · admin · RAG · OTEL): docs/canonical-demo.md
|
Production rigor
|
AI-assisted acceleration
|
Most templates stop at generated files. This blueprint also ships the collaboration layer that keeps AI coding agents useful after the first scaffold:
- Shared source of truth —
AGENTS.mddefines the architecture, DTO rules, logging rules, security constraints, and default coding flow. - Claude Code + Codex parity — tool-specific harnesses point back to the same shared rules instead of drifting into separate playbooks.
- Repo-local skills — domain scaffolding, API work, admin pages, worker tasks, migrations, tests, architecture review, security review, and guideline sync.
- Governed changes — pre-commit checks, import guards, language policy, and review workflows catch architecture drift before it becomes team debt.
The result is a backend starter that can be used by hand, then accelerated by AI tools without asking every contributor to remember the whole architecture from scratch.
| Feature | FastAPI Agent Blueprint | tiangolo/full-stack | s3rius/template | teamhide/boilerplate |
|---|---|---|---|---|
| Zero-boilerplate CRUD (8 methods) | Yes | No | No | No |
| Auto domain discovery | Yes | No | No | No |
| Architecture enforcement (pre-commit) | Yes | No | No | No |
| AI workflow skills (Claude + Codex) | 14 + 14 | 0 | 0 | 0 |
| Vector infrastructure (S3 Vectors) | Yes | No | No | No |
| Multi-interface (API + Worker + Admin + MCP) | 3 + 1 planned | 2 | 1 | 1 |
| Architecture Decision Records | 18 active · 30 archived | 0 | 0 | 0 |
| Type-safe generics across layers | Yes | Partial | Partial | No |
| IoC container DI | Yes | No | No | No |
Full comparison including Litestar, Robyn, cookiecutter, and adoption paths: docs/comparison.md
The blueprint ships a worked RAG example — upload documents, ask questions, get structured answers with citations. It proves the building blocks (vectors, embeddings, LLM agent, worker, admin) compose end-to-end.
make quickstart # terminal 1
make demo-rag # terminal 2 — seeds 3 docs, runs a queryPOST /v1/docs/documents # chunk → embed → upsert
POST /v1/docs/query # embed question → top-k retrieval → agent answer
GET /admin/docs # browse + query playground
Under the hood, the RAG orchestration is a reusable _core pattern
(ADR 040), not a domain.
src/docs/ is one consumer; future AI domains (support_bot, product_qa)
inject the same RagPipeline instead of duplicating chunking + retrieval
code:
# src/_core/domain/services/rag_pipeline.py
class RagPipeline(Generic[TChunk]):
async def answer(self, question, top_k=5, filters=None) -> tuple[QueryAnswer, list[TChunk]]:
... # embed → vector_store.search → answer_agent.answerZero-config path uses a stub embedder (keyword bag-of-words) and stub
answer agent (templated response from retrieved chunks), both in
src/_core/infrastructure/rag/. Set EMBEDDING_PROVIDER + LLM_PROVIDER
in .env to swap in real providers — the pipeline is the same.
Every domain under src/{domain}/ has four DDD layers. Arrows mean
"depends on". Application (use cases) is optional — the dotted
line is the common path for simple CRUD (Router → Service directly).
flowchart LR
subgraph domain["src/{domain}/ (4 DDD layers)"]
I["Interface<br/>routers · admin · worker · schemas"]
A["Application<br/>use cases — optional"]
D["Domain<br/>services · protocols · DTOs · value objects"]
Inf["Infrastructure<br/>repositories · models · DI container"]
I --> A
A --> D
Inf --> D
I -. direct when no UseCase .-> D
end
Core["src/_core/<br/>Base classes · CoreContainer · shared VOs"]
I --> Core
A --> Core
D --> Core
Inf --> Core
Other["Another domain"] -. via Protocol-based DIP .-> D
| Layer | Role | Base class |
|---|---|---|
| Interface | Router · Request/Response · Admin · Worker task | — |
| Domain | Service · Protocol · DTO · Exceptions | BaseService[CreateDTO, UpdateDTO, ReturnDTO] |
| Infrastructure | Repository · Model · DI container | BaseRepository[ReturnDTO] |
| Application | UseCase — optional orchestrator | — |
Full set of diagrams (Layer · Write · Read) plus RDB / DynamoDB / S3 Vectors variants lives in
docs/ai/shared/architecture-diagrams.md. Non-Mermaid viewers: SVG exports.
flowchart LR
C[Client] -->|"HTTP + JSON"| R[Router]
R -->|"Request schema"| S[Service]
S -->|"entity"| Re["Repository<br/>BaseRepository[DTO]"]
Re -->|"Model(**dto.model_dump())"| M[ORM Model]
M -->|"SQLAlchemy"| DB[(Database)]
- Request → Service directly when fields match (no intermediate DTO — ADR 004).
- Model ↔ DTO conversion happens only inside the Repository.
- Read flow is the mirror image; the Router strips sensitive fields on the way out.
Same flow, different base classes:
| Storage | Service base | Repository / Store base | List return |
|---|---|---|---|
| RDB (default) | BaseService[Create, Update, DTO] |
BaseRepository[DTO] |
(list[DTO], PaginationInfo) |
| DynamoDB | BaseDynamoService[…] |
BaseDynamoRepository[DTO] |
CursorPage[DTO] |
| S3 Vectors | domain-specific | BaseS3VectorStore[DTO] |
VectorSearchResult[DTO] |
One business logic, multiple surfaces:
| Interface | Tech | Status | Purpose |
|---|---|---|---|
| HTTP API | FastAPI | Stable | REST endpoints |
| Async worker | Taskiq + SQS / RabbitMQ / InMemory | Stable | Background jobs |
| Admin UI | NiceGUI | Stable | Auto-generated admin CRUD |
| MCP server | FastMCP | Planned (#18) | AI agent tool interface |
| I want to… | Read |
|---|---|
| Spin it up and poke around | docs/quickstart.md |
| See everything work end-to-end (auth · RBAC · worker · RAG · OTEL) | docs/canonical-demo.md |
| Build a real domain, end-to-end | docs/tutorial/first-domain.md |
| See small, pattern-focused example apps | examples/ |
| Understand the architecture in depth | docs/ai/shared/architecture-diagrams.md · AGENTS.md |
| Set up Claude Code or Codex CLI | docs/ai-development.md |
| Add a domain by hand (no AI tools) | docs/tutorial/first-domain.md (Path B) |
| Adopt into an existing FastAPI project | docs/adoption.md |
| Check Python / FastAPI / tool version support | docs/compatibility.md |
| See detailed env vars, tech stack, project tree | docs/reference.md |
| Understand why a decision was made | ADR index (18 active · 30 archived) |
| Follow what's next | Roadmap · issue tracker |
- MCP server interface — expose domain services as agent tools via FastMCP (#18)
- pgvector — additional vector backend alongside S3 Vectors (#11)
See full roadmap · open issues
See CONTRIBUTING.md for dev setup, coding guidelines,
and the PR workflow. Newcomers — check the
good first issue
label; the small apps tracked under examples/ are a
low-friction place to land your first PR.
MIT — free for commercial use, modification, and distribution.

