⚡ SpawnVerse

The universe where agents are born from tasks.
Zero pre-built agents. Distributed memory. Fossil record. Guardrails.

⭐ If SpawnVerse saves you time, please star this repo — it helps others discover it.

The Problem

Every agent framework today — LangChain, CrewAI, AutoGen, LangGraph — requires you to define agents before the task arrives. You write the roles. You write the prompts. You define the tools. The framework runs what you built.

You are the architect of the workforce.

SpawnVerse inverts this. You give it a task. SpawnVerse invents the workforce.

TRADITIONAL FRAMEWORK:         SPAWNVERSE:
Developer defines agents  →    Task defines agents
Fixed team structure      →    Team emerges from task
Same agents every run     →    New agents every run
You write the code        →    LLM writes the code
Agents forget everything  →    Agents leave fossil memory

How It Works

YOU  →  "Plan a 7-day Tokyo trip, 2 people, budget INR 2L"
                      ↓
        ORCHESTRATOR reads task
        Asks LLM: "What agents does this need?"
        LLM returns: flight_researcher, hotel_finder,
                     itinerary_builder, budget_analyzer...
                      ↓
        For each agent:
          LLM writes complete Python code
          Guardrail scans code for dangerous patterns
          Subprocess runs with OS resource limits
          Agent reads distributed memory
          Agent does its work using think()
          Agent writes ONLY to its own namespace
          Agent sends messages to other agents
          Agent optionally spawns sub-agents
          Agent deposits a fossil on death
                      ↓
YOU  ←  Complete trip plan: flights, hotels,
        itinerary, budget breakdown, weather

Diagrams

[Diagram images: System Architecture · Agent Lifecycle Flow · Distributed Memory Model · Guardrails — 4 Layers · Fossil Record — Long Memory · SpawnVerse vs Traditional Frameworks]

Architecture

╔══════════════════════════════════════════════════════════╗
║                    SPAWNVERSE ENGINE                     ║
╠══════════════════════════════════════════════════════════╣
║                                                          ║
║  Input: task description + optional context dict         ║
║                          ↓                              ║
║  PHASE 0  Index knowledge base into ChromaDB (optional)  ║
║                          ↓                              ║
║  PHASE 1  Decompose: LLM plans the agent team            ║
║                          ↓                              ║
║  PHASE 2  Wave 1 — Gathering (runs in parallel)          ║
║   ┌──────────┐ ┌──────────┐ ┌──────────┐               ║
║   │ Agent A  │ │ Agent B  │ │ Agent C  │               ║
║   │(invented)│ │(invented)│ │(invented)│               ║
║   │subprocess│ │subprocess│ │subprocess│               ║
║   └────┬─────┘ └────┬─────┘ └────┬─────┘               ║
║        └────────────┴────────────┘                      ║
║                  DISTRIBUTED MEMORY                      ║
║           read any namespace · write own only            ║
║                          ↓                              ║
║  PHASE 3  Wave 2 — Synthesis (reads Wave 1 outputs)      ║
║                          ↓                              ║
║  PHASE 4  Fossil deposition · Relationship tracking      ║
║                          ↓                              ║
║  Output: results + message log + execution summary       ║
╚══════════════════════════════════════════════════════════╝
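
The two-wave ordering in the diagram can be sketched in a few lines. This is a minimal illustration, not the engine's actual code: `run_wave`, `run_two_waves` and `demo_agent` are hypothetical names, threads stand in for the real subprocesses, and a plain dict stands in for distributed memory.

```python
from concurrent.futures import ThreadPoolExecutor

def run_wave(agent_names, run_agent, memory, max_parallel=4):
    """Run one wave concurrently; every agent sees the same prior outputs."""
    snapshot = dict(memory)  # outputs from earlier waves only
    with ThreadPoolExecutor(max_workers=max_parallel) as pool:
        futures = {name: pool.submit(run_agent, name, snapshot)
                   for name in agent_names}
        # .result() re-raises any exception the agent raised
        return {name: fut.result() for name, fut in futures.items()}

def run_two_waves(wave1, wave2, run_agent, max_parallel=4):
    """Wave 1 (gathering) must finish before wave 2 (synthesis) starts."""
    memory = {}
    memory.update(run_wave(wave1, run_agent, memory, max_parallel))
    memory.update(run_wave(wave2, run_agent, memory, max_parallel))
    return memory

def demo_agent(name, seen):
    # Hypothetical agent: reports which outputs were visible when it ran.
    return f"{name} saw {sorted(seen)}"

out = run_two_waves(["flights", "hotels"], ["itinerary"], demo_agent)
print(out["itinerary"])  # itinerary saw ['flights', 'hotels']
```

Taking a snapshot per wave keeps agents in the same wave from depending on each other's unfinished output, which matches the "Wave 2 reads Wave 1 outputs" rule above.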

Distributed Memory Model

              spawnverse.db  (SQLite · WAL mode · concurrent-safe)
                           │
       ┌───────────────────┼───────────────────┐
       ↓                   ↓                   ↓
  system.*           flight_agent.*      hotel_agent.*
  project context    own writes only     own writes only
       │                   │                   │
  READ by all         READ by all         READ by all
  WRITE by orch       WRITE by self       WRITE by self

The contract is simple:

read_output("any_agent") — read anyone's result
write_result(value) — write only to your own namespace

Enforced at code-generation time and at the database layer.
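
As a rough illustration of the read-any, write-own contract, here is a small SQLite-backed sketch. The class name `NamespacedMemory`, the table layout and the explicit `writer` argument are assumptions made for this example; the real schema and enforcement in `spawnverse.db` may differ.

```python
import json
import sqlite3

class NamespacedMemory:
    """Toy key-value store: anyone reads, only the owner writes."""

    def __init__(self, path=":memory:"):
        self.db = sqlite3.connect(path)
        # WAL helps concurrent readers on a file-backed database;
        # it has no effect on an in-memory database.
        self.db.execute("PRAGMA journal_mode=WAL")
        self.db.execute(
            "CREATE TABLE IF NOT EXISTS memory ("
            "namespace TEXT, key TEXT, value TEXT, "
            "PRIMARY KEY (namespace, key))"
        )

    def read(self, namespace, key):
        row = self.db.execute(
            "SELECT value FROM memory WHERE namespace=? AND key=?",
            (namespace, key),
        ).fetchone()
        return json.loads(row[0]) if row else None

    def write(self, writer, namespace, key, value):
        if writer != namespace:
            raise PermissionError(f"{writer} may not write to {namespace}")
        self.db.execute(
            "INSERT OR REPLACE INTO memory VALUES (?, ?, ?)",
            (namespace, key, json.dumps(value)),
        )
        self.db.commit()

mem = NamespacedMemory()
mem.write("flight_agent", "flight_agent", "result", {"price_inr": 42000})
flight_result = mem.read("flight_agent", "result")  # any agent may read

try:
    mem.write("hotel_agent", "flight_agent", "result", {})
    cross_write_blocked = False
except PermissionError:
    cross_write_blocked = True
```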

Fossil Record

Every agent that runs leaves a fossil when it dies:
  agent_id · role · task_summary
  constitution  (the code that defined this agent)
  quality_score (0-1: how good was the output?)
  intent_score  (0-1: how close to the original task?)
  tokens_used · runtime_seconds · depth

When vector DB is enabled:
  Run   1: agents use your documents + zero past runs
  Run  10: agents find 9 runs of accumulated knowledge
  Run 100: agents have a rich fossil library to learn from
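
The fossil fields listed above map naturally onto a small record type. The sketch below is illustrative only: the `Fossil` dataclass and the `best_fossils` helper are hypothetical, and ranking past fossils by `quality_score` is an assumption about how a later run might pick examples, not a description of the actual lookup.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Fossil:
    agent_id: str
    role: str
    task_summary: str
    constitution: str       # the code that defined this agent
    quality_score: float    # 0-1
    intent_score: float     # 0-1
    tokens_used: int
    runtime_seconds: float
    depth: int

    def __post_init__(self):
        # Both scores are documented as 0-1, so reject anything else.
        for name in ("quality_score", "intent_score"):
            value = getattr(self, name)
            if not 0.0 <= value <= 1.0:
                raise ValueError(f"{name} must be in [0, 1], got {value}")

def best_fossils(fossils, role, top_k=3):
    """Return the highest-quality past fossils for a given role."""
    matching = [f for f in fossils if f.role == role]
    return sorted(matching, key=lambda f: f.quality_score, reverse=True)[:top_k]

library = [
    Fossil("a1", "hotel_finder", "Tokyo hotels", "...code...", 0.6, 0.8, 3100, 12.4, 0),
    Fossil("a2", "hotel_finder", "Osaka hotels", "...code...", 0.9, 0.7, 2800, 10.1, 0),
    Fossil("a3", "flight_researcher", "Tokyo flights", "...code...", 0.8, 0.9, 3500, 15.0, 0),
]
top = best_fossils(library, "hotel_finder", top_k=1)
```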

Quick Start

# 1. Clone
git clone https://github.com/sajosam/spawnverse
cd spawnverse

# 2. Install — pick your provider
pip install groq              # Groq / LLaMA  (free tier, default)
# pip install openai          # OpenAI        (GPT-4o, GPT-4o-mini)
# pip install anthropic       # Anthropic     (Claude Opus / Sonnet / Haiku)

# For vector DB support:
# pip install chromadb

# 3. Set your API key
export GROQ_API_KEY=your_key_here        # free at console.groq.com
# export OPENAI_API_KEY=your_key_here
# export ANTHROPIC_API_KEY=your_key_here

# 4. Run the simplest example
python examples/01_general/run.py

# 5. Or pass your own task
python examples/01_general/run.py "Research best laptops under 60k in India 2025"

Supported Providers

SpawnVerse works with any of the four providers below. Switch with one config line — no task code changes needed.

| Provider | Models | Key needed | Notes |
|---|---|---|---|
| Groq (default) | `llama-3.3-70b-versatile` · `llama-3.1-8b-instant` · `llama3-70b-8192` | `GROQ_API_KEY` | Free tier · fastest inference |
| OpenAI | `gpt-4o` · `gpt-4o-mini` · `gpt-4-turbo` · `gpt-3.5-turbo` | `OPENAI_API_KEY` | Best reasoning quality |
| Anthropic | `claude-opus-4-5` · `claude-sonnet-4-5` · `claude-haiku-4-5` | `ANTHROPIC_API_KEY` | Excellent instruction-following |
| Ollama (local) | `llama3` · `mistral` · `phi3` · any pulled model | (none) | Fully offline · zero API cost |

Switching providers

from spawnverse import Orchestrator, DEFAULT_CONFIG

# Groq (default — free tier)
config = {**DEFAULT_CONFIG, "provider": "groq",      "model": "llama-3.3-70b-versatile"}

# OpenAI
config = {**DEFAULT_CONFIG, "provider": "openai",    "model": "gpt-4o-mini"}

# Anthropic
config = {**DEFAULT_CONFIG, "provider": "anthropic", "model": "claude-haiku-4-5-20251001"}

# Ollama (local, no internet required)
config = {**DEFAULT_CONFIG, "provider": "ollama",    "model": "llama3",
          "ollama_base_url": "http://localhost:11434/v1"}

Orchestrator(config).run({"description": "Your task here", "context": {}})

Usage

Simplest form

from spawnverse import Orchestrator

result = Orchestrator().run({
    "description": "Research top 5 EVs in India under 25 lakhs for 2025",
    "context": {"buyer_type": "first-time EV buyer", "location": "Bangalore"}
})

Custom config

from spawnverse import Orchestrator, DEFAULT_CONFIG

# Start from the defaults and override only what you need
config = {
    **DEFAULT_CONFIG,
    "max_depth"    : 3,
    "wave1_agents" : 5,
    "parallel"     : True,
    "output_format": "report",
}

Orchestrator(config).run({
    "description": "Write a market research report on AI in Indian healthcare 2025",
    "context": {"audience": "Chief Medical Officer"}
})

With your own documents (RAG)

from spawnverse import Orchestrator, DEFAULT_CONFIG

# Enable the vector DB and choose where its index is stored
config = {
    **DEFAULT_CONFIG,
    "vector_db_enabled": True,
    "vector_db_path"   : "./my_knowledge",
}

# Your documents — text strings or file paths
knowledge = [
    "Mumbai office average rent is INR 120/sqft in 2025.",
    "/path/to/your/market_report.txt",
]

Orchestrator(config).run(
    {"description": "Analyse office real estate in Mumbai for 2025", "context": {}},
    knowledge_base=knowledge
)

# Inside agents, your docs are searchable:
# ctx = rag_context("Mumbai office rent trends")
# answer = think(f"Context:\n{ctx}\n\nAnalyse: ...")
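
Before documents can be searched they are typically split into overlapping chunks; the config exposes `rag_chunk_size` and `rag_chunk_overlap` for this. The function below is a hedged sketch of such a splitter. `chunk_text` is a hypothetical name, and measuring size in characters is an assumption (the engine may count tokens or split on sentence boundaries instead).

```python
def chunk_text(text, chunk_size=800, overlap=100):
    """Split text into fixed-size chunks that share `overlap` characters."""
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")
    step = chunk_size - overlap
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last chunk already reaches the end of the text
        start += step
    return chunks

document = "abcdefghij" * 200          # 2000 characters
chunks = chunk_text(document)          # 3 chunks: 800, 800 and 600 characters
```

The overlap means a sentence cut at a chunk boundary still appears whole in at least one chunk, at the cost of indexing some text twice.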

CONFIG Reference

from spawnverse import DEFAULT_CONFIG

# All keys with defaults:
{
    # Provider
    "provider"           : "",                         # "" = auto-detect from model name
    "model"              : "llama-3.3-70b-versatile",

    # Agent tree
    "max_depth"          : 2,    # 1=flat  2=balanced  3=deep  4=complex
    "wave1_agents"       : 4,    # gathering agents (run in parallel)
    "wave2_agents"       : 4,    # synthesis agents (run after wave 1)

    # Execution
    "parallel"           : True,
    "max_parallel"       : 4,
    "timeout_depth0"     : 120,  # seconds — top-level agents
    "timeout_depth1"     : 90,   # seconds — sub-agents
    "timeout_depth2"     : 60,   # seconds — sub-sub-agents
    "retry_failed"       : True,

    # Quality gates
    "min_spawn_score"    : 0.4,
    "drift_warn"         : 0.45,
    "quality_min"        : 0.45,

    # Token budget
    "token_budget"       : 80000,
    "per_agent_tokens"   : 8000,
    "rate_limit_retry"   : 5,
    "rate_limit_wait"    : 3,

    # Sandbox (Unix only — silently skipped on Windows)
    "sandbox_enabled"    : True,
    "sandbox_cpu_sec"    : 60,
    "sandbox_ram_mb"     : 512,
    "sandbox_fsize_mb"   : 10,

    # Guardrails
    "guardrail_code"     : True,
    "guardrail_output"   : True,
    "guardrail_semantic" : True,

    # Vector DB (optional)
    "vector_db_enabled"  : False,
    "vector_db_path"     : "spawnverse_vectordb",
    "rag_top_k"          : 5,
    "rag_chunk_size"     : 800,
    "rag_chunk_overlap"  : 100,

    # Output
    "output_format"      : "structured",  # "structured" | "report" | "action_plan"
    "show_stdout"        : True,
    "show_messages"      : True,
    "show_progress"      : True,

    # Paths
    "db_path"            : "spawnverse.db",
    "agents_dir"         : ".spawnverse_agents",
}
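
To make the `sandbox_*` and `timeout_*` settings concrete, here is a Unix-only sketch of running a script under OS resource limits with the standard `resource` module. `run_sandboxed` and `_cap` are hypothetical names, and the choice of `RLIMIT_CPU`, `RLIMIT_AS` and `RLIMIT_FSIZE` is an assumption about which limits the three settings map to.

```python
import os
import resource
import subprocess
import sys
import tempfile

def _cap(kind, value):
    """Lower a limit to `value`, never above the current hard limit."""
    _, hard = resource.getrlimit(kind)
    if hard != resource.RLIM_INFINITY:
        value = min(value, hard)
    resource.setrlimit(kind, (value, value))

def run_sandboxed(script_path, cpu_sec=60, ram_mb=512, fsize_mb=10, timeout=120):
    def apply_limits():
        # Runs in the child process just before the script starts.
        _cap(resource.RLIMIT_CPU, cpu_sec)                   # CPU seconds
        _cap(resource.RLIMIT_AS, ram_mb * 1024 * 1024)       # address space
        _cap(resource.RLIMIT_FSIZE, fsize_mb * 1024 * 1024)  # largest file written

    return subprocess.run(
        [sys.executable, script_path],
        preexec_fn=apply_limits,
        capture_output=True,
        text=True,
        timeout=timeout,  # wall-clock limit, separate from the CPU limit
    )

# Demo: run a trivial "agent" script under the default limits.
with tempfile.NamedTemporaryFile("w", suffix=".py", delete=False) as fh:
    fh.write('print("agent ran")')
result = run_sandboxed(fh.name)
os.unlink(fh.name)
```

The `resource` module does not exist on Windows, which is consistent with the sandbox being skipped there.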

Agent Helpers Reference

Every generated agent has these functions available:

# READ — open to all agents
read(namespace, key)              # read any agent's data
read_output(agent_id)             # read another agent's result
read_system(key)                  # read project context
done_agents()                     # list of completed agent IDs

# WRITE — own namespace only
write(key, value)                 # write to your namespace
write_result(value)               # write your main output
write_context(key, value)         # write context for others

# PROGRESS
progress(pct, message)            # report 0-100% progress

# MESSAGES
send(to, type, subject, body)     # send to specific agent
broadcast(subject, body)          # send to all agents
inbox()                           # read your unread messages

# SPAWN
spawn(name, role, task, tools, my_depth)  # request sub-agent

# LLM (safe, with budget + rate-limit handling)
think(prompt)                     # text response
think(prompt, as_json=True)       # structured JSON response, already parsed;
                                  # never call json.loads() on think() output yourself

# VECTOR DB (when enabled)
rag_search(query)                 # list of {text, score, source}
rag_context(query)                # formatted string for think() prompts
rag_store(text, key)              # store for other agents to find

# DONE
done(score)                       # mark complete, score 0.0-1.0

Examples

| # | Name | What it demonstrates |
|---|---|---|
| 01 | General | Pure LLM, any task, zero dependencies |
| 02 | External APIs | Real weather, forex rates (no keys) |
| 03 | Vector DB | Your documents + semantic RAG |
| 04 | Minimal | 2 agents, fastest run, lowest cost |
| 05 | Maximal | Depth 3, all features, production |
| providers | Multi-Provider | Groq · OpenAI · Anthropic · Ollama |

Guardrails — 4 Layers

Layer 1 — Code Scan (BEFORE subprocess starts)
  Blocks dangerous patterns in LLM-generated code:
  os.system · subprocess · __import__ · eval · exec
  open("/etc/...") · requests.post · socket.*
  os.environ (except *_API_KEY reads)
  → Agent file never executed if violation found

Layer 2 — Budget Enforcer (DURING LLM calls)
  Per-agent token limit enforced in agent stdlib
  think() returns "" or {} when budget exhausted
  Prevents one agent consuming the whole run budget

Layer 3 — Output Validator (BEFORE memory write)
  Checks: not None · not empty · not too large · not empty dict
  → Blocked output never reaches shared memory

Layer 4 — Semantic Guardrail (LLM-as-judge)
  Reviews output for: personal data · harmful content ·
  misinformation · off-task · prompt injection
  → Flagged output blocked from shared memory
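
Layers 1 and 3 are simple enough to sketch. The code below is an illustration under assumptions, not the engine's implementation: the pattern list covers only part of what Layer 1 names, `scan_code` and `validate_output` are hypothetical names, and the `max_chars` threshold is invented for the example.

```python
import re

# A subset of the dangerous patterns Layer 1 lists.
BLOCKED_PATTERNS = [
    r"\bos\.system\b",
    r"\bsubprocess\b",
    r"\b__import__\b",
    r"\beval\s*\(",
    r"\bexec\s*\(",
    r"\bsocket\.",
    r"\brequests\.post\b",
]

def scan_code(source):
    """Layer 1: return the blocked patterns found (empty list = clean)."""
    return [p for p in BLOCKED_PATTERNS if re.search(p, source)]

def validate_output(value, max_chars=50_000):
    """Layer 3: reject None, empty, or oversized output."""
    if value is None:
        return False
    if value == "" or value == {} or value == []:
        return False
    return len(str(value)) <= max_chars

violations = scan_code("import os\nos.system('rm -rf /')")
clean = scan_code("result = sum([1, 2, 3])")
```

A regex scan is easy to read but also easy to evade (for example through string concatenation), so it works best as one layer among several; walking the parsed `ast` would be a stricter alternative.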

What Makes SpawnVerse Different

| Feature | LangGraph | CrewAI | AutoGen | SpawnVerse |
|---|---|---|---|---|
| Pre-built agents required | ✅ | ✅ | ✅ | ❌ Never |
| Agent code written at runtime | ❌ | ❌ | ❌ | ✅ |
| Distributed memory ownership | ❌ | ❌ | ❌ | ✅ |
| Fossil record across runs | ❌ | ❌ | ❌ | ✅ |
| Intent drift measurement | ❌ | ❌ | ❌ | ✅ |
| Code-scan guardrail | ❌ | ❌ | ❌ | ✅ |
| OS-level resource limits | ❌ | ❌ | ❌ | ✅ |
| Multi-provider (Groq/OpenAI/Anthropic/Ollama) | ❌ | Partial | Partial | ✅ |
| Agents invent sub-agents | Partial | ❌ | Partial | ✅ |

File Structure

spawnverse/
├── core/
│   ├── __init__.py
│   └── engine.py              complete system (single file)
├── examples/
│   ├── 01_general/run.py      pure LLM, any task
│   ├── 02_external_apis/run.py
│   ├── 03_vectordb/run.py     ChromaDB RAG
│   ├── 04_minimal/run.py
│   ├── 05_maximal/run.py      all features
│   └── providers/run.py       Groq · OpenAI · Anthropic · Ollama
├── docs/
│   ├── diagrams/              architecture diagrams (PNG)
│   ├── architecture.md
│   ├── memory-model.md
│   ├── guardrails.md
│   └── fossils.md
├── requirements.txt
├── setup.py
├── LICENSE
└── README.md

Roadmap

v0.2

Docker sandbox per agent
Better intent drift via sentence-transformers
Unit tests for core modules
Windows resource limits support
Ollama local provider

v0.3

Soul system — persistent agent identities across runs
Fossil curator — auto-promote top constitutions
Agent relationship graph
Evolution engine

v0.4

Async execution
REST API server mode
Web UI for run inspection

Contributing

See CONTRIBUTING.md.

Priority areas:

Tests for core modules
Docker sandbox executor
More domain-specific examples
Better intent drift measurement
Ollama provider integration

License

MIT — use it, build on it, sell with it.

Citation

@software{spawnverse2025,
  title  = {SpawnVerse: Self-Spawning Cognitive Agents with
            Distributed Memory and Fossil Record},
  author = {sajosam},
  year   = {2025},
  url    = {https://github.com/sajosam/spawnverse}
}
