Mycel

AI-powered Universal Knowledge Engine. Captures unstructured knowledge through conversations and structures it using multi-agent AI.

Mycel captures unstructured knowledge through natural, multi-turn conversations and transforms it into structured, queryable information. Configured via JSON schemas — a Domain Schema (what to capture) and a Persona Schema (how to communicate) — the same engine can power any knowledge collection use case: a village chronicle, a biography project, or a product knowledge base.

How It Feels

You:   "The old village church was built in 1732 in Baroque style."
Mycel: "1732, Baroque — that's really old! Was it ever renovated?"
You:   "Yes, a new tower was added in 1890."
Mycel: "Fascinating! Anything else come to mind — maybe a local club or a special place?"

You talk. Mycel listens, asks follow-up questions, and structures what you share — all without the user noticing the complexity behind the scenes.

Key Features

Multi-Agent Pipeline — Classifier, Context Dispatcher, Gap-Reasoning, Persona, and Structuring agents work together to understand, complete, and structure knowledge
Dynamic Schema Bootstrap & Evolution — Generate domain schemas from a plain-text description using web research; evolve them over time as conversation patterns reveal new categories
RAG (Cross-Session Recall) — Retrieval-augmented generation via Firestore Vector Search brings prior knowledge into new conversations
Real-time Web Enrichment — Claims are extracted from conversations and validated against web sources asynchronously
Document Generation — Auto-generates structured Markdown knowledge bases from collected entries
Anonymous Auth with Tenant Isolation — GCP Identity Platform JWTs with all data scoped under tenants/{tenantId}/
Interactive API Docs — Auto-generated OpenAPI 3.1 spec with Scalar API Reference UI at /docs

Tech Stack

Layer	Technology
API	Hono + @hono/zod-openapi
Language	TypeScript (strict mode)
AI / LLM	Vertex AI (Gemini), LangGraph.js
Database	Cloud Firestore + Vector Search
Auth	GCP Identity Platform (anonymous JWT)
Hosting	Cloud Run
Infrastructure	Terraform
Testing	Vitest

Getting Started

Prerequisites

Node.js 20+, npm 10+
Java 21+ (for Firestore emulator)
GCP project with Vertex AI API enabled (or use MYCEL_MOCK_LLM=true for local dev without GCP)

Install & Build

npm install
npm run build

Environment Setup

cp .env.example .env
# Edit .env — at minimum set MYCEL_GCP_PROJECT_ID
# For local dev without GCP, keep MYCEL_MOCK_LLM=true

Local Development

# Terminal 1: Start Firestore emulator
npm run emulator:start

# Terminal 2: Start the API server
npm run dev:api
# → http://localhost:8080

Seed Default Schemas

npm run seed:schemas

This loads the default Domain and Persona schemas from config/ into Firestore.

Running Tests

npm run test              # Unit tests
npm run test:integration  # Integration tests (requires emulator)
npm run lint              # ESLint
npm run typecheck         # TypeScript type checking

API Documentation

With the API server running, open http://localhost:8080/docs for the interactive Scalar API Reference UI. The raw OpenAPI spec is available at /openapi.json.

Core endpoints:

Method	Path	Description
`POST`	`/sessions`	Start a new conversation session
`POST`	`/sessions/:id/turns`	Submit a user message
`POST`	`/sessions/:id/end`	End a session
`POST`	`/domains/generate`	Generate a domain schema from a description
`POST`	`/domains/:id/documents/generate`	Generate a knowledge document
`POST`	`/domains/:id/evolution/analyze`	Analyze knowledge for schema evolution

Deployment

Deploy to Cloud Run using the included script:

export MYCEL_GCP_PROJECT_ID=your-project-id
# Optional: export MYCEL_GCP_REGION=europe-west3 (default)
./scripts/deploy.sh

This builds a linux/amd64 Docker image, pushes it to Artifact Registry, and updates the Cloud Run service.

Architecture

graph LR
    A[Input<br/>Text / Audio / Image] --> B[Session Manager<br/>Multi-Turn Conversation]
    B --> C[Agent Pipeline<br/>Classify → Dispatch → Reason → Respond → Structure]
    C --> D[Knowledge Base<br/>Firestore + Vector Search]
    D --> E[Document Generator<br/>Structured Markdown]
    D -->|RAG| C
    D -->|Async| F[Web Enrichment<br/>Claim Validation]
    F --> D
    G[Schema Bootstrap<br/>Web Research] -->|generates| H[Domain & Persona Schemas]
    H -->|configures| C
    D -->|patterns| I[Schema Evolution<br/>New Categories & Fields]
    I -->|evolves| H

The Agent Pipeline processes each user turn through five specialized agents:

Classifier — Categorizes input against the domain schema
Context Dispatcher — Retrieves relevant prior knowledge (RAG)
Gap-Reasoning — Identifies missing information and formulates follow-up questions
Persona — Generates a natural response in the configured communication style
Structuring — Extracts and structures knowledge entries from the conversation

The engine is fully configured by two JSON schemas — no domain logic is hardcoded.

For more detail, see the Architecture Overview and ADRs.

Project Structure

mycel/
├── packages/
│   ├── api/          # HTTP API (Hono, Cloud Run entrypoint)
│   ├── core/         # AI engine (agents, orchestration, RAG)
│   ├── ingestion/    # Multimodal input processing
│   ├── schemas/      # Domain & Persona schema definitions
│   └── shared/       # Shared types, utilities, logger
├── config/           # Default schema JSON files
├── docs/             # Architecture docs & ADRs
├── infra/            # Terraform (GCP infrastructure)
└── scripts/          # CLI tools & deployment

Frontend

The web interface is in a separate repository: mycel-web

Contributing

Contributions are welcome! Please see CONTRIBUTING.md for guidelines.

License

MIT License — see LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 70 Commits
.github		.github
assets		assets
config		config
docs		docs
infra/terraform		infra/terraform
packages		packages
scripts		scripts
.dockerignore		.dockerignore
.env.example		.env.example
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
.prettierrc		.prettierrc
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
ROADMAP.md		ROADMAP.md
SECURITY.md		SECURITY.md
openapi.json		openapi.json
package-lock.json		package-lock.json
package.json		package.json
test-rag.sh		test-rag.sh
tsconfig.base.json		tsconfig.base.json
tsconfig.json		tsconfig.json
vitest.config.ts		vitest.config.ts
vitest.integration.config.ts		vitest.integration.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mycel

How It Feels

Key Features

Tech Stack

Getting Started

Prerequisites

Install & Build

Environment Setup

Local Development

Seed Default Schemas

Running Tests

API Documentation

Deployment

Architecture

Project Structure

Frontend

Contributing

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Mycel

How It Feels

Key Features

Tech Stack

Getting Started

Prerequisites

Install & Build

Environment Setup

Local Development

Seed Default Schemas

Running Tests

API Documentation

Deployment

Architecture

Project Structure

Frontend

Contributing

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages