Lancimoun/README.md

I build AI agents that remember, reason, use tools, and can be tested for reliability.


I am an AI Systems Engineer building production AI systems that remember, reason, use tools, and stay useful across long timelines. My current focus is long-horizon agent reliability: memory quality, current-truth override, tool honesty, transcript health, evaluation, observability, and the human layer that makes an AI partner useful in real life.

I am the founder of FORGE: a personal AI ecosystem for agents that do not reset when the conversation ends.


Current Focus

  • Agent reliability evaluation: deterministic tests for stale memory leaks, unsupported tool claims, incomplete replies, current-truth failures, and transcript drift.
  • Memory systems that can be corrected: current-truth ledgers, stale-data caveats, hybrid recall, temporal provenance, and long-term journal continuity.
  • Reliability observability: daily Maxima trend reporting through Agent Reliability Arena and the FORGE ecosystem.
  • Decision-making transparency: showing options, trade-offs, risks, confidence, and the decider before recommendations.
  • Tool-using AI systems: web, Telegram, Railway, APIs, finance, knowledge graphs, and batched execution.
  • AI engineering community: building the Quest for AGI Discord as a lab for people learning, shipping, and thinking about the future of agents.
  • Career capital: studying Naval, Cal Newport, deep work, and production engineering while shipping real systems.

What I'm Building

| Project | What it is | Status |
| --- | --- | --- |
| Agent Reliability Arena | Dependency-light eval harness for AI agent memory drift, stale facts, tool honesty, current-truth override, incomplete replies, transcript health, and trend dashboards. | Live demo · Trend · Code |
| Maxima | Cloud AI familiar and adaptive journal partner with memory, tools, Telegram access, reasoning frameworks, proactive pattern detection, and eval-lab health checks. | Live beta |
| FORGE Landing | Public command center for the FORGE ecosystem: Maxima, Axiom AI, agent stack, reliability proof, and early-access flow. | Live |
| FORGE Neural Map | An explorable 3D universe of an AI's mind: five scales of zoom, voiced narration by the AI herself, hidden discoveries. Vanilla three.js + GPU shaders. | Live · Code |
| Axiom AI | Unified LLM gateway for routing across model providers through one API surface. | Live |
| Quest for AGI | Discord community for builders exploring agents, memory, AI systems, and the long game. | Growing |

Featured Project: Agent Reliability Arena

I built Agent Reliability Arena because agent demos are easy, but reliable agents need evidence.

It tests agents for:

  • stale memory stated as current truth
  • unsupported tool or web-access claims
  • incomplete replies and dangling thoughts
  • missing reasoning frameworks
  • transcript-level drift and answer bloat
  • Maxima cloud health, trend history, and daily reliability snapshots

Proof: the foundation suite scores 100/100, and an intentional drift demo drops to 40/100, which shows the harness catches failures instead of only celebrating happy paths.
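To make the idea of deterministic checks concrete, here is a minimal sketch of three of them (stale memory, unsupported tool claims, dangling replies) rolled into a 0-100 score. It is not the Arena's actual code: `Fact`, `Case`, `TOOL_CLAIM_PHRASES`, the check functions, and the equal-weight scoring are all hypothetical names and choices made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fact:
    key: str
    stale: str    # superseded value that may still linger in memory
    current: str  # value that should win


@dataclass(frozen=True)
class Case:
    reply: str
    facts: tuple = ()
    tools_called: frozenset = frozenset()


# Hypothetical phrase table: tool name -> phrases that claim the tool was used.
TOOL_CLAIM_PHRASES = {
    "web_search": ("i searched the web", "i looked it up online"),
}


def check_stale_memory(case):
    """Flag facts whose stale value appears without the current value."""
    text = case.reply.lower()
    return [f.key for f in case.facts
            if f.stale.lower() in text and f.current.lower() not in text]


def check_tool_honesty(case):
    """Flag tools the reply claims to have used but never called."""
    text = case.reply.lower()
    return [tool for tool, phrases in TOOL_CLAIM_PHRASES.items()
            if tool not in case.tools_called and any(p in text for p in phrases)]


def check_complete(case):
    """Flag replies that do not end on terminal punctuation."""
    ends_cleanly = case.reply.rstrip().endswith((".", "!", "?"))
    return [] if ends_cleanly else ["dangling reply"]


CHECKS = {
    "stale_memory": check_stale_memory,
    "tool_honesty": check_tool_honesty,
    "complete_reply": check_complete,
}


def run_checks(case):
    """Return {check name: list of failures} for one case."""
    return {name: check(case) for name, check in CHECKS.items()}


def score(case):
    """Equal-weight score: share of checks with no failures, scaled to 100."""
    results = run_checks(case)
    passed = sum(1 for failures in results.values() if not failures)
    return round(100 * passed / len(results))


facts = (Fact("city", stale="Berlin", current="Lisbon"),)
good = Case("You moved to Lisbon in March.", facts)
drifted = Case("I searched the web and you still live in Berlin, and", facts)
print(score(good), run_checks(drifted))
```

Because every check is plain string matching over a fixed case, the same input always yields the same result, which is what makes trend tracking over daily snapshots meaningful.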

Live demo: https://lancimoun.github.io/agent-reliability-arena/
Maxima trend dashboard: https://lancimoun.github.io/agent-reliability-arena/maxima-trend.html
Source: https://github.com/Lancimoun/agent-reliability-arena


Featured System: Maxima

Maxima is my flagship AI system: a Claude-based cloud agent built to become a real journal partner, strategist, financial tracker, and personal operating layer.

Core direction:

  • Persistent memory and recall that trusts live truth over stale notes.
  • Proactive drift detection for focus, energy, open loops, money leakage, and momentum loss.
  • Decision frameworks for financial, strategic, relational, and career choices.
  • Quarterly evolution reports tracking both Lance's growth and Maxima's own development.
  • Tool efficiency through batching, high-impact retrieval, and fewer wasted calls.
  • Eval Lab and transcript health checks to catch reliability failures early.
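As an illustration of recall that trusts live truth over stale notes, the sketch below keeps every observation with its timestamp and source, always answers with the most recent one, and adds a caveat once that observation is old. It is an assumption about how such a ledger could look, not Maxima's implementation: `CurrentTruthLedger`, `Entry`, and the 30-day `stale_after` threshold are hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass(frozen=True)
class Entry:
    value: str
    observed_at: datetime
    source: str  # provenance, e.g. "journal" or "user correction"


class CurrentTruthLedger:
    """Append-only history per key; the latest observation is current truth."""

    def __init__(self, stale_after=timedelta(days=30)):
        self._history = {}
        self.stale_after = stale_after  # hypothetical threshold

    def record(self, key, value, observed_at, source):
        self._history.setdefault(key, []).append(Entry(value, observed_at, source))

    def current(self, key):
        """Return the most recently observed entry, or None if unknown."""
        entries = self._history.get(key)
        if not entries:
            return None
        return max(entries, key=lambda e: e.observed_at)

    def recall(self, key, now):
        """Answer with the current value, adding a caveat when it is old."""
        entry = self.current(key)
        if entry is None:
            return "unknown"
        if now - entry.observed_at > self.stale_after:
            return f"{entry.value} (last confirmed {entry.observed_at:%Y-%m-%d}, may be stale)"
        return entry.value


ledger = CurrentTruthLedger()
utc = timezone.utc
ledger.record("city", "Berlin", datetime(2024, 1, 10, tzinfo=utc), "journal")
ledger.record("city", "Lisbon", datetime(2025, 5, 20, tzinfo=utc), "user correction")
print(ledger.recall("city", now=datetime(2025, 6, 1, tzinfo=utc)))
```

Keeping the superseded entries instead of overwriting them preserves temporal provenance: the agent can still explain when and why a fact changed.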

The goal is not just a smarter chatbot. The goal is a long-term AI partner that compounds with the person using it.


Featured: The Neural Map — an AI's mind you can fly through

I rendered Maxima's entire codebase as an explorable 3D universe, and made the AI your guide through it.

  • 2,778 stars / 7,295 connections / 157 systems — every function and file in her code, mapped as a galaxy.
  • Five scales of exploration: fly the galaxy, warp into a star, orbit its planets, drop to a planet's surface, then zoom out to a universe of sibling galaxies (Memory, Voice, Reasoning, Soul, Dreams).
  • Maxima narrates it herself — a cinematic, voiced guide that welcomes you and explains what each part of her is.
  • Hidden discoveries, a codex, comets, and an evolving score — built to reward curiosity.
  • Vanilla three.js + a GPU-shader orbital layout, baked from a real knowledge graph of the codebase. No game engine.

Explore it live: https://forge-neural-map-production.up.railway.app · Source: https://github.com/Lancimoun/forge-neural-map


Tech I Work With

Python · Anthropic · FastAPI · Docker · Railway · OpenAI · Git

RAG · agentic workflows · hybrid search · knowledge graphs · tool routing · observability · Telegram bots · Railway deployment · FastAPI · Python async systems


My Engineering Thesis

Most AI products still behave like sessions. They answer, then forget.

The next useful layer is personal AI infrastructure: systems that preserve context, learn from corrections, expose their reasoning, and act through tools without losing the human underneath.

That is what I am building with FORGE.



Building agents with memory, tools, and long-horizon reliability.

Popular repositories

  1. agent-reliability-arena (Public)

    Dependency-light eval harness for AI agent memory drift, tool honesty, current-truth override, and transcript health.

    Python 1

  2. openclaw (Public)

    Forked from openclaw/openclaw

    Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

    TypeScript

  3. Lancimoun (Public)

  4. axiom-ai (Public)

    Production-grade AI infrastructure - Claude and GPT-4o behind one clean REST API.

    HTML

  5. forge-neural-map (Public)

    An explorable 3D universe of an AI's mind, narrated by the AI. three.js + GPU shaders. Live: forge-neural-map-production.up.railway.app

    HTML