ADEA - Autonomous Data Engineer Agent

ADEA is an AI-driven data engineering platform that can:

generate SQL pipelines from natural-language prompts
execute those pipelines in DuckDB
monitor failures and anomalies
diagnose root causes
repair pipelines with a hybrid LLM + fallback strategy
retry execution automatically
suggest optimization improvements
visualize pipeline lineage and live execution flow

This repo contains both:

the Python backend (adea/)
the Next.js frontend dashboard (frontend/)

Core Stack

Backend:

Python 3.11
FastAPI
LangGraph
DuckDB
SQLAlchemy
Pydantic
Groq SDK
FAISS
SQLGlot

Frontend:

Next.js 14
React 18
TypeScript
Tailwind CSS
Framer Motion
React Flow
Recharts
SWR
Zustand

Project Structure

ADEA/
├─ adea/
│  ├─ agents/
│  ├─ api/
│  ├─ app/
│  ├─ database/
│  ├─ interface/
│  ├─ llm/
│  ├─ memory/
│  ├─ monitoring/
│  ├─ orchestration/
│  ├─ pipelines/
│  └─ utils/
├─ frontend/
│  ├─ app/
│  ├─ components/
│  ├─ hooks/
│  ├─ lib/
│  └─ styles/
├─ requirements.txt
├─ run_adea.py
└─ test_pipeline.py

Before You Start

1. Python environment

Create and activate a virtual environment, then install backend dependencies:

python -m venv venv
.\venv\Scripts\activate
python -m pip install -r requirements.txt

2. Frontend dependencies

From the project root:

cd frontend
C:\nvm4w\nodejs\npm.cmd install

3. Environment variables

Create a local .env from .env.example.

At minimum, if you want live Groq-backed reasoning instead of fallback behavior:

GROQ_API_KEY=your_key_here

Notes:

.env is ignored by git
if GROQ_API_KEY is missing or unreachable, ADEA falls back safely to deterministic behavior

4. Graphviz

If you want PNG/SVG graph rendering, you need both:

the Python graphviz package
the Graphviz system binary (dot) available on PATH

Without the system binary, graph generation may fall back to text/DOT artifacts.

How To Run

Backend API

Start the FastAPI server:

python -m uvicorn adea.app.main:app --reload

Useful URLs:

Health check: http://127.0.0.1:8000/health
API docs: http://127.0.0.1:8000/docs

Frontend Dashboard

In a new terminal:

cd frontend
C:\nvm4w\nodejs\npm.cmd run dev

Open:

http://127.0.0.1:3000/dashboard

CLI Interface

Run the interactive CLI:

python run_adea.py

Demo Mode

Run the dashboard/CLI demo workflow:

python run_adea.py --demo

Backend Test Script

Run the local end-to-end pipeline test:

python test_pipeline.py

Typical Workflow For Contributors

Start the backend
Start the frontend
Open the dashboard
Run a pipeline prompt such as:

Build sales analytics pipeline

Watch:

live agent execution timeline
pipeline lineage graph
execution logs
optimization suggestions
pipeline history

Important Notes For Teammates

Hybrid AI behavior

ADEA is designed as an LLM-first system with safe fallbacks.

That means:

when Groq is reachable, agents use LLM reasoning
when Groq fails or times out, deterministic fallback logic keeps the workflow running

Memory and repair learning

The system stores successful repair experiences and can reuse them for similar failures later.

Temporary runtime artifacts

Some runtime-generated artifacts may appear during local work, such as:

pipeline graph files
temporary DuckDB files
analyzer reports under frontend/.next/analyze

These should not be committed unless intentionally needed.

Do not commit

Please do not commit:

.env
frontend/node_modules
frontend/.next
local virtual environments

The repo .gitignore already covers these.

Frontend Performance Tooling

Bundle analyzer:

cd frontend
C:\nvm4w\nodejs\npm.cmd run analyze

Analyzer reports are generated in:

frontend/.next/analyze/client.html
frontend/.next/analyze/nodejs.html
frontend/.next/analyze/edge.html

There is also a frontend performance guide here:

frontend/PERFORMANCE.md

Troubleshooting

Backend starts but frontend cannot load data

Check:

backend is running on 127.0.0.1:8000
frontend is running on 127.0.0.1:3000
CORS was not modified

Groq is not working

Check:

GROQ_API_KEY is set in .env
the machine has internet access
the Groq SDK is installed from requirements.txt

Graph PNG is not generated

Check:

Python graphviz is installed
Graphviz system binaries are installed
dot is on your PATH

Collaboration

If you are adding new features:

keep business logic out of FastAPI routes
do not change the PipelineState structure casually
keep agent responsibilities isolated
preserve the LangGraph orchestration flow

The project architecture rules are documented in:

AGENTS.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ADEA - Autonomous Data Engineer Agent

Core Stack

Project Structure

Before You Start

1. Python environment

2. Frontend dependencies

3. Environment variables

4. Graphviz

How To Run

Backend API

Frontend Dashboard

CLI Interface

Demo Mode

Backend Test Script

Typical Workflow For Contributors

Important Notes For Teammates

Hybrid AI behavior

Memory and repair learning

Temporary runtime artifacts

Do not commit

Frontend Performance Tooling

Troubleshooting

Backend starts but frontend cannot load data

Groq is not working

Graph PNG is not generated

Collaboration

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
adea		adea
frontend		frontend
.env.example		.env.example
.gitignore		.gitignore
AGENTS.md		AGENTS.md
README.md		README.md
requirements.txt		requirements.txt
run_adea.py		run_adea.py
test_pipeline.py		test_pipeline.py

Folders and files

Latest commit

History

Repository files navigation

ADEA - Autonomous Data Engineer Agent

Core Stack

Project Structure

Before You Start

1. Python environment

2. Frontend dependencies

3. Environment variables

4. Graphviz

How To Run

Backend API

Frontend Dashboard

CLI Interface

Demo Mode

Backend Test Script

Typical Workflow For Contributors

Important Notes For Teammates

Hybrid AI behavior

Memory and repair learning

Temporary runtime artifacts

Do not commit

Frontend Performance Tooling

Troubleshooting

Backend starts but frontend cannot load data

Groq is not working

Graph PNG is not generated

Collaboration

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages