ET Compass

LUNA for ET
The AI concierge front door to the Economic Times ecosystem

RAG-powered discovery, profile-aware routing, verified ET grounding, and selective concierge UI.

Important

This project is not trying to be “just another chatbot.” It is built around the ET hackathon brief: understand the user quickly, guide them into the right ET lane, remember their path, and expose more of the ET ecosystem than users typically discover on their own.

Table Of Contents

What This Project Is
Hackathon Fit
What We Built
Why The RAG Matters
How The RAG Works
How We Synthesize Answers
Dynamic Behaviors We Maintain
Selective Visual Philosophy
End-to-End Architecture
Complete Tech Stack
Frontend Experience
Voice AI Layer
Backend API Surface
Data, Ingestion, And Evaluation
Deployment
Local Setup
Repo Structure
Current Limitations
Stage-Wise Roadmap
Documentation Trail

What This Project Is

ET Compass is a full-stack prototype for the AI Concierge for ET problem statement.

The system is designed to:

understand each user's intent, sophistication, and goals
map them to the right Economic Times products and pathways
answer with grounded ET context instead of vague LLM fluency

In product terms:

Layer	Role
Next.js frontend	Landing page, login/signup, profile dashboard, ET concierge interface, selective widgets
FastAPI backend	API layer, RAG orchestration, market snapshot endpoint, voice endpoint, session/history APIs
LangGraph concierge graph	Profile extraction, routing, retrieval, answer generation, response shaping
Mongo-backed knowledge layer	Vector retrieval, ET source grounding, session persistence
Firebase auth layer	Persistent user identity on the frontend
Sarvam voice layer	Speech-to-text and text-to-speech on top of the same grounded ET answer path

This project addresses the ET hackathon brief by making LUNA:

A conversational ET welcome concierge
A product/pathway guide instead of a generic assistant
A profile-aware recommender
A source-grounded explainer
A future-ready foundation for cross-sell, financial-life navigation, and voice

What We Built

User-facing product surfaces

LUNA search / chat experience
User profile dashboard
Threaded conversation history
Voice-AI chat controls

Backend concierge capabilities

profile extraction from natural conversation
conservative onboarding questions only when necessary
hybrid ET retrieval from MongoDB
verification-aware source citations
structured journey-history storage
answer-style control by query type

Why The RAG Matters

This system works because the answer is not coming from “LLM memory alone.” The RAG layer is what keeps the product aligned with ET.

How The RAG Works

flowchart LR
    A[User Query] --> B[FastAPI /chat]
    B --> C[LangGraph Concierge Flow]
    C --> D[Profile Extraction]
    C --> E[Intent Routing]
    E --> F[Profiling]
    E --> G[Product Query]
    E --> H[News Guardrail]
    E --> I[Chitchat]
    G --> J[Hybrid Retrieval]
    J --> K[Mongo Vector Search]
    G --> L[ET Product Registry]
    G --> M[Verification Notes]
    G --> N[Gemini Response Generator]
    N --> O[Answer Style + Presentation Hints]
    O --> P[Structured API Response]
    P --> Q[Next.js Search UI]
    C --> R[Journey History Save]
    R --> S[Mongo Sessions]

Core backend stages

Stage	What happens
Profile extraction	Pulls profession, goal, intent, sophistication, and ET-affinity signals from natural language
Routing	Decides whether the turn is profiling, product query, news request, or chitchat
Retrieval	Fetches ET knowledge chunks, registry facts, and source metadata
Generation	Synthesizes an ET-grounded answer shaped for the exact question type
Presentation shaping	Decides whether the frontend should show products, roadmap, chips, visual panels, or none
History persistence	Saves route, answer, citations, visual hint, and profile snapshot into session history
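
The routing stage in the table above can be illustrated with a small rule-based sketch. The real router runs inside the LangGraph flow and may well be model-driven; the keyword lists and the `route_turn` name below are assumptions made for illustration only.

```python
from typing import Literal

Route = Literal["profiling", "product_query", "news", "chitchat"]

# Hypothetical keyword lists; the actual backend may use an LLM classifier instead.
NEWS_HINTS = ("latest news", "breaking", "headlines")
PRODUCT_HINTS = ("et edge", "masterclass", "print edition", "portfolio")
CHITCHAT_HINTS = ("hello", "hi", "thanks", "thank you", "how are you")


def route_turn(message: str, profile_complete: bool) -> Route:
    """Pick one of the four routes named in the stage table."""
    text = message.lower().strip()
    if any(hint in text for hint in NEWS_HINTS):
        return "news"
    if any(hint in text for hint in PRODUCT_HINTS):
        return "product_query"
    # Match greetings only at the start so "hi" does not fire inside other words.
    if any(text == hint or text.startswith(hint + " ") for hint in CHITCHAT_HINTS):
        return "chitchat"
    # With no stronger signal, keep profiling until the profile is usable.
    return "product_query" if profile_complete else "profiling"
```

Checking news first mirrors the guardrail idea: a live-news request is caught before it can reach retrieval.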

How We Synthesize Answers

This is one of the most important parts of the project.

The backend does not just retrieve chunks and dump them into the model.

It synthesizes across multiple layers:

1. Retrieved ET context

Knowledge chunks from the vector store provide:

product descriptions
source-specific ET facts
tool pages
event portals
benefits pages
app-store / surface metadata

2. Structured ET registry facts

The product registry gives the system a stable canonical map of:

official ET product names
aliases and normalization
category / fit
summaries
features
benefits
verification state

This is how LUNA stays consistent when users say things like:

ET Edge
ET edge events
masterclass
print edition
portfolio
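
A minimal sketch of how alias normalization against a registry could work is shown here. The canonical names, the alias lists, and the `resolve_product` helper are hypothetical; the project's actual registry schema is not shown in this README.

```python
import re
from typing import Optional

# Hypothetical registry: canonical product name -> known aliases.
REGISTRY_ALIASES = {
    "ET Edge": ["et edge", "et edge events"],
    "ET Masterclass": ["masterclass", "et masterclass"],
    "ET Print Edition": ["print edition"],
    "ET Portfolio": ["portfolio", "et portfolio"],
}


def _normalize(text: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    lowered = re.sub(r"[^a-z0-9 ]", " ", text.lower())
    return re.sub(r"\s+", " ", lowered).strip()


# Flat alias -> canonical lookup, built once at import time.
ALIAS_INDEX = {
    _normalize(alias): canonical
    for canonical, aliases in REGISTRY_ALIASES.items()
    for alias in aliases
}


def resolve_product(mention: str) -> Optional[str]:
    """Return the canonical name for a user mention, or None if it is unknown."""
    return ALIAS_INDEX.get(_normalize(mention))
```

For example, `resolve_product("ET edge events")` and `resolve_product("et edge")` both resolve to the same canonical entry, so downstream retrieval and recommendations see one stable product key.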

3. User profile state

The same answer changes depending on who is asking.

Examples:

a beginner/student should not get the same response shape as an active trader
a broad discovery question should not get the same response as a market-tools question
a trust-sensitive activation/pricing question should surface caution and verification

4. Verification notes

For ambiguous or sensitive ET facts, the assistant can explicitly say:

public pages show mixed signals
latest live checkout should be verified
activation eligibility must be confirmed

5. Answer-style control

The backend classifies the requested answer style and adapts the response accordingly:

Query pattern	Answer style
latest-news request	`brief`
roadmap / 5-day / step-by-step	`roadmap`
all ET products / ecosystem overview	`overview`
compare / vs / difference between	`compare`
deep dive / explain in detail	`detailed`
normal ET product question	`standard`

That means the system can produce:

a short refusal when live news is unsupported
a clean ET product overview
a structured roadmap
a compact comparison
a fuller explanation when explicitly requested
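
The mapping from query pattern to answer style can be sketched as an ordered rule list. The regular expressions below are assumptions derived from the table above, and `classify_answer_style` is a hypothetical name; the backend's actual classifier may work differently.

```python
import re

# Ordered (pattern, style) rules; the first match wins.
STYLE_RULES = [
    (r"\blatest news\b", "brief"),
    (r"\broadmap\b|\b\d+[- ]day\b|\bstep[- ]by[- ]step\b", "roadmap"),
    (r"\ball et products\b|\becosystem overview\b", "overview"),
    (r"\bcompare\b|\bvs\b|\bdifference between\b", "compare"),
    (r"\bdeep dive\b|\bin detail\b", "detailed"),
]


def classify_answer_style(query: str) -> str:
    """Return the answer style for a query, falling back to 'standard'."""
    text = query.lower()
    for pattern, style in STYLE_RULES:
        if re.search(pattern, text):
            return style
    return "standard"
```

Because the rules are ordered, a query that matches several patterns gets the style listed first, which keeps the news refusal ahead of everything else.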

End-To-End Architecture

flowchart TD
    A[Frontend User] --> B[Next.js App Router]
    B --> C[Firebase Auth]
    B --> D[Search UI]
    D --> E[FastAPI Backend]
    E --> F[LangGraph ET Concierge]
    F --> G[Gemini]
    F --> H[Mongo Knowledge Base]
    F --> I[Mongo Sessions]
    F --> J[ET Product Registry]
    E --> K[Market Snapshot Service]
    E --> L[Sarvam Voice APIs]
    K --> D
    L --> D
    I --> M[Profile Dashboard / Thread History]

Complete Tech Stack

Frontend

Technology	Role
Next.js 16	App router frontend
React 19	UI framework
TypeScript	Type safety
Tailwind CSS	Styling system
anime.js	SVG and motion effects
Firebase Web SDK	User auth and persisted session state
MediaRecorder API	Browser microphone capture for voice input

Backend

Technology	Role
FastAPI	HTTP API server
LangGraph	Multi-step RAG orchestration
LangChain Core	Messaging and model plumbing
Gemini / Google GenAI	Extraction and response generation
MongoDB Atlas	Vector knowledge base + sessions
langchain-mongodb	Mongo vector search integration
Pydantic	Request/response validation
yfinance	Structured market snapshot data for live-context widgets
Sarvam AI APIs	Speech-to-text and text-to-speech for Voice-AI
httpx	Async Sarvam API calls

Product and data layer

Layer	Purpose
ET source pack	Verified ET sources and allow-list
ET product registry	Canonical ET product definitions
Bootstrap chunk pack	Seed retrieval data
Evaluation prompts	Controlled ET benchmark questions

Frontend Experience

Landing page

ET Compass branded hero and product sections
ET ecosystem cards
intro video flow
logged-in avatar state
CTAs into search/profile

Search page

left thread rail
central conversation area
right concierge rail
selective widgets based on backend presentation hints
real-time style loader and LUNA animation
microphone button for Voice-AI turns using the same thread and ET session memory

Authentication

email/password signup
email/password login
Google sign-in
persisted signed-in session

Profile dashboard

persona snapshot
goal and sophistication
recent journey view
ET lane recommendations

Voice AI Layer

Voice mode is implemented as an extension of the current ET concierge, not a separate assistant.

The flow is:

browser records audio through the microphone
backend sends the audio to Sarvam STT
the returned transcript is passed into the same concierge_service.chat() RAG flow
the grounded ET answer is cleaned for speech playback
backend sends that final spoken script to Sarvam TTS
frontend plays the returned audio while still saving the turn in the normal thread history
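
The flow above can be sketched as a single function that wires the three stages together. The speech-to-text, chat, and text-to-speech steps are passed in as plain callables so that no Sarvam API details are assumed; `run_voice_turn`, `clean_for_speech`, and `VoiceTurn` are hypothetical names, and `chat` stands in for the `concierge_service.chat()` call mentioned above.

```python
import re
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoiceTurn:
    user_text: str      # transcript returned by speech-to-text
    answer: str         # grounded answer from the normal chat path
    spoken_answer: str  # answer cleaned for playback
    audio: bytes        # synthesized speech


def clean_for_speech(answer: str) -> str:
    """Drop URLs, bracketed citation markers, and markdown symbols before TTS."""
    text = re.sub(r"https?://\S+", "", answer)
    text = re.sub(r"\[\d+\]", "", text)
    text = re.sub(r"[*_`#]", "", text)
    return re.sub(r"\s+", " ", text).strip()


def run_voice_turn(
    audio_in: bytes,
    thread_id: str,
    stt: Callable[[bytes], str],
    chat: Callable[[str, str], str],
    tts: Callable[[str], bytes],
) -> VoiceTurn:
    """Run one voice turn through the same answer path as a text turn."""
    user_text = stt(audio_in)
    answer = chat(user_text, thread_id)
    spoken = clean_for_speech(answer)
    return VoiceTurn(user_text, answer, spoken, tts(spoken))
```

Keeping `chat` as the only place an answer is produced is what guarantees that voice cannot drift away from the text path.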

Why this design matters

voice does not bypass the current ET retrieval stack
voice does not invent a second answer path
the same profile memory, citations, and recommendation logic still apply
text and voice remain aligned inside the same thread

Backend API Surface

`POST /chat`

Sends a single user message into the ET concierge graph and returns:

answer
sources
source citations
recommended products
verification notes
answer style
presentation hints
visual hint
roadmap when appropriate
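
As a rough illustration, the response fields listed above could be modelled as follows. The backend uses Pydantic for validation; this sketch uses standard-library dataclasses to stay dependency-free, and every field name and type here is an assumption rather than the actual schema.

```python
from dataclasses import asdict, dataclass, field
from typing import Optional


@dataclass
class ChatResponse:
    answer: str
    sources: list[str] = field(default_factory=list)
    source_citations: list[dict] = field(default_factory=list)
    recommended_products: list[str] = field(default_factory=list)
    verification_notes: list[str] = field(default_factory=list)
    answer_style: str = "standard"
    presentation_hints: dict = field(default_factory=dict)
    visual_hint: Optional[str] = None
    roadmap: Optional[list[str]] = None  # populated only for roadmap-style answers


# Example payload for a roadmap-style turn (placeholder content).
example = ChatResponse(
    answer="Here is a three-step starting path.",
    answer_style="roadmap",
    roadmap=["Step 1", "Step 2", "Step 3"],
)
payload = asdict(example)
```

Leaving `roadmap` and `visual_hint` as optional lets the frontend skip rendering those widgets when the backend did not ask for them.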

`POST /chat/voice`

Accepts a recorded audio file and a thread_id, then returns:

user_text from Sarvam speech-to-text
the normal grounded ET answer payload
spoken_answer cleaned for voice playback
audio from Sarvam text-to-speech
used_rag flag so the voice turn can still be inspected as part of the ET answer path

`GET /sessions`

Returns saved session summaries for thread listing.

`GET /sessions/{session_id}`

Returns a full session document with:

profile snapshot
full journey history
stored turn metadata

`GET /market-snapshot`

Returns the structured live-context market panel data used by the frontend.

`GET /health`

Simple backend health status.

Data, Ingestion, And Evaluation

Ingestion pipeline

flowchart TD
    A[ET Source Files / Research Pack] --> B[Ingestion Scripts]
    B --> C[Cleaning + Normalization]
    C --> D[Chunking]
    D --> E[Embeddings]
    E --> F[Mongo Knowledge Base]
    A --> G[Product Registry]
    G --> H[Routing + Verification Layer]
    F --> I[Retriever]
    H --> I
    I --> J[Response Generator]

The ingestion path supports

bootstrap chunk ingestion
live ET source ingestion
registry-derived summary records
metadata-rich chunks
verification-aware source metadata
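
The cleaning and chunking steps, with metadata attached to each chunk, might look like the sketch below. The chunk size, the overlap, and the `Chunk` fields are assumptions; the embedding and MongoDB write steps are intentionally left out.

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    text: str
    source_url: str
    product: str
    verified: bool  # verification-aware flag carried with every chunk
    position: int   # order of the chunk within its source


def chunk_text(
    text: str,
    source_url: str,
    product: str,
    verified: bool,
    size: int = 400,
    overlap: int = 50,
) -> list[Chunk]:
    """Split cleaned text into overlapping character windows with metadata."""
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("need size > 0 and 0 <= overlap < size")
    cleaned = " ".join(text.split())  # normalize whitespace
    step = size - overlap
    chunks: list[Chunk] = []
    for position, start in enumerate(range(0, len(cleaned), step)):
        piece = cleaned[start:start + size]
        chunks.append(Chunk(piece, source_url, product, verified, position))
        if start + size >= len(cleaned):
            break  # the last window already reached the end of the text
    return chunks
```

Carrying `source_url` and `verified` on every chunk is what allows the response generator to cite sources and surface verification notes later.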

Evaluation goals

correct ET route selection
useful citations
lower hallucination risk
stronger ET product fit
safer handling of uncertainty
consistent answer shape

Example evaluation run

cd backend
source .venv/bin/activate
python scripts/run_et_eval.py --limit 40

Why evaluation matters here

RAG quality is not:

“one answer looked good once”

It is:

repeatable
benchmarked
failure-aware
improved with controlled passes
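
A minimal sketch of a repeatable, failure-aware check is shown below, here for route selection only. It is not the content of `scripts/run_et_eval.py`; `EvalCase`, `run_route_eval`, and the report keys are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class EvalCase:
    prompt: str
    expected_route: str


def run_route_eval(cases: list[EvalCase], predict_route: Callable[[str], str]) -> dict:
    """Score a route predictor against benchmark cases and keep every failure."""
    failures = []
    for case in cases:
        got = predict_route(case.prompt)
        if got != case.expected_route:
            failures.append(
                {"prompt": case.prompt, "expected": case.expected_route, "got": got}
            )
    total = len(cases)
    accuracy = (total - len(failures)) / total if total else 0.0
    return {"total": total, "accuracy": accuracy, "failures": failures}
```

Keeping the failing prompts in the report, rather than only the score, is what makes each controlled pass actionable.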

Deployment

Recommended platform split

Layer	Platform	Why
Frontend	Vercel	Best fit for Next.js, simplest preview/prod workflow
Backend	Render	Better fit for a persistent FastAPI + LangGraph + Mongo service than serverless Python functions

Why not put the backend on Vercel too?

Vercel is excellent for the Next.js app, but this backend behaves more like a long-lived application service than a tiny serverless function:

FastAPI API layer
LangGraph orchestration
Gemini model calls
MongoDB vector retrieval
session history persistence
market snapshot service

For this prototype, Render is the safer and simpler backend deployment target.

Frontend deployment on Vercel

Import the GitHub repo into Vercel
Let Vercel detect the project as Next.js
Keep the root directory as the repo root
Add the frontend env vars from .env.example
Set NEXT_PUBLIC_API_BASE_URL to your deployed Render backend URL
Deploy

Vercel frontend env vars

Variable	Purpose
`NEXT_PUBLIC_API_BASE_URL`	URL of the deployed FastAPI backend
`NEXT_PUBLIC_FIREBASE_API_KEY`	Firebase web config
`NEXT_PUBLIC_FIREBASE_AUTH_DOMAIN`	Firebase auth domain
`NEXT_PUBLIC_FIREBASE_PROJECT_ID`	Firebase project id
`NEXT_PUBLIC_FIREBASE_STORAGE_BUCKET`	Firebase storage bucket
`NEXT_PUBLIC_FIREBASE_MESSAGING_SENDER_ID`	Firebase messaging sender id
`NEXT_PUBLIC_FIREBASE_APP_ID`	Firebase app id
`NEXT_PUBLIC_FIREBASE_MEASUREMENT_ID`	Firebase analytics id

Backend deployment on Render

This repo includes a ready Render blueprint at render.yaml.

You can deploy either:

from the Render dashboard manually, or
from the blueprint config in the repo

Manual Render service settings

Setting	Value
Root Directory	`backend`
Runtime	`Python`
Build Command	`pip install -r requirements.txt`
Start Command	`uvicorn app.main:app --host 0.0.0.0 --port $PORT`
Health Check Path	`/health`

Render backend env vars

Use backend/.env.example as the base.

Variable	Required	Purpose
`GOOGLE_API_KEY`	Yes	Gemini / Google model access
`MONGODB_URI`	Yes	MongoDB Atlas connection
`SARVAM_API_KEY`	Recommended for Voice-AI	Sarvam speech-to-text and text-to-speech
`EMBEDDING_MODEL`	Yes	Embedding model name
`GOOGLE_CHAT_MODEL`	Recommended	Primary chat model
`MONGODB_DB_NAME`	Yes	Database name
`MONGODB_KNOWLEDGE_COLLECTION`	Yes	Knowledge collection
`MONGODB_PERSONA_COLLECTION`	Yes	Persona collection
`MONGODB_SESSIONS_COLLECTION`	Yes	Sessions collection
`MONGODB_VECTOR_INDEX`	Yes	Vector index name
`ALLOWED_ORIGINS`	Yes	Comma-separated frontend origins allowed by CORS

CORS for production

The backend now reads ALLOWED_ORIGINS from environment variables.

Example:

ALLOWED_ORIGINS=http://localhost:3000,http://127.0.0.1:3000,https://your-vercel-project.vercel.app

This is required so your deployed Vercel frontend can call the Render backend.
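
One way the comma-separated value could be parsed is sketched below. `parse_allowed_origins` is a hypothetical helper and the fallback origin is an assumption; the commented lines show how the resulting list is commonly handed to FastAPI's CORS middleware.

```python
import os


def parse_allowed_origins(raw: str) -> list[str]:
    """Split a comma-separated origin list, trimming blanks and trailing slashes."""
    origins: list[str] = []
    for item in raw.split(","):
        origin = item.strip().rstrip("/")
        if origin and origin not in origins:
            origins.append(origin)
    return origins


# Assumed fallback for local development when the variable is unset.
origins = parse_allowed_origins(
    os.environ.get("ALLOWED_ORIGINS", "http://localhost:3000")
)

# With FastAPI, the list would typically be passed to the CORS middleware:
#   from fastapi.middleware.cors import CORSMiddleware
#   app.add_middleware(
#       CORSMiddleware,
#       allow_origins=origins,
#       allow_credentials=True,
#       allow_methods=["*"],
#       allow_headers=["*"],
#   )
```

Stripping trailing slashes matters because browsers send the `Origin` header without one, so `https://your-vercel-project.vercel.app/` would never match.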

Firebase production checklist

In Firebase Console:

enable Email/Password
enable Google
add localhost to authorized domains for local development
add your Vercel production domain to authorized domains

Recommended go-live order

Deploy the backend to Render
Copy the Render URL
Set NEXT_PUBLIC_API_BASE_URL in Vercel
Deploy the frontend to Vercel
Add the Vercel domain to:
- Firebase authorized domains
- backend ALLOWED_ORIGINS

Deployment files included in this repo

File	Purpose
.env.example	Frontend env template
backend/.env.example	Backend env template
render.yaml	Render blueprint for the backend
vercel.json	Explicit Vercel frontend config marker

Local Setup

Frontend

npm install
npm run dev

Backend

cd backend
python -m venv .venv  # one-time: create the virtual environment if it does not exist yet
source .venv/bin/activate
pip install -r requirements.txt
uvicorn app.main:app --reload --host 127.0.0.1 --port 8000

Quick health check

curl http://127.0.0.1:8000/health

Environment notes

Frontend expects

NEXT_PUBLIC_API_BASE_URL
Firebase public config values

Backend expects

Google API key
embedding model name
MongoDB URI
ET knowledge already ingested or available for ingestion

Repo Structure

ET-Concierge/
├── backend/
│   ├── app/
│   │   ├── chatbot/
│   │   │   ├── agents.py
│   │   │   ├── graph.py
│   │   │   ├── retriever_service.py
│   │   │   ├── registry.py
│   │   │   ├── ingestion.py
│   │   │   ├── market_data.py
│   │   │   ├── service.py
│   │   │   └── state.py
│   │   └── main.py
│   ├── scripts/
│   ├── eval_results/
│   ├── EXPLANATION.md
│   └── requirements.txt
├── public/
├── src/
│   ├── app/
│   ├── components/
│   ├── content/
│   └── lib/
└── README.md

Documentation Trail

Root system overview: README.md
Plain-English backend change log: backend/EXPLANATION.md

Why this repo matters

This prototype demonstrates that the ET concierge idea is viable when three things are combined properly:

a grounded ET-specific RAG layer
a disciplined UI that knows when not to over-render
a product mindset that optimizes for guidance, not just answers

ET Compass
Built to help users discover more of ET through one intelligent conversation.
