An AI-powered developer assistant that analyzes GitHub repositories and answers questions about the codebase using Retrieval-Augmented Generation (RAG).
The system indexes repository files, performs semantic code search using embeddings and FAISS, and generates explanations with Google Gemini through a React chat interface.
- AI-powered codebase understanding
- Index any public GitHub repository
- Semantic code search using embeddings
- FAISS vector database for fast retrieval
- Google Gemini LLM for explanation generation
- Chat interface for multi-turn questions
- Source file references for answers
- Code snippet highlighting from retrieved context
User (React Chat UI)
↓
FastAPI Backend
↓
Repository Ingestion
↓
Code Parsing + Chunking
↓
Embedding Model
↓
FAISS Vector Database
↓
Semantic Code Search
↓
Gemini LLM
↓
Answer + Sources + Code Snippets
This architecture follows the Retrieval-Augmented Generation (RAG) pattern used in modern AI applications.
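The retrieval half of the pipeline above can be sketched in pure Python. This is a toy stand-in: a bag-of-words counter plays the role of the Sentence Transformers embedding model, and a plain list with cosine ranking plays the role of the FAISS index; the file paths and chunk texts are made up for illustration.

```python
import math
from collections import Counter

def embed(text):
    """Toy bag-of-words 'embedding' standing in for a real embedding model."""
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a if t in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

# Hypothetical indexed chunks; in the real system these come from the cloned repo.
chunks = [
    ("auth/security.py", "class OAuth2PasswordBearer handles token authentication"),
    ("routing/router.py", "APIRouter registers path operations"),
]
index = [(path, embed(text)) for path, text in chunks]  # stands in for FAISS

def retrieve(question, k=1):
    """Return the paths of the k chunks most similar to the question."""
    q = embed(question)
    ranked = sorted(index, key=lambda item: cosine(q, item[1]), reverse=True)
    return [path for path, _ in ranked[:k]]

print(retrieve("where is authentication handled?"))  # → ['auth/security.py']
```

In the real service the retrieved chunk texts, not just the paths, are passed to the LLM as grounding context.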
Frontend
- React
- JavaScript
Backend
- FastAPI
- Python
AI / ML
- Sentence Transformers
- FAISS Vector Database
- Google Gemini API
Other Tools
- GitHub repository ingestion
- LangChain components
- REST API architecture
1. User provides a GitHub repository URL.
2. The backend clones the repository.
3. Code files are parsed and split into smaller chunks.
4. Each chunk is converted into vector embeddings.
5. Embeddings are stored in a FAISS vector database.
6. When a question is asked:
   - FAISS retrieves the most relevant code chunks.
   - The retrieved code is sent to the Gemini LLM.
7. The AI generates an explanation with source references and code snippets.
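Step 3 can be sketched with a simple line-based chunker. This is one common strategy, not necessarily what the service does; a production splitter might break on function or class boundaries instead. The overlap keeps context that straddles a chunk edge retrievable from both sides.

```python
def chunk_code(source: str, max_lines: int = 40, overlap: int = 5):
    """Split a source file into overlapping line-based chunks.

    Returns a list of dicts with the 1-based starting line of each chunk,
    so answers can cite where in the file a snippet came from.
    """
    lines = source.splitlines()
    step = max_lines - overlap
    chunks = []
    for start in range(0, max(len(lines), 1), step):
        text = "\n".join(lines[start:start + max_lines])
        if text:
            chunks.append({"start_line": start + 1, "text": text})
    return chunks
```

Each chunk is then embedded and stored alongside its file path and `start_line`, which is what makes source references possible later.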
Enter a GitHub repository URL:
https://github.com/tiangolo/fastapi
Click Index Repository.
Example questions:
Where is authentication implemented?
How does token validation work?
Explain the dependency injection system.
AI:
Authentication is implemented in the FastAPI security module.
Sources:
fastapi/security.py
Code Snippet:
class OAuth2PasswordBearer:
    def __init__(self, tokenUrl: str):
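An answer like the one above is produced by wrapping the retrieved chunks in a grounding prompt before calling the LLM. A minimal sketch of that assembly step (the exact prompt wording used by the service is an assumption here):

```python
def build_prompt(question: str, retrieved: list[tuple[str, str]]) -> str:
    """Assemble the grounding prompt sent to the LLM.

    `retrieved` is a list of (file_path, snippet) pairs returned by the
    vector search; citing the paths in the context is what lets the model
    reference its sources.
    """
    context = "\n\n".join(
        f"# File: {path}\n{snippet}" for path, snippet in retrieved
    )
    return (
        "You are a codebase assistant. Answer using only the context below, "
        "and cite the file paths you used.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )
```

The returned string is what gets passed to the Gemini API as the user message.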
ai-codebase-explainer
│
├── app
│ ├── api
│ │ └── routes.py
│ ├── services
│ │ ├── repo_service.py
│ │ ├── query_service.py
│ │ └── embedding_service.py
│ ├── vectorstore
│ │ └── faiss_store.py
│ └── main.py
│
├── frontend
│ └── React application
│
├── vector_db
│ └── FAISS index storage
│
├── repos
│ └── cloned repositories
│
├── requirements.txt
└── README.md
git clone https://github.com/znixxx30/ai-codebase-explainer.git
cd ai-codebase-explainer
Create a virtual environment:
python -m venv venv
Activate it (Windows):
venv\Scripts\activate
On macOS/Linux:
source venv/bin/activate
Install dependencies:
pip install -r requirements.txt
Create a .env file:
GEMINI_API_KEY=your_api_key_here
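The backend needs to read this key at startup. Projects typically use the `python-dotenv` package for this; a minimal stdlib stand-in that does the same job looks like:

```python
import os

def load_dotenv(path: str = ".env") -> None:
    """Read KEY=VALUE lines from a .env file into os.environ.

    Skips blank lines and comments, and does not overwrite variables
    that are already set in the environment.
    """
    with open(path) as fh:
        for line in fh:
            line = line.strip()
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            os.environ.setdefault(key.strip(), value.strip())
```

After loading, the key is available as `os.environ["GEMINI_API_KEY"]` for the Gemini client configuration.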
Run the backend:
uvicorn app.main:app --reload
Run the frontend:
cd frontend
npm install
npm start
Potential upgrades:
- Streaming AI responses
- Improved UI styling
- Repository caching
- Code syntax highlighting
- Support for private repositories
Built an AI-powered developer assistant using Retrieval-Augmented Generation (RAG). The system indexes GitHub repositories, performs semantic code search with embeddings and FAISS, and generates explanations with the Google Gemini LLM, all accessible through a React chat interface.
MIT License


