A professional RAG (Retrieval-Augmented Generation) system built with FastAPI and LangChain. This assistant allows users to upload PDF documents and query them using OpenAI's LLMs with high accuracy and source citations.
- PDF Ingestion: Automated text extraction and chunking using
PyPDFandRecursiveCharacterTextSplitter. - Vector Search: High-performance similarity search powered by
ChromaDB. - RAG Chain: Advanced retrieval logic using
langchain-classicfor stable enterprise performance. - REST API: Fully documented interactive API endpoints via FastAPI (Swagger UI).
- Framework: FastAPI
- AI Orchestration: LangChain (v1.x)
- Database: ChromaDB (Vector Store)
- Environment Management: Poetry
-
Clone the repository:
git clone https://github.com/farihaNaqvi/enterprise-knowledge-assistant.git
-
Install dependencies:
poetry install
-
Configure Environment: Create a .env file and add your OPENAI_API_KEY.
-
Run the Application:
poetry run uvicorn app.main:app --reload