This project is an AI-powered medical diagnostic system that integrates Computer Vision (CV) and Natural Language Processing (NLP) to analyze medical images (X-rays, MRIs, CT scans) and generate diagnostic reports.
The workflow includes:
- Training CV models and exporting
.ptweights. - Building a Retrieval-Augmented Generation (RAG) pipeline using Pinecone for medical knowledge retrieval.
- A Flask backend for serving predictions and report generation.
- A React frontend for doctors/patients to interact with the system.
- YOLOv8 & CNN-based models for disease detection.
- NLP-powered diagnostic report generation with RAG.
- Pinecone vector database integration for retrieval.
- JPG + DICOM medical imaging support.
- Full-stack application: Flask backend + React frontend.
- PDF report generation with recommended tests.
📦 Project Root
│── 📁 CV/ # Model creation (training notebooks)
│ ├── model1.ipynb
│ ├── model2.ipynb
│ └── ...
│ └── (Outputs: *.pt files → move to /website/server/)
│
│── 📁 NLP/ # NLP & RAG setup
│ ├── RAG_Pinecodeloading_Sanjeevani.ipynb # Run this after Pinecone setup
│ └── dataset.zip # clinical notes dataset
|
│── 📁 website/
│ ├── 📁 client/ # React frontend
│ │ ├── package.json
│ │ └── src/
│ │
│ └── 📁 server/ # Flask backend
│ ├── app.py
│ ├── requirements.txt
│ └── *.pt (Model weights go here)
│
│── README.md
└── .gitignore
git clone https://github.com/DataGurus/Sanjeevani_AI.git
cd Sanjeevani_AI- Navigate to
CV/folder. - Run each
.ipynbfile (Jupyter/Colab). - Collect the generated
.ptfiles. - Move them into
/website/server/.
- Navigate to
NLP/. - Extract
datasets.zip. - Setup your Pinecone API key and environment variables.
- Run
RAG_Pinecodeloading_Sanjeevani.ipynbto build embeddings and upload them to Pinecone.
✅ Now your medical RAG database is ready.
cd website/server
pip install -r requirements.txt
python app.pycd website/client
npm install
npm start- Upload medical image (
.jpg/.dcm). - AI CV models analyze and classify diseases.
- NLP RAG system generates diagnostic reports + test recommendations.
- Download final PDF report.
- Add support for more diseases (multi-label classification).
- Multi-language reports.
- Integration with FHIR/HL7 medical standards.
- Cloud deployment (AWS/GCP/Azure).
For queries, reach out at:
📩 team.datagurus@gmail.com