Deepseek-coder-qlora-sber-codehelper

QLoRA fine-tuning pipeline and lightweight API assistant for SQL/PySpark generation in internal audit scenarios.

Demo

What this repo includes

FastAPI endpoints: /health, /generate, /explain, /refactor
Retrieval over internal catalog.json and examples.json
Prompt builder for SQL/PySpark/Explain/Refactor
qLoRA synthetic training pipeline: generate -> train -> eval -> merge
Tracking integrations:
- W&B (optional)
- MLflow (optional)

Architecture

UI/Client -> FastAPI -> Retrieval -> Prompt Builder -> LLM

Quick Start

Local run

python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
cp .env.example .env
uvicorn app.main:app --host 0.0.0.0 --port 8000

Run UI in another terminal:

streamlit run ui/streamlit_app.py

Docker

cp .env.example .env
docker compose up --build

qLoRA Training

Recommended env

conda create -n audit-qlora python=3.11 -y
conda activate audit-qlora
pip install torch==2.4.1 transformers==4.45.2 peft==0.13.2 datasets==3.0.1 accelerate==1.0.1 sentencepiece==0.2.0 wandb==0.19.11 mlflow==2.16.2

1) Generate synthetic dataset

python training/data_gen/generate_synthetic.py --out-dir training/datasets --size 320 --edge-ratio 0.30

2) Train

python training/train_qlora.py \
  --base-model deepseek-ai/deepseek-coder-1.3b-instruct \
  --train-file training/datasets/train.jsonl \
  --val-file training/datasets/val.jsonl \
  --output-dir training/artifacts/adapter

3) Evaluate

python training/eval.py \
  --model-path training/artifacts/adapter \
  --base-model deepseek-ai/deepseek-coder-1.3b-instruct \
  --test-file training/datasets/test.jsonl \
  --out-dir training/artifacts/eval

4) Merge adapter with base model

python training/merge_lora.py \
  --base-model deepseek-ai/deepseek-coder-1.3b-instruct \
  --adapter-path training/artifacts/adapter \
  --output-path training/artifacts/merged

Tracking Setup

W&B

mkdir -p .secrets
cat > .secrets/wandb.env << 'EOF'
WANDB_API_KEY=your_key_here
WANDB_PROJECT=audit-code-assistant
WANDB_ENTITY=
USE_WANDB=true
EOF
chmod 600 .secrets/wandb.env

MLflow

Run local tracking server:

mlflow server --host 0.0.0.0 --port 5000

Or use local file backend:

export USE_MLFLOW=true
export MLFLOW_TRACKING_URI=file:./mlruns
export MLFLOW_EXPERIMENT_NAME=audit-code-assistant

Night Run

bash scripts/run_overnight_deepseek.sh

Tests

pytest -q
python scripts/smoke_test.py

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
app		app
docs/wandb		docs/wandb
scripts		scripts
tests		tests
training		training
ui		ui
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deepseek-coder-qlora-sber-codehelper

Demo

What this repo includes

Architecture

Quick Start

Local run

Docker

qLoRA Training

Recommended env

1) Generate synthetic dataset

2) Train

3) Evaluate

4) Merge adapter with base model

Tracking Setup

W&B

MLflow

Night Run

Tests

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Deepseek-coder-qlora-sber-codehelper

Demo

What this repo includes

Architecture

Quick Start

Local run

Docker

qLoRA Training

Recommended env

1) Generate synthetic dataset

2) Train

3) Evaluate

4) Merge adapter with base model

Tracking Setup

W&B

MLflow

Night Run

Tests

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages