African Health Studio

A mental health support platform for Africa: accessible, compassionate, and culturally aware.

It is a real-time conversational audio system that uses Audio Flamingo 3 (AF3) for audio reasoning and Afro-TTS for speech synthesis.

Quick Start

Prerequisites

NVIDIA GPU (B200 or similar recommended)
Python 3.12
uv package manager
Audio Flamingo 3 model (local path configured)
Afro-TTS model (local path configured)

Installation

Install all dependencies (AF3 + TTS):

uv sync --all-extras

Or install specific groups:

# Only AF3 dependencies
uv sync --extra af3

# Only TTS dependencies
uv sync --extra tts

# Both
uv sync --extra af3 --extra tts

See ENVIRONMENT_SETUP.md for detailed dependency group information.

Configure model paths by editing backend/config.py:
- Audio Flamingo 3: AUDIO_FLAMINGO_MODEL_PATH
- Afro-TTS: AFRO_TTS_CONFIG_PATH, AFRO_TTS_CHECKPOINT_DIR, AFRO_TTS_SPEAKER_WAV

Running Locally

Start the backend server:

bash run_server.sh

Start the Next.js frontend:

cd frontend-next
npm run dev

Running on HiPerGator

See HIPERGATOR_SETUP.md for detailed instructions on running with a GPU.

Quick commands:

Interactive: bash run_server_interactive.sh
Production: sbatch run_server.slurm

Architecture

Pipeline: Web App Mic Audio → AF3 (audio→text reasoning) → Afro-TTS (text→voice)
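
As a rough illustration of the pipeline above, the three stages can be chained as plain callables. This is only a sketch: VoicePipeline, to_wav, reason, and synthesize are hypothetical names, and the lambdas are stubs standing in for the real converter, AF3, and Afro-TTS services.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoicePipeline:
    """Chains the stages: convert -> AF3 reasoning -> Afro-TTS synthesis."""

    to_wav: Callable[[bytes], bytes]    # mic audio -> WAV bytes (hypothetical)
    reason: Callable[[bytes], str]      # WAV bytes -> reply text (stands in for AF3)
    synthesize: Callable[[str], bytes]  # reply text -> speech (stands in for Afro-TTS)

    def respond(self, mic_audio: bytes) -> tuple[str, bytes]:
        wav = self.to_wav(mic_audio)
        reply_text = self.reason(wav)
        reply_audio = self.synthesize(reply_text)
        return reply_text, reply_audio


# Stub stages so the sketch runs without any model loaded.
pipeline = VoicePipeline(
    to_wav=lambda webm: b"WAV:" + webm,
    reason=lambda wav: f"heard {len(wav)} bytes",
    synthesize=lambda text: text.encode("utf-8"),
)

text, audio = pipeline.respond(b"abc")
print(text)  # heard 7 bytes
```

Keeping each stage behind a callable makes it possible to swap a stub for the real model when a GPU is available, without touching the orchestration code.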

Frontend: Next.js with MediaRecorder and WebSockets
Backend: FastAPI with WebSocket support
- Audio Flamingo 3: Local model loading for audio reasoning
- Afro-TTS: Text-to-speech with African accent
- Audio storage: Organized by date in data/audio/sessions/
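
The date-based audio storage mentioned above could be implemented with a small path helper like the one below. The directory data/audio/sessions/ comes from this README; the YYYY-MM-DD subdirectory, the file naming scheme, and the function name session_audio_path are assumptions made for illustration.

```python
from datetime import datetime, timezone
from pathlib import Path

AUDIO_ROOT = Path("data/audio/sessions")  # directory named in this README


def session_audio_path(session_id: str, when: datetime, root: Path = AUDIO_ROOT) -> Path:
    """Return <root>/<YYYY-MM-DD>/<session_id>_<HHMMSS>.wav (assumed layout)."""
    day = when.strftime("%Y-%m-%d")
    stamp = when.strftime("%H%M%S")
    return root / day / f"{session_id}_{stamp}.wav"


example = session_audio_path(
    "abc123", datetime(2025, 1, 31, 14, 5, 9, tzinfo=timezone.utc)
)
print(example.as_posix())  # data/audio/sessions/2025-01-31/abc123_140509.wav
```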

For detailed backend architecture, see backend/BACKEND_ARCHITECTURE.md.

Project Structure

├── backend/
│   ├── app/
│   │   ├── server.py              # FastAPI server
│   │   └── services/
│   │       ├── af3_inference.py   # Audio Flamingo 3
│   │       ├── tts_inference.py   # Afro-TTS
│   │       └── audio_converter.py # WebM to WAV
│   └── config.py                  # Configuration
├── frontend-next/                 # Next.js frontend
├── scripts/                       # Utility scripts
└── run_server.slurm               # SLURM batch script
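
The audio_converter.py entry above handles WebM to WAV conversion. One common way to do this is to shell out to ffmpeg, sketched below. Whether the project actually uses ffmpeg is not stated here, so treat the function names and the 16 kHz mono target as assumptions.

```python
import subprocess
from pathlib import Path


def ffmpeg_webm_to_wav_cmd(src: Path, dst: Path, sample_rate: int = 16000) -> list[str]:
    """Build an ffmpeg command that decodes src to mono 16-bit PCM WAV."""
    return [
        "ffmpeg",
        "-y",                      # overwrite dst if it already exists
        "-i", str(src),            # input file, e.g. a MediaRecorder WebM clip
        "-ac", "1",                # downmix to a single channel
        "-ar", str(sample_rate),   # resample to the target rate (assumed)
        "-c:a", "pcm_s16le",       # 16-bit little-endian PCM
        str(dst),
    ]


def convert_webm_to_wav(src: Path, dst: Path) -> None:
    """Run the conversion; raises CalledProcessError if ffmpeg fails."""
    subprocess.run(ffmpeg_webm_to_wav_cmd(src, dst), check=True, capture_output=True)
```

Separating the command builder from the subprocess call keeps the argument list testable on machines where ffmpeg is not installed.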

Configuration

Model paths are configured in backend/config.py:

AUDIO_FLAMINGO_MODEL_PATH: Path to Audio Flamingo 3 model
AFRO_TTS_CONFIG_PATH: Path to Afro-TTS config.json
AFRO_TTS_CHECKPOINT_DIR: Path to Afro-TTS checkpoint directory
AFRO_TTS_SPEAKER_WAV: Path to speaker reference audio (6 seconds)
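
A minimal sketch of what these settings might look like, together with a startup check that reports any path that does not exist. The four variable names come from this README; the placeholder paths and the missing_paths helper are hypothetical and not taken from backend/config.py.

```python
from pathlib import Path

# Placeholder locations; replace with wherever the models actually live.
AUDIO_FLAMINGO_MODEL_PATH = Path("/path/to/audio-flamingo-3")
AFRO_TTS_CONFIG_PATH = Path("/path/to/afro-tts/config.json")
AFRO_TTS_CHECKPOINT_DIR = Path("/path/to/afro-tts")
AFRO_TTS_SPEAKER_WAV = Path("/path/to/speaker_reference.wav")


def missing_paths(paths: dict[str, Path]) -> list[str]:
    """Return the names of settings whose path does not exist on disk."""
    return [name for name, path in paths.items() if not path.exists()]


settings = {
    "AUDIO_FLAMINGO_MODEL_PATH": AUDIO_FLAMINGO_MODEL_PATH,
    "AFRO_TTS_CONFIG_PATH": AFRO_TTS_CONFIG_PATH,
    "AFRO_TTS_CHECKPOINT_DIR": AFRO_TTS_CHECKPOINT_DIR,
    "AFRO_TTS_SPEAKER_WAV": AFRO_TTS_SPEAKER_WAV,
}

# Prints the settings that still point at nonexistent locations.
print(missing_paths(settings))
```

Running a check like this before loading the models gives a clear error message instead of a failure deep inside model initialization.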

Development

The system uses:

uv for Python package management
Next.js for the frontend
FastAPI for the backend API
WebSockets for real-time audio streaming

See backend/BACKEND_ARCHITECTURE.md for detailed technical documentation.
