Skip to content

PurpleMaia/kamailio

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 

Repository files navigation

Purpose

This project uses AI + LLMs to transform real-time audio into searchable transcripts, reports, and Q&A experiences.

  • Goal A (primary): Build a working pipeline for English audio ingestion → transcription → reports → Q&A.
  • Goal B (stretch): Extend the pipeline for Hawaiian audio, including transcription, translation, and bilingual analysis.

Key Concepts

  • ASR (Automatic Speech Recognition): Converts speech to text (e.g., Whisper).
  • ACR (Audio Content Recognition): Broader concept that includes identifying speakers, topics, or content type.
  • LLM Q&A / RAG (Retrieval-Augmented Generation): Use LLMs to answer questions by pulling relevant transcript passages.
  • Evaluation Metrics:
    • WER (Word Error Rate) – measures transcription accuracy in English.
    • CER (Character Error Rate) – important for Hawaiian because of ʻokina and kahakō.
    • MT Metrics (COMET/BLEU) – evaluate translation quality.

Progression

  1. Data Ingestion & Processing
  2. Transcription Pipeline
  3. Transcript Storage & Indexing
  4. Reporting Features
    • Generate an auto-summary of each run.
    • Produce chapter markers (topics/themes with timestamps).
    • Extract highlighted quotes and keywords.
    • Deliver example reports for at least 2 English podcasts.
  5. Q&A System (Goal A)
    • Implement a retrieval-augmented Q&A prototype:
      • Users ask questions.
      • Relevant transcript chunks are retrieved.
      • LLM answers, citing transcript + timestamps.
      • Demo with at least 3 English podcast episodes.
  6. Evaluation

About

Kamaʻilio: To converse | This project uses AI + LLMs to transform podcast audio files into searchable transcripts, reports, and Q&A experiences.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors