june-oh/2023_AI_Academy_ASR

2023 AI Academy ASR

Jupyter Notebook, PyTorch, NVIDIA, Python, Markdown, Colab, Matplotlib, NumPy

๋”ฅ๋Ÿฌ๋‹์„ ์ด์šฉํ•œ ์Œ์„ฑ์ธ์‹ ๊ธฐ์ดˆ ์‹ค์Šต

0. Tutorial

Tutorial on Python and data science packages

  • Python review
  • NumPy
  • Matplotlib

1. Audio file handling

Audio file handling using torchaudio

  • Load an audio file (torchaudio.load)
  • Feature extraction (Mel-spectrogram, MFCC)

3. Audio Classification using MLP

Audio MNIST classification using an MLP (torch.nn.Linear)
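A minimal sketch of such an MLP classifier, built from torch.nn.Linear layers, might look like the following. The class name `MLPClassifier`, the feature shape (13 coefficients by 101 frames), the hidden size, and the random stand-in batch are hypothetical; the only thing assumed about the dataset is that it has 10 digit classes.

```python
import torch
import torch.nn as nn


class MLPClassifier(nn.Module):
    """Two-layer MLP over flattened audio features (hypothetical sizes)."""

    def __init__(self, in_dim: int, hidden_dim: int, num_classes: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Flatten(),                      # (batch, feat, frames) -> (batch, feat * frames)
            nn.Linear(in_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, num_classes),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x)


torch.manual_seed(0)

# Random stand-in batch: 8 utterances, 13 features x 101 frames each (assumed shape).
features = torch.randn(8, 13, 101)
labels = torch.randint(0, 10, (8,))  # assumed 10 digit classes

model = MLPClassifier(in_dim=13 * 101, hidden_dim=128, num_classes=10)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
criterion = nn.CrossEntropyLoss()

# One training step: forward pass, loss, backward pass, parameter update.
logits = model(features)
loss = criterion(logits, labels)
optimizer.zero_grad()
loss.backward()
optimizer.step()
print(logits.shape, loss.item())
```

Flattening discards the time structure of the features, which is acceptable for short fixed-length clips but requires every input to be padded or cropped to the same number of frames.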

4. CTC

Simple exercise on Connectionist Temporal Classification (CTC): training a model with CTC loss
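As a rough illustration of training with CTC loss, the sketch below computes torch.nn.CTCLoss on random stand-in model outputs and then applies the usual CTC collapsing rule (merge repeats, drop blanks) to a frame-level label sequence. The tensor sizes, the blank index of 0, and the helper name `greedy_ctc_collapse` are assumptions made for this example, not details from the notebook.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Assumed sizes: T frames, N utterances, C classes (index 0 is blank), S target labels.
T, N, C, S = 50, 4, 20, 10

# Random stand-in for acoustic model outputs, shaped (T, N, C).
logits = torch.randn(T, N, C, requires_grad=True)
log_probs = logits.log_softmax(dim=2)

# Targets must not contain the blank index, so labels are drawn from 1..C-1.
targets = torch.randint(low=1, high=C, size=(N, S), dtype=torch.long)
input_lengths = torch.full((N,), T, dtype=torch.long)
target_lengths = torch.full((N,), S, dtype=torch.long)

ctc_loss = nn.CTCLoss(blank=0, zero_infinity=True)
loss = ctc_loss(log_probs, targets, input_lengths, target_lengths)
loss.backward()  # gradients flow back to `logits`


def greedy_ctc_collapse(frame_ids, blank=0):
    """Collapse a frame-level label sequence: merge repeats, then drop blanks."""
    output, previous = [], None
    for label in frame_ids:
        if label != previous and label != blank:
            output.append(label)
        previous = label
    return output


# A blank between two identical labels keeps them as separate outputs.
decoded = greedy_ctc_collapse([0, 3, 3, 0, 3, 5, 5, 0])
print(loss.item(), decoded)
```

CTCLoss expects log-probabilities in (time, batch, classes) order, and each input length must be long enough to emit its target, including a blank between any repeated labels.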

5. Whisper

Exercise using OpenAI Whisper and Gradio

6. E2E ASR model fine-tuning with NeMo

QuartzNet model fine-tuning with NeMo (English to Korean)

7. WFST

Exercise on weighted finite-state transducers (WFSTs) using k2

  • C, L, G transducers
  • Composition, determinization
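To make the composition operation concrete, here is a small library-free sketch in plain Python. It deliberately does not use k2 and does not reproduce the notebook's C, L, or G graphs: the dictionary-based FST layout, the function names `compose` and `transduce`, and the two toy transducers are all hypothetical. It assumes epsilon-free transducers with tropical-semiring weights, so weights along a path are added.

```python
from collections import deque


def compose(a, b):
    """Compose two epsilon-free WFSTs (tropical semiring: path weights add).

    Each FST is a dict with "start", "finals" {state: weight} and
    "arcs" {state: [(ilabel, olabel, weight, next_state), ...]}.
    """
    start = (a["start"], b["start"])
    arcs, finals = {}, {}
    queue, seen = deque([start]), {start}
    while queue:
        state = queue.popleft()
        qa, qb = state
        arcs[state] = []
        # A pair state is final only if both component states are final.
        if qa in a["finals"] and qb in b["finals"]:
            finals[state] = a["finals"][qa] + b["finals"][qb]
        for ia, oa, wa, na in a["arcs"].get(qa, []):
            for ib, ob, wb, nb in b["arcs"].get(qb, []):
                if oa == ib:  # output of A must match input of B
                    nxt = (na, nb)
                    arcs[state].append((ia, ob, wa + wb, nxt))
                    if nxt not in seen:
                        seen.add(nxt)
                        queue.append(nxt)
    return {"start": start, "arcs": arcs, "finals": finals}


def transduce(fst, inputs):
    """Follow the first matching arc per symbol (assumes a deterministic FST)."""
    state, outputs, total = fst["start"], [], 0.0
    for symbol in inputs:
        match = next((arc for arc in fst["arcs"].get(state, []) if arc[0] == symbol), None)
        if match is None:
            return None
        _, olabel, weight, state = match
        outputs.append(olabel)
        total += weight
    if state not in fst["finals"]:
        return None
    return outputs, total + fst["finals"][state]


# Toy transducers: A maps "a b" to "x y"; B maps x -> 1 and y -> 2.
A = {"start": 0, "finals": {2: 0.0},
     "arcs": {0: [("a", "x", 0.5, 1)], 1: [("b", "y", 1.0, 2)]}}
B = {"start": 0, "finals": {0: 0.0},
     "arcs": {0: [("x", "1", 0.25, 0), ("y", "2", 0.5, 0)]}}

AB = compose(A, B)
result = transduce(AB, ["a", "b"])
print(result)
```

Only state pairs reachable from the start pair are built, which keeps the composed machine small. Handling epsilon labels correctly needs an extra filter, which is one reason to rely on a dedicated library for real decoding graphs.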
