Add WER metrics: Faster-Whisper, NeMo ASR, Facebook’s HuBERT-large-finetuned by whr-a · Pull Request #44 · wavlab-speech/versa

whr-a · 2025-06-23T18:58:45Z

This update adds three new WER metrics computed by different ASR models:

Faster-Whisper Large-v3 (faster_whisper_wer): Systran/faster-whisper-large-v3
NVIDIA Conformer-Transducer XLarge (nemo_wer): nvidia/stt_en_conformer_transducer_xlarge
Facebook HuBERT-Large-LS960-FT (hubert_wer): facebook/hubert-large-ls960-ft
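
For readers unfamiliar with the metric, here is a minimal sketch of how a word error rate can be computed once one of the ASR models above has produced a transcript. It is not the code from this PR: the function name `word_error_rate` is hypothetical, and the normalization (lowercasing and whitespace tokenization) is an assumption made for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return (substitutions + deletions + insertions) / number of reference words.

    Illustrative only: text normalization here is just lowercasing and
    whitespace splitting, which is an assumption, not the PR's behavior.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# One substituted word ("a" for "the") over six reference words.
score = word_error_rate("the cat sat on the mat", "the cat sat on a mat")
```

In this sketch the hypothesis string would come from transcribing the audio under evaluation with the chosen ASR model, and the reference string from the ground-truth text.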

ftshijt · 2025-06-24T04:57:55Z

setup.py

        "espnet_model_zoo",
        "discrete-speech-metrics @ git+https://github.com/ftshijt/DiscreteSpeechMetrics.git@v1.0.2",
        "cdpam",
+        "nemo_toolkit[asr]"


Thanks for fixing this. I'm a bit hesitant to put nemo here, as it does not always keep up with the latest versions of the other packages, and it is difficult to keep them aligned all the time. It would be better to put it in tools as an additional installer.
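
One common way to support a dependency that is installed separately rather than through setup.py is to import it lazily and fail with an actionable message. The sketch below illustrates that pattern under stated assumptions: `require_optional` and `load_nemo_asr` are hypothetical helpers, not functions in this repository, and the install hint simply reuses the package spec from the diff above.

```python
import importlib
from types import ModuleType


def require_optional(module_name: str, install_hint: str) -> ModuleType:
    """Import an optional dependency, or raise an error that says how to install it."""
    try:
        return importlib.import_module(module_name)
    except ImportError as exc:
        raise ImportError(
            f"'{module_name}' is required for this metric but is not installed. "
            f"{install_hint}"
        ) from exc


def load_nemo_asr() -> ModuleType:
    # Hypothetical helper: NeMo is only imported when the metric is requested,
    # so the core package can be installed and used without it.
    return require_optional(
        "nemo.collections.asr",
        'Install it separately, e.g. pip install "nemo_toolkit[asr]".',
    )
```

With this arrangement the NeMo-based metric would only fail at the point where it is actually requested, and the remaining metrics stay usable when NeMo is absent.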

ftshijt · 2025-06-24T04:58:17Z

setup.py

        "importlib-metadata",
        "kaggle",
        "kaldiio",
+        "jamo",


If this is a dependency for nemo, we can put it together with the installers.

I built a versa conda environment from scratch, and when I run the test test/testpipeline/test_general.py I encounter this issue. It seems to be a dependency inside espnet2, but it doesn’t appear to be listed in the setup.py at https://github.com/ftshijt/espnet/blob/espnet_inference/setup.py.

Oh, I see the point. Thanks for bringing it up. Sure, it should be fine then (I'll probably clean this up later hhh)

versa/corpus_metrics/faster_whisper_wer.py

versa/scorer_shared.py

ftshijt · 2025-06-24T18:41:46Z

Looks great! I will merge it after the CI test.

whr-a added 5 commits June 23, 2025 13:08

add faster-whisper wer

461f668

add: three metrics

db72a27

fix: copyright

50d10c5

fix: a small bug in scorer_shared/list_scoring

718b2ed

add: docs

c289243

ftshijt reviewed Jun 24, 2025


update

878ef17

whr-a wants to merge 6 commits into wavlab-speech:main from whr-a:main

2 participants
