Commit 80f3f2a
committed
fix: SenseVoice+spk_model crash and torchaudio>=2.11 compatibility
1. auto_model.py: When ASR model (e.g. SenseVoice) doesn't produce
timestamps, automatically fall back to vad_segment mode for speaker
diarization instead of crashing. Fixes #2945.
2. load_utils.py: Add soundfile as fallback when torchaudio.load fails
(torchaudio>=2.11 requires torchcodec). Falls through to ffmpeg only
if soundfile also fails.1 parent 0d824c1 commit 80f3f2a
2 files changed
Lines changed: 14 additions & 4 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
819 | 819 | | |
820 | 820 | | |
821 | 821 | | |
| 822 | + | |
| 823 | + | |
| 824 | + | |
822 | 825 | | |
823 | 826 | | |
824 | 827 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
113 | 113 | | |
114 | 114 | | |
115 | 115 | | |
116 | | - | |
117 | | - | |
118 | | - | |
119 | | - | |
| 116 | + | |
| 117 | + | |
| 118 | + | |
| 119 | + | |
| 120 | + | |
| 121 | + | |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
120 | 127 | | |
121 | 128 | | |
122 | 129 | | |
| |||
0 commit comments