This repository was archived by the owner on Aug 28, 2024. It is now read-only.

Commit 16c4492

README fix for SpeechRecognition, taking a free ride in the PR for ImageSegmentation
1 parent 3e720e7 commit 16c4492

1 file changed: +4 -3 lines changed

SpeechRecognition/README.md

Lines changed: 4 additions & 3 deletions
@@ -31,12 +31,13 @@ Be aware that the downloadable model file was created with PyTorch 1.9.0 and tor
 
 ### 2. Prepare the Model
 
-To install PyTorch 1.9.0 and torchvision 0.10.0, you can do something like this:
+To install PyTorch 1.9.0, torchaudio 0.9.0 and the Hugging Face transformers, you can do something like this:
 
 ```
 conda create -n wav2vec2 python=3.8.5
 conda activate wav2vec2
-pip install torch torchvision
+pip install torch torchaudio
+pip install transformers
 ```
 
 Now with PyTorch 1.9.0 and torchaudio 0.9.0 installed, run the following commands on a Terminal:
@@ -58,7 +59,7 @@ open SpeechRecognition.xcworkspace/
 
 ### 3. Build and run with Xcode
 
-After the app runs, tap the Start button and start saying something; after 12 seconds (you can change `private let AUDIO_LEN_IN_SECOND = 12` in `ViewController.swift` for the recording length), the model will infer to recognize your speech. Some example results are as follows:
+After the app runs, tap the Start button and start saying something; after 12 seconds (you can change `private let AUDIO_LEN_IN_SECOND = 12` in `ViewController.swift` for a longer or shorter recording length), the model will infer to recognize your speech. Some example results are as follows:
 
 ![](screenshot1.png)
 ![](screenshot2.png)
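
Note on the install commands in the first hunk: the README text names PyTorch 1.9.0 and torchaudio 0.9.0, but the `pip install` lines carry no version specifiers, so a fresh environment would likely receive whatever releases are current at install time. Pinning (for example `pip install torch==1.9.0 torchaudio==0.9.0`) is one way to match the text. The sketch below is not part of the commit; it is a minimal, hedged example of checking an environment against the versions the README names. `EXPECTED` and `check_versions` are hypothetical names introduced here, and treating `transformers` as "any version" is an assumption, since the README gives no version for it.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions named in the README text. The README gives no version for
# transformers, so None is used here to mean "any version" (an assumption).
EXPECTED = {"torch": "1.9.0", "torchaudio": "0.9.0", "transformers": None}


def check_versions(expected, get_version=version):
    """Return a dict mapping each package name to a short status string."""
    report = {}
    for name, wanted in expected.items():
        try:
            found = get_version(name)
        except PackageNotFoundError:
            report[name] = "missing"
            continue
        # Drop a local build suffix such as "+cpu" before comparing.
        base = found.split("+")[0]
        if wanted is None or base == wanted:
            report[name] = "ok ({})".format(found)
        else:
            report[name] = "mismatch (found {}, expected {})".format(found, wanted)
    return report


if __name__ == "__main__":
    for pkg, status in check_versions(EXPECTED).items():
        print(pkg + ": " + status)
```

Running the script inside the activated `wav2vec2` environment prints one status line per package. The `get_version` parameter exists only so the check can be exercised without the packages installed.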
