Description
@smallest-admin We are experiencing inconsistent pronunciation issues when generating audio using the API with voice_id="shweta".
For example, the word "country" is sometimes pronounced correctly, but other times it is truncated to "cont".
However, when I generate the same audio through the web interface, the pronunciation is consistently correct.
My code snippet:
import os

# `client` is an already-initialized API client; `output_path` is an existing directory.
client.synthesize(
    save_as=os.path.join(output_path, "test.wav"),
    add_wav_header=True,
    cfg=0,
    is_pvc=False,
    nfe_step=16,
    sample_rate=24000,
    save_history=True,
    speed=1,
    transliterate=False,
    text="Only a very small minority of the people of this country believe in gambling as a substitute for the old philosophy of Benjamin Franklin",
    voice_id="shweta"
)
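To show how often the output varies, the same text can be synthesized several times and the results compared. The following is only a rough sketch under stated assumptions: `synthesize_fn` is a hypothetical wrapper (not part of the SDK) that takes the text and returns the generated WAV bytes, for example by calling the snippet above and reading the saved file back. A truncated word such as "cont" would likely show up as a shorter duration, while differing hashes only indicate that the audio is not byte-identical between runs.

```python
import hashlib
import io
import wave
from typing import Callable, List, Tuple


def wav_duration_seconds(wav_bytes: bytes) -> float:
    """Return the duration in seconds of a WAV file held in memory."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def compare_runs(
    synthesize_fn: Callable[[str], bytes],  # hypothetical wrapper, see lead-in
    text: str,
    runs: int = 5,
) -> List[Tuple[str, float]]:
    """Synthesize `text` `runs` times; return (sha256 prefix, duration) per run."""
    results = []
    for _ in range(runs):
        audio = synthesize_fn(text)
        digest = hashlib.sha256(audio).hexdigest()[:12]
        duration = round(wav_duration_seconds(audio), 3)
        results.append((digest, duration))
    return results
```

Printing the returned list for the sentence above makes it easy to see whether some runs are noticeably shorter than others.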
Could you suggest a way to make the API output match the web interface output?
Audio files: https://drive.google.com/drive/folders/1hZf6LBLh74dIdNSuvYIAlqHraOH68rmC?usp=sharing