How is inference performed using a cascaded approach to generate speech from text? #64

@talkking

Ming/test_audio_tasks.py

Lines 159 to 161 in 480df09

for tts_speech, text_list in model.talker.omni_audio_generation(
    output_text, audio_detokenizer=audio_detokenizer, thinker_reply_part=thinker_reply_part, speaker=speaker, stream=stream, **spk_input
):

Are you using a cascaded approach to generate the speech?
