flutter_edge_tts

English

A free Flutter TTS package powered by Microsoft Edge online speech synthesis.

flutter_edge_tts provides a shared Dart synthesis layer for Android, iOS, macOS, Windows, and Linux, with a consistent API for voice discovery, text and SSML synthesis, file output, and synthesis metadata.

Features

Free and open-source Flutter TTS package
Shared Dart implementation for mobile and desktop Flutter apps
Live voice discovery from the Microsoft Edge online speech synthesis endpoint
Supports text synthesis to bytes, files, or stream events
Supports raw SSML synthesis
Supports optional sentence and word boundary metadata
Supports multiple output formats including MP3, WebM, Ogg, PCM, and WAV
As of 2026-06-03, the live endpoint returns 322 voices across 142 locales and 75 languages

Platform support

Android
iOS
macOS
Windows
Linux

Preview

Mobile	Desktop

Installation

dependencies:
  flutter_edge_tts: ^0.0.2

flutter pub get

Quick start

import 'package:flutter_edge_tts/flutter_edge_tts.dart';

final tts = FlutterEdgeTts(
  voice: 'en-US-AriaNeural',
  outputFormat: EdgeTtsOutputFormat.audio24Khz96KbitrateMonoMp3,
);

final result = await tts.synthesize('Hello from Flutter.');
print(result.audioBytes.length);

await tts.close();

Core API

Load voices

final voices = await tts.getVoices();
print(voices.first.shortName);

Synthesize text

final result = await tts.synthesize(
  'Hello from Flutter.',
  prosody: const EdgeTtsProsody(
    rate: '1.05',
    pitch: '+10Hz',
    volume: '100',
  ),
);

Synthesize to files

final result = await tts.synthesizeToFile(
  'Hello from Flutter.',
  audioFilePath: '/tmp/edge.mp3',
  metadataFilePath: '/tmp/edge.json',
);

Stream synthesis events

final stream = tts.synthesizeStream(
  'Hello from Flutter.',
  prosody: const EdgeTtsProsody(rate: '1.1', pitch: '+20Hz'),
);

await for (final event in stream) {
  if (event is EdgeTtsAudioChunkEvent) {
    print(event.chunk.length);
  } else if (event is EdgeTtsMetadataEvent) {
    print(event.metadata.items.length);
  }
}

Synthesize raw SSML

final ssml = '''
<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="en-US">
  <voice name="en-US-AriaNeural">
    <prosody rate="1.05" pitch="+10Hz">
      Hello from raw SSML.
    </prosody>
  </voice>
</speak>
''';

final result = await tts.synthesizeSsml(ssml);

Configuration

final tts = FlutterEdgeTts(
  voice: 'en-US-AriaNeural',
  voiceLocale: 'en-US',
  outputFormat: EdgeTtsOutputFormat.audio24Khz96KbitrateMonoMp3,
  enableSentenceBoundary: true,
  enableWordBoundary: true,
);

tts.updateConfig(
  tts.config.copyWith(
    voice: 'en-GB-SoniaNeural',
    enableWordBoundary: true,
  ),
);

Voice naming and compatibility

Common voice names look like:

en-US-AriaNeural
en-US-AndrewMultilingualNeural

The locale prefix identifies the primary locale of the voice. The package uses this naming pattern to infer a default voiceLocale when one is not provided explicitly.

The broader Microsoft Azure Speech ecosystem includes voice families such as Neural, MultilingualNeural, Neural HD, and MAI-Voice-1. This package is implemented around the Edge Read Aloud style transport path rather than the full Azure Speech SDK surface, so standard neural voices are the safest default choice. Advanced Azure-only voice families should be validated in your own environment before being exposed as supported options.

For production integrations, prefer voices returned by getVoices() over hard-coded assumptions.

Metadata

When sentence or word boundary metadata is enabled, synthesis can return structured metadata alongside audio. This is useful for read-along highlighting, subtitle timing, accessibility flows, and language learning interfaces.

Example app

A runnable demo app is included in example/lib/main.dart. It demonstrates voice loading, voice selection, output format selection, metadata toggles, and file synthesis.

Notes

The high-level text API escapes text by default. Raw SSML should still be XML-safe.
Audio playback is intentionally left to the host app.
Voice and language counts come from the live endpoint and may change over time.
Upstream service behavior can change over time. If request headers, websocket framing, or voice endpoints change, the package may need to be updated.

Project

Repository: Moosphan/flutter_edge_tts
Issue tracker: GitHub Issues

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.github/workflows		.github/workflows
doc/images		doc/images
example		example
lib		lib
scripts		scripts
test		test
.gitignore		.gitignore
.metadata		.metadata
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
README.zh-CN.md		README.zh-CN.md
RELEASE_NOTES_0.0.1.md		RELEASE_NOTES_0.0.1.md
analysis_options.yaml		analysis_options.yaml
pubspec.yaml		pubspec.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

flutter_edge_tts

Features

Platform support

Preview

Installation

Quick start

Core API

Load voices

Synthesize text

Synthesize to files

Stream synthesis events

Synthesize raw SSML

Configuration

Voice naming and compatibility

Metadata

Example app

Notes

Project

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

flutter_edge_tts

Features

Platform support

Preview

Installation

Quick start

Core API

Load voices

Synthesize text

Synthesize to files

Stream synthesis events

Synthesize raw SSML

Configuration

Voice naming and compatibility

Metadata

Example app

Notes

Project

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages