Update all Whisper recipes #397

Open
Copilot wants to merge 15 commits into main from copilot/update-whisper-recipe-model-builder
Copilot AI commented May 6, 2026

This pull request adds CPU, CUDA, and WebGPU optimization recipes for the OpenAI Whisper models using Olive, enabling INT8 quantization across multiple execution providers. Each backend gets its own configuration, documentation, and dependency files, making it easier to optimize and deploy the models on different hardware targets.

New optimization recipes and configuration:

  • Added Olive recipe JSON files for INT8 quantization targeting the CPU (e.g., whisper-base.en_cpu_int8.json), CUDA (e.g., whisper-base.en_cuda_int8.json), and WebGPU (e.g., whisper-base.en_webgpu_int8.json) backends, each specifying the appropriate execution provider and quantization pipeline.
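
The file-naming pattern in the bullet above can be sketched in a few lines of Python. This is an illustrative helper, not code from the PR: `BACKEND_TO_EP`, `recipe_filename`, and `recipe_table` are hypothetical names, and the execution provider strings are ONNX Runtime's usual provider identifiers, assumed here rather than read from the recipe files (the WebGPU one in particular should be checked against the actual recipe).

```python
# Hypothetical helper: names and mapping are illustrative, not taken from the PR.
# The provider strings are assumed to match what each recipe JSON specifies.
BACKEND_TO_EP = {
    "cpu": "CPUExecutionProvider",
    "cuda": "CUDAExecutionProvider",
    "webgpu": "WebGpuExecutionProvider",  # assumption: verify against the recipe
}


def recipe_filename(model: str, backend: str, precision: str = "int8") -> str:
    """Build a recipe file name following the <model>_<backend>_<precision>.json pattern."""
    if backend not in BACKEND_TO_EP:
        raise ValueError(f"unknown backend: {backend!r}")
    return f"{model}_{backend}_{precision}.json"


def recipe_table(model: str) -> dict:
    """Map each backend to its (recipe file name, execution provider) pair."""
    return {
        backend: (recipe_filename(model, backend), ep)
        for backend, ep in BACKEND_TO_EP.items()
    }


if __name__ == "__main__":
    for backend, (filename, ep) in recipe_table("whisper-base.en").items():
        print(f"{backend:7s} {filename:35s} {ep}")
```

Keeping the backend-to-provider mapping in one place makes it harder for a recipe's file name and its execution provider to drift apart when a new backend is added.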

Documentation and metadata:

  • Added README files for each backend (cpu/README.md, cuda/README.md, webgpu/README.md) with setup instructions and backend-specific notes.
  • Added info.yml files for each backend, describing the recipe, device, and execution provider.

Dependencies and licensing:

  • Added requirements.txt for each backend, listing olive-ai and onnxruntime-genai as dependencies.
  • Added an Apache 2.0 license file in the base directory.
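
Before running a recipe, it can help to confirm that the packages listed in requirements.txt are actually installed. The following is a minimal sketch using only the Python standard library; `REQUIRED` and `installed_versions` are hypothetical names, and the package list simply mirrors the two dependencies named above, without any version pins.

```python
from importlib.metadata import PackageNotFoundError, version

# Distribution names as listed in the description above (no version pins assumed).
REQUIRED = ("olive-ai", "onnxruntime-genai")


def installed_versions(packages=REQUIRED) -> dict:
    """Return {package: installed version string, or None if it is not installed}."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = None
    return found


if __name__ == "__main__":
    for name, ver in installed_versions().items():
        status = ver if ver is not None else "MISSING (pip install -r requirements.txt)"
        print(f"{name}: {status}")
```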

Requires:

Copilot AI and others added 5 commits May 6, 2026 17:13
Agent-Logs-Url: https://github.com/microsoft/olive-recipes/sessions/93fb6115-fced-4b86-8f92-1e7086fb1669

Co-authored-by: kunal-vaishnavi <115581922+kunal-vaishnavi@users.noreply.github.com>
Agent-Logs-Url: https://github.com/microsoft/olive-recipes/sessions/0f07273c-6c12-4ab8-8cf9-3a706d15f1ea

Co-authored-by: kunal-vaishnavi <115581922+kunal-vaishnavi@users.noreply.github.com>
…ynamicQuantization pass

Agent-Logs-Url: https://github.com/microsoft/olive-recipes/sessions/73dad5b1-ff7e-449c-b55c-00d070a88d59

Co-authored-by: kunal-vaishnavi <115581922+kunal-vaishnavi@users.noreply.github.com>
…c quantization output)

Agent-Logs-Url: https://github.com/microsoft/olive-recipes/sessions/e85b5f33-587c-4313-b97e-951b5ee5e35a

Co-authored-by: kunal-vaishnavi <115581922+kunal-vaishnavi@users.noreply.github.com>
…-large-v3-turbo

Agent-Logs-Url: https://github.com/microsoft/olive-recipes/sessions/f59cbe21-d819-4b4e-b3b5-85c7218e4fb5

Co-authored-by: kunal-vaishnavi <115581922+kunal-vaishnavi@users.noreply.github.com>
@kunal-vaishnavi kunal-vaishnavi marked this pull request as ready for review May 6, 2026 20:43
Copilot AI review requested due to automatic review settings May 6, 2026 20:43
… (36 total)

Agent-Logs-Url: https://github.com/microsoft/olive-recipes/sessions/00275554-1df2-42e5-b52d-52c634ef94c8

Co-authored-by: kunal-vaishnavi <115581922+kunal-vaishnavi@users.noreply.github.com>
Copilot AI left a comment

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

Copilot AI and others added 2 commits May 7, 2026 17:03
Agent-Logs-Url: https://github.com/microsoft/olive-recipes/sessions/c03b66e2-2af8-43e8-85d8-f3da8631dad4

Co-authored-by: kunal-vaishnavi <115581922+kunal-vaishnavi@users.noreply.github.com>
Agent-Logs-Url: https://github.com/microsoft/olive-recipes/sessions/38ce80db-c13b-4147-a3af-0daf773dd491

Co-authored-by: kunal-vaishnavi <115581922+kunal-vaishnavi@users.noreply.github.com>
Agent-Logs-Url: https://github.com/microsoft/olive-recipes/sessions/089e82c6-1bd3-479f-94f0-a2ed8d62edb0

Co-authored-by: kunal-vaishnavi <115581922+kunal-vaishnavi@users.noreply.github.com>
Agent-Logs-Url: https://github.com/microsoft/olive-recipes/sessions/f3040593-bcd7-40b4-854e-1655f3754e97

Co-authored-by: kunal-vaishnavi <115581922+kunal-vaishnavi@users.noreply.github.com>
Copilot stopped work on behalf of kunal-vaishnavi due to an error May 7, 2026 22:24
Copilot stopped work on behalf of kunal-vaishnavi due to an error May 7, 2026 22:28
@kunal-vaishnavi kunal-vaishnavi changed the title Whisper large-v3: restructure recipes with per-EP precision, dynamic quantization, and correct int8 labeling Update all Whisper recipes May 7, 2026
Copilot stopped work on behalf of kunal-vaishnavi due to an error May 7, 2026 23:03
3 participants