[Draft] Sound TTS generation doc & demo#3195
Conversation
There was a problem hiding this comment.
Code Review
This pull request introduces the first phase of AI-powered sound generation, specifically focusing on Text-to-Speech (TTS) capabilities. It includes comprehensive design documentation, updates to the OpenAPI specification, and the implementation of the frontend UI and logic, including a new SoundGen model and a SoundGenModal component. Feedback highlights an endpoint inconsistency in the documentation, missing enum values in SoundCategory for synchronization with the API, a typo in the UI text, and a suggestion to implement state persistence for the sound generation modal to match the behavior of other asset generators.
| export function useSoundGenModal() { | ||
| const invokeModal = useModal(SoundGenModal) | ||
| return function invokeSoundGenModal(project: SpxProject) { | ||
| return invokeModal({ project }) | ||
| } | ||
| } |
There was a problem hiding this comment.
Unlike useSpriteGenModal and useBackdropGenModal, the useSoundGenModal implementation does not currently support state persistence via GenState. This means any user progress in the sound generation modal will be lost if the modal is closed and reopened. Consider integrating SoundGen into GenState and GenHelpers to provide a consistent user experience across all asset generators.
[skip review]
update: #2768