Skip to content

fix: support qwen3 model_type and mlx-4bit subdir for moss models#2

Open
teddywangsg wants to merge 1 commit intoappautomaton:mainfrom
teddywangsg:fix/moss-sound-effect-model-type-and-subdir
Open

fix: support qwen3 model_type and mlx-4bit subdir for moss models#2
teddywangsg wants to merge 1 commit intoappautomaton:mainfrom
teddywangsg:fix/moss-sound-effect-model-type-and-subdir

Conversation

@teddywangsg
Copy link
Copy Markdown

Problem

Two issues with appautomaton/openmoss-sound-effect-mlx and related models:

1. model_type: "qwen3" not recognized

The model has config.json with model_type: "qwen3" (incorrect, should be moss_tts_delay), causing _resolve_tts_family() to reject it with:

ValueError: Unknown TTS model_type 'qwen3' in ... Supported: fish_qwen3_omni, vibevoice, audiodit, step1, moss_tts_delay.

2. config.json in subdirectory

The model's config.json lives in mlx-4bit/ subdirectory, but MossTTSDelayConfig.from_path() looks directly in the snapshot root.

Fix

  1. _registry.py: Add qwen3 as a valid model_type that dispatches like moss_tts_delay — checking directory name for sound_effect to return moss_sound_effect.

  2. config.py: Fall back to searching common subdirs (mlx-4bit, mlx-int8, mlx-8bit) when config.json is not found directly in model_dir.

Testing

Verified against appautomaton/openmoss-sound-effect-mlx — both issues resolved, audio generation works.

- Add recognition of 'qwen3' model_type as valid moss family,
  matching the behavior of 'moss_tts_delay' for sound_effect/local/delay dispatch
- Look for config.json in common subdirs (mlx-4bit, mlx-int8, mlx-8bit)
  when not found directly in model_dir, fixing appautomaton/openmoss-*
  model layout issues
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant