You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Jan 19, 2026. It is now read-only.
Some overlapping / confusing lyrics (e.g. when there are background vocals at the same time as primary)
The scrolling lyrics should ideally only track the primary vocals for now; although this does inspire me that we could possible handle duets with this approach in future!
One approach to resolving this might be to separate the audio with a background-vocals (BVE) model first and only pass the primary vocal stem to whisper in the first place. TBD if this helps.
One or two lines of totally unexpected lyrics; need to investigate why. I suspect these are also backing vocals which couldn't find a place.
At least one case where there was still a misheard word left in the corrected lyrics ("shattered" vs. "flattered"); this is less critical but possibly an opportunity to tweak the prompt to give it more guidance, e.g. asking it to watch out for sound-alike words like this and correct them.
It would be good to add some more functionality to the correction method to help debug and tweak the prompt, e.g. logging changes to each segment to a file, printing / writing an overall diff of the spotify lyrics vs. the corrected lyrics, etc.