Skip to content

Update YouTube with Supadata for transcript service#1733

Merged
santicomp2014 merged 5 commits intomainfrom
yt-change-to-supadata-captions
Feb 5, 2026
Merged

Update YouTube with Supadata for transcript service#1733
santicomp2014 merged 5 commits intomainfrom
yt-change-to-supadata-captions

Conversation

@santicomp2014
Copy link
Copy Markdown
Contributor

Update transcript service to use supadata instead.

Comment on lines 154 to +161
TranscriptInfo(
language_code=caption_track_dict["languageCode"].lower(),
autogenerated=caption_track_dict.get("kind", None) == "asr",
name=caption_track_dict["name"]["simpleText"],
url=caption_track_dict["baseUrl"],
language_code=lang.lower(),
# Supadata doesn't tell us if it's autogenerated, assume False
autogenerated=False,
name=_get_language_name(lang),
# URL format for Supadata (stores video_id and original lang code)
url=f"supadata://{video_id}/{lang}",
)
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could we add a video_id attribute? That way, get_transcript can use the video_id directly instead of having to parse it from the URL

@santicomp2014 santicomp2014 merged commit 0231db1 into main Feb 5, 2026
8 checks passed
@santicomp2014 santicomp2014 deleted the yt-change-to-supadata-captions branch February 5, 2026 19:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants