Turn raw video footage into golden content: Auto-generate PowerPoint slides, transcribe speech to text with Whisper, and upscale video to 4K.
A set of Python tools by tanbaycu to process and transform video content.
- Video to PPTX: Extracts key slides from a video based on visual changes and saves them to a PowerPoint presentation.
- Video to Text: Extracts audio and uses OpenAI's Whisper model to transcribe speech to text.
- Video Upscaler: Upscales video to 4K resolution using OpenCV detail enhancement.
- Clone the repository:
git clone https://github.com/tanbaycu/video-alchemy.git
- Install dependencies:
pip install -r requirements.txt
Note: You need ffmpeg installed on your system or accessible via imageio-ffmpeg (included).
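To check which ffmpeg binary is available before running the tools, something like the following may help. It is only a sketch: `find_ffmpeg` is a hypothetical helper, not part of this repository, and the order of preference (system PATH first, then the binary bundled with imageio-ffmpeg) is an assumption.

```python
import shutil


def find_ffmpeg():
    """Return a path to an ffmpeg executable, or None if none is found."""
    # Prefer a system-wide ffmpeg that is already on PATH.
    system_ffmpeg = shutil.which("ffmpeg")
    if system_ffmpeg:
        return system_ffmpeg
    # Otherwise fall back to the binary shipped with imageio-ffmpeg, if installed.
    try:
        import imageio_ffmpeg
        return imageio_ffmpeg.get_ffmpeg_exe()
    except (ImportError, RuntimeError):
        # ImportError: package missing; RuntimeError: no usable binary found.
        return None


if __name__ == "__main__":
    print(find_ffmpeg() or "ffmpeg not found")
```

If this prints "ffmpeg not found", install ffmpeg with your system package manager or reinstall the requirements.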
Detects slides in a video and converts them to a PPTX presentation.
python src/video_to_pptx.py input_video.mp4 output.pptx --threshold 2000 --rate 1.0
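The detection step can be pictured roughly as below. This is a sketch, not the code in src/video_to_pptx.py: it assumes that --rate is the number of frames sampled per second and that --threshold is compared against the mean squared difference between grayscale frames. The function names are hypothetical.

```python
import numpy as np


def frame_step(fps, rate):
    """Frames to advance between samples so that about `rate` frames are read per second."""
    return max(1, round(fps / rate))


def change_score(prev, curr):
    """Mean squared difference between two grayscale frames (assumed metric)."""
    diff = prev.astype(np.float32) - curr.astype(np.float32)
    return float(np.mean(diff ** 2))


def select_slides(frames, threshold):
    """Return indices of frames that differ enough from the last kept frame."""
    kept = []
    last = None
    for i, frame in enumerate(frames):
        # Always keep the first frame; afterwards keep only large visual changes.
        if last is None or change_score(last, frame) > threshold:
            kept.append(i)
            last = frame
    return kept


if __name__ == "__main__":
    blank = np.zeros((4, 4), dtype=np.uint8)
    slide = np.full((4, 4), 100, dtype=np.uint8)
    print(frame_step(fps=30, rate=1.0))
    print(select_slides([blank, blank, slide, slide], threshold=2000))
```

Under these assumptions, a lower threshold keeps more frames (including small changes such as cursor movement), while a higher one keeps only major slide transitions.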
Extracts audio and transcribes it to a text file.
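In outline, this amounts to an ffmpeg audio extraction followed by a Whisper call. The sketch below is an illustration under assumptions, not the repository's implementation: the helper names, the 16 kHz mono WAV intermediate, the "base" model, and the `<video name>.txt` output naming are all hypothetical.

```python
import subprocess
from pathlib import Path


def build_audio_cmd(video_path, wav_path, ffmpeg="ffmpeg"):
    """Build an ffmpeg command that extracts 16 kHz mono WAV audio."""
    return [
        ffmpeg, "-y",        # overwrite the output file if it exists
        "-i", str(video_path),
        "-vn",               # drop the video stream
        "-ac", "1",          # one audio channel (mono)
        "-ar", "16000",      # 16 kHz sample rate
        str(wav_path),
    ]


def transcript_path(video_path, output_dir):
    """Assumed naming scheme: <output_dir>/<video stem>.txt."""
    return Path(output_dir) / (Path(video_path).stem + ".txt")


def transcribe(video_path, output_dir, model_name="base"):
    """Extract audio, run Whisper on it, and write the text to a file."""
    import whisper  # imported lazily; provided by the openai-whisper package

    out_dir = Path(output_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    wav_path = out_dir / (Path(video_path).stem + ".wav")
    subprocess.run(build_audio_cmd(video_path, wav_path), check=True)

    model = whisper.load_model(model_name)
    result = model.transcribe(str(wav_path))
    out_file = transcript_path(video_path, output_dir)
    out_file.write_text(result["text"].strip(), encoding="utf-8")
    return out_file
```

Larger Whisper models are generally more accurate but slower, so the model name is worth exposing as a parameter.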
python src/video_to_text.py input_video.mp4 --output results/
Upscales a video to 4K resolution.
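A frame-by-frame upscale with OpenCV might look like the sketch below. It is not the code in src/video_upscaler.py: the aspect-ratio handling, bicubic interpolation, the `detailEnhance` parameters, and the `mp4v` codec are assumptions, and the function names are hypothetical.

```python
def target_size(width, height, max_w=3840, max_h=2160):
    """Largest even-sized frame that fits in 4K UHD while keeping the aspect ratio."""
    scale = min(max_w / width, max_h / height)
    new_w = round(width * scale)
    new_h = round(height * scale)
    # Many codecs expect even dimensions, so round down to the nearest even value.
    return new_w - new_w % 2, new_h - new_h % 2


def upscale_video(src_path, dst_path):
    """Resize every frame to the 4K target and apply OpenCV detail enhancement."""
    import cv2  # imported lazily; provided by the opencv-python package

    cap = cv2.VideoCapture(src_path)
    if not cap.isOpened():
        raise IOError(f"Cannot open {src_path}")
    fps = cap.get(cv2.CAP_PROP_FPS) or 30.0  # fall back if FPS is unavailable
    width = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH))
    height = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT))
    size = target_size(width, height)
    writer = cv2.VideoWriter(dst_path, cv2.VideoWriter_fourcc(*"mp4v"), fps, size)
    try:
        while True:
            ok, frame = cap.read()
            if not ok:
                break
            big = cv2.resize(frame, size, interpolation=cv2.INTER_CUBIC)
            writer.write(cv2.detailEnhance(big, sigma_s=10, sigma_r=0.15))
    finally:
        cap.release()
        writer.release()


if __name__ == "__main__":
    print(target_size(1280, 720))
```

Note that `cv2.VideoWriter` writes video frames only, so a pipeline like this one would need a separate ffmpeg step to carry the original audio over.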
python src/video_upscaler.py input_video.mp4 output_4k.mp4
- src/: Source code for the tools.
- examples/: Sample inputs and outputs.
Copyright (c) 2025 tanbaycu. All rights reserved.