Batch Media Insight Extractor

A local-first AI workflow software prototype that converts batches of images and videos into structured text, summaries, keywords, key points, and Word/PDF reports.

Product Positioning

Batch Media Insight Extractor is not just a small script. It is a local AI workflow software prototype designed to process unstructured media files and turn them into reusable knowledge assets.

It is built for scenarios where users need to handle many screenshots, images, short videos, lectures, screen recordings, or social media clips and convert them into structured reports.

Key Features

Batch image and video file detection
Image OCR with Chinese and English support
OCR text cleanup and formatting
Local image summary and keyword extraction
Video metadata extraction
Video preview frame generation
Video audio extraction
Local Whisper speech-to-text transcription
Video transcript cleanup and local summarization
Word and PDF report generation
Batch report archiving
Apple-style Streamlit dashboard UI
Chinese and English UI switching
Theme color switching
One-click local full workflow
Environment check and repair launchers

Screenshots

Screenshots should be added later using safe demo files only.

Suggested screenshot placeholders:

Dashboard overview
One-click local workflow success
Video info and preview frames
Reports center
Environment READY check
Demo Word/PDF report preview

Workflow

Add images and videos into the local input folder.
Start the local web software.
Run the one-click local full workflow.
The system extracts image text, video transcripts, metadata, summaries, and keywords.
Word/PDF reports are generated and archived by batch.

Tech Stack

Python
Streamlit
Tesseract OCR
faster-whisper
OpenCV
imageio-ffmpeg
pandas
python-docx
pywin32
Windows CMD launchers

Local-First and Privacy-Aware Design

The current version is designed around local processing. Private media files, generated reports, logs, and local outputs are intentionally excluded from the public showcase version.

OpenAI or ChatGPT API enhanced summarization is reserved for a later content creation, portfolio demonstration, or commercialization stage.

Public Showcase Safety

This public version should not include:

input_media/
output/
logs/
API keys
private images
private videos
generated private reports
model cache files

Portfolio Value

This project demonstrates practical AI workflow design, local automation, OCR integration, speech-to-text integration, report generation, UI design, and privacy-aware GitHub packaging.

Roadmap

Add safe demo screenshots
Improve onboarding and settings page
Strengthen local summarization quality
Add optional OpenAI enhanced summary later
Explore Windows packaging and installer options
Prepare a polished public portfolio page

Status

Checkpoint: VIDEO-EXTRACT-045

Current stage: online public GitHub showcase release.

Public Showcase Status

Current public showcase checkpoint: VIDEO-EXTRACT-041

This repository is a public portfolio version. Private media files, generated outputs, logs, API keys, and private backups are intentionally excluded.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
assets		assets
config		config
docs		docs
modules		modules
portfolio		portfolio
.gitignore		.gitignore
Check_Environment.cmd		Check_Environment.cmd
GITHUB_RELEASE_CHECKLIST.md		GITHUB_RELEASE_CHECKLIST.md
GITHUB_UPLOAD_READY.md		GITHUB_UPLOAD_READY.md
PROJECT_STATUS.md		PROJECT_STATUS.md
PUBLIC_RELEASE_NOTES.md		PUBLIC_RELEASE_NOTES.md
PUBLIC_SHOWCASE_MANIFEST.md		PUBLIC_SHOWCASE_MANIFEST.md
README.md		README.md
Repair_Environment.cmd		Repair_Environment.cmd
Run_Local_Full_Workflow.cmd		Run_Local_Full_Workflow.cmd
Start_VideoExtractSkill.cmd		Start_VideoExtractSkill.cmd
app.py		app.py
requirements.txt		requirements.txt
run_check_environment.py		run_check_environment.py
run_launcher.py		run_launcher.py
run_step_001_inventory.py		run_step_001_inventory.py
run_step_002_image_info.py		run_step_002_image_info.py
run_step_005_image_ocr.py		run_step_005_image_ocr.py
run_step_005d_clean_ocr_text.py		run_step_005d_clean_ocr_text.py
run_step_007a_generate_summary_report.py		run_step_007a_generate_summary_report.py
run_step_007a_local_ai_summary.py		run_step_007a_local_ai_summary.py
run_step_009_video_info.py		run_step_009_video_info.py
run_step_010_video_audio_extract.py		run_step_010_video_audio_extract.py
run_step_011_video_transcribe.py		run_step_011_video_transcribe.py
run_step_012_video_summary_report.py		run_step_012_video_summary_report.py
run_step_015_local_full_workflow.py		run_step_015_local_full_workflow.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Batch Media Insight Extractor

Product Positioning

Key Features

Screenshots

Workflow

Tech Stack

Local-First and Privacy-Aware Design

Public Showcase Safety

Portfolio Value

Roadmap

Status

Public Showcase Status

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Batch Media Insight Extractor

Product Positioning

Key Features

Screenshots

Workflow

Tech Stack

Local-First and Privacy-Aware Design

Public Showcase Safety

Portfolio Value

Roadmap

Status

Public Showcase Status

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages