NiFi Audio Processors

Python-based custom processors for Apache NiFi, specialized in audio/media ingestion and transformation pipelines.

Overview

This repository contains reusable Python scripts designed for NiFi's ExecuteDocumentPython processor (or similar scripted processors). The focus is on handling long-form audio/video files — extracting, chunking, converting, and enriching with metadata — to prepare content for transcription, search, AI analysis, or archiving.

Current highlight: A robust MPG → 30-second MP3 chunk extractor with diagnostic bypass for troubleshooting large-file/content issues.

Key Features

Efficient FFmpeg-based audio extraction
Fixed-duration chunking (configurable)
High-quality MP3 output (VBR)
Rich FlowFile attributes for downstream routing/metadata
Diagnostic modes for real-world NiFi deployment issues
Clean temporary file handling and immediate transfer of results

Processors

Extract MP3 Chunks from MPG (Diagnostic Bypass)

File: processors/extract_mp3_chunks_diagnostic.py

Converts a single MPEG file into sequential 30-second MP3 audio segments.

Bypasses FlowFile content reading (uses direct file path from idol.reference)
Ideal for large files or when NiFi content claiming is problematic
Outputs one FlowFile per chunk with attributes like start time, original source, etc.

Full detailed documentation (inline comments + usage notes in the file)

Requirements

Apache NiFi
FFmpeg installed on NiFi host(s)
ExecuteDocumentPython processor configured

Getting Started

Clone this repo
Copy the Python script to your NiFi script directory or load directly
Configure an ExecuteDocumentPython processor with this script as the handler
Ensure input FlowFiles have required attributes (idol.reference, filename)

Future Plans

Normal (non-bypass) mode version
Configurable chunk duration/quality via attributes
Additional processors: metadata enrichment, format validation, silence trimming
Example NiFi flow templates

Contributions welcome! Open issues or PRs for new processors or improvements.

License

MIT License — see LICENSE file.

Maintained by Vinay (@josepheternity) · Melbourne, Australia

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
processors		processors
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NiFi Audio Processors

Overview

Key Features

Processors

Extract MP3 Chunks from MPG (Diagnostic Bypass)

Requirements

Getting Started

Future Plans

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

NiFi Audio Processors

Overview

Key Features

Processors

Extract MP3 Chunks from MPG (Diagnostic Bypass)

Requirements

Getting Started

Future Plans

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages