# pipeline-architecture

Here are 18 public repositories matching this topic...

BetWork is a two-stage project for a betting platform: first designing a robust data pipeline, then performing data analytics on player behavior, betting patterns, and financial trends to support business decision-making and operational strategy.

  • Updated Sep 20, 2024
  • Jupyter Notebook

Structured data-cleaning pipeline built on the Olist e-commerce dataset, focusing on reproducible ingestion, schema profiling, and light standardization using Python and pandas. Emphasizes clear pipeline stages, auditability, and portfolio-ready documentation.

  • Updated Apr 3, 2026
  • Python

Enterprise-grade ML framework showcasing advanced scikit-learn implementations: production-ready pipelines, algorithm-optimized synthetic data generation, an evaluation suite with statistical testing, custom transformers, ensemble methods, and industry applications across healthcare, finance, and manufacturing.

  • Updated Dec 14, 2025
  • Jupyter Notebook

A real-time Hindi → English voice translation pipeline with streaming speech recognition and offline ASR. It combines ASR (Kaldi/Vosk) with a transformer-based machine translation model to form a complete speech → text → translation pipeline.

  • Updated Dec 24, 2025
  • Python
