Skip to content
View arifme071's full-sized avatar

Block or report arifme071

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
arifme071/README.md

Md Arifur Rahman

ML Engineer · AI Researcher · Data Scientist · Data Engineer

Deploying production ML systems at the intersection of AI, manufacturing, and transportation

Google Scholar LinkedIn Email ORCID

US Work Authorized Location

---

👤 About Me

I am a Rail Operations and Applied AI/ML Engineer with 10+ years of combined experience across industrial engineering, rail operations, and machine learning systems. My work spans the full ML lifecycle — from peer-reviewed research to production deployment — with a focus on industrial optimization, predictive analytics, and condition monitoring for transportation and manufacturing systems.

Currently a PIN Fellow at Georgia Tech (Georgia-AIM grant), I develop AI-driven manufacturing optimization pipelines. Previously at Norfolk Southern Corporation, I built GIS-enabled analytics dashboards adopted company-wide. I hold an MSc in Applied Engineering from Georgia Southern University and have been admitted to Georgia Tech's OMSCS (MS in Computer Science) starting Fall 2026.

🔬  PIN Fellow @ Georgia Tech              →  HMM + RL optimization pipelines for manufacturing · RAG-based knowledge systems (2025–Present)
🚂  Supervisor Associate — Operations Division, Mechanical Maintenance @ Norfolk Southern → Rail equipment management, FRA/49 CFR compliance & real-time asset health monitoring (2024–2025)
🎓  Research Assistant @ Georgia Southern  →  95–97% anomaly detection on live rail DAS datasets · AAR/TTCI collaboration (2022–2023)
📄  7 peer-reviewed papers (ASME, Springer, SPIE, Elsevier) — 200+ citations
🎯  Admitted: Georgia Tech OMSCS (CS) →  Fall 2026

🛠️ Technical Stack

Languages

Python SQL Java C++ MATLAB

ML & AI

TensorFlow PyTorch scikit-learn Hugging Face LangChain LangGraph Vertex AI YOLOv8 CrewAI PEFT / LoRA FAISS OpenVINO

Cloud & MLOps

Google Cloud AWS Azure Docker BigQuery Git

Data & Visualization

Pandas Power BI Tableau Streamlit Alteryx


📌 Featured Projects

🤖 Generative AI & LLM Engineering

llm-finetuning-engineering-domain

Two complementary fine-tuning pipelines on railroad AI and manufacturing domain data. BERT/RoBERTa classification: fine-tuned bert-base-uncased94.2% accuracy; roberta-base95.8% on 4-class DAS signal conditions. LoRA generation: Mistral-7B instruction-tuned with only 4.2M trainable params (0.06% of model) using QLoRA 4-bit quantization — ROUGE-L 0.68. Both models published on HuggingFace Hub →
Python BERT RoBERTa Mistral-7B PEFT LoRA QLoRA Hugging Face Transformers NLP Jupyter Notebook

engineering-knowledge-rag  Live Demo

Production RAG pipeline grounded in 7 peer-reviewed publications (200+ citations). Retrieves domain knowledge via FAISS + SentenceTransformers (all-MiniLM-L6-v2, 384-dim cosine search), generates citation-backed answers with Flan-T5 — zero hallucination on domain specifics. Live on HuggingFace Spaces (Docker). Supports drop-in PDF ingestion to extend the knowledge base to any domain.
Python RAG FAISS SentenceTransformers Flan-T5 LangChain Streamlit Docker HuggingFace Spaces


🏭 Manufacturing & Industrial AI

warehouse-visual-intelligence

Production multi-agent AI system for real-time warehouse monitoring, safety violation detection, layout optimization, and cost reduction. A 5-agent pipeline (Vision → Layout → Anomaly → Cost → Orchestrator) processes images via YOLOv8 + GCP Vision API and outputs $/day cost impact per detected inefficiency. FastAPI backend, Streamlit dashboard, GitHub Actions CI/CD.
Python YOLOv8 CrewAI LangChain Google Cloud Vision GCS FastAPI Streamlit Multi-Agent Computer Vision

cv-manufacturing-defect-detection

Real-time surface defect detection for steel manufacturing using YOLOv8 on the NEU Surface Defect benchmark (1,800 images, 6 defect classes). Achieves 95.2% mAP@50 with 2.1ms GPU inference. Exported to Intel OpenVINO IR format for 2–4× CPU speedup on Intel hardware. Extends published WAAM research (Georgia Tech / Springer 2026). Includes Colab training notebook and Streamlit demo.
Python YOLOv8 Intel OpenVINO Jupyter Notebook Streamlit Computer Vision Manufacturing QC Deep Learning

[waam-hmm-rl-optimizer] -coming soon

Hidden Markov Model + Reinforcement Learning pipeline for material design optimization in Wire Arc Additive Manufacturing (WAAM). Deployed under Georgia-AIM grant at Georgia Tech. 5% improvement in material utilization. Peer-reviewed Springer publication (2026).
Python Reinforcement Learning HMM Manufacturing AI MLOps

[ai-polymer-optimisation-mip-v3] -coming soon

AI-guided optimization of MIP/CIP polymer film synthesis using physics-informed data generation, ML surrogates, and multi-objective Pareto optimization — without wet-lab experiments. Achieves 95.2% capture efficiency and 1.998 µm thickness (±0.002 µm of target). Target: JOM / Springer (2026).
Python scikit-learn Random Forest MLP Gaussian Process Latin Hypercube Pareto Optimization Materials AI

manufacturing-powerbi-demo

Power BI + T-SQL demo for manufacturing analytics and real-time monitoring dashboards.
T-SQL Power BI Manufacturing Analytics


🚂 Railroad & Transportation AI

railroad-anomaly-detection-cnn-lstm

Hybrid CNN-LSTM with sliding window for railroad condition monitoring via distributed acoustic sensing (DAS). Achieved 97% train position detection rate on live HTL fiber-optic datasets from AAR/TTCI, Pueblo CO. Published in Green Energy & Intelligent Transportation, Elsevier 2024.
Python TensorFlow CNN LSTM Time-Series Anomaly Detection Signal Processing

das-railroad-gru-lstm

GRU and LSTM models for train presence detection along fiber-optic DAS-instrumented track. 94% detection rate. Published in SPIE Journal of Applied Remote Sensing, 2024.
Python Deep Learning GRU Distributed Acoustic Sensing Rail Safety

train-scheduling-system

Java console-based Train Movement & Scheduling System with CRUD operations, station management, scheduling, file I/O, and full OOP architecture.
Java OOP File I/O Scheduling Algorithms


📊 Data Science & ML Applications

Building-a-Rainfall-Prediction-Classifier

End-to-end ML classification pipeline for rainfall prediction using supervised learning, feature engineering, and model evaluation.
Python scikit-learn Jupyter Notebook Classification

Store-Recommendation-System-Atlanta-GA

Interactive store recommendation system with analytics, K-Means clustering, and Folium map visualizations for Atlanta, GA.
Python Clustering Geospatial Recommender Systems Jupyter Notebook

Python-and-ML-Learning-Apps

Interactive Python & Machine Learning course with 28 structured lessons, built with React.
JavaScript React EdTech Machine Learning


📄 Selected Publications

Year Title Venue Metric
2026 AI-Driven(HMM-RL) decision support system for WAAM Springer HMM + RL — 5% material utilization gain
2026 AI-Guided Polymer Film Synthesis Optimization Springer Manufacturing quality improvement
2024 CNN-LSTM-SW for Railroad Anomaly Detection via DAS Green Energy & Intelligent Transportation, Elsevier 97% detection rate
2024 Deep Learning for DAS-based Railroad CM SPIE Journal of Applied Remote Sensing GRU model: 94% detection
2023 Review of DAS Applications for Railroad CM Mechanical Systems & Signal Processing, Elsevier Widely cited systematic review
2022–2023 ML Models for Rail Safety & Anomaly Detection (3 papers) ASME / Springer 95% accuracy on live HTL datasets

📚 Full publication list on Google Scholar →  |  200+ total citations


🏆 Certifications & Education

Type Details
🎓 Georgia Tech OMSCS MSc Computer Science — Admitted, Fall 2026
🎓 Georgia Southern University MSc Applied Engineering (Advanced Manufacturing)
🎓 CUET BSc Mechanical Engineering
☁️ Google Cloud Data Analytics Certificate
⚙️ Alteryx Designer Core Certification
🐍 Coursera / U of Michigan Applied Machine Learning in Python
🤖 Google Generative AI Leader

📈 GitHub Stats

GitHub Stats    Top Languages

GitHub Streak


🔭 Currently Working On

  • 🏭 WAAM AI Prototype — Deploying HMM + RL material design system at Georgia Tech (Georgia-AIM)
  • 🏗️ Warehouse Visual Intelligence — Extending with real-time RTSP camera feed + BigQuery analytics trend dashboard
  • 🔬 CV Defect Detection — Fine-tuning PPE detection model; Vertex AI deployment pipeline
  • 🤖 LLM Fine-tuning — Scaling LoRA Mistral-7B training dataset; evaluating RAG vs fine-tuned generation

🤝 Let's Connect

I'm actively seeking roles in ML Engineering, AI/Data Science, Data Engineering, and AI Research — particularly in manufacturing, transportation, railway, energy, infrastructure intelligence, or large-scale ML systems.

Open to: Full-time roles at tech & industrial AI companies.

Email LinkedIn Google Scholar HuggingFace ORCID

🇺🇸 Authorized to work in the United States — no sponsorship required


⭐ If any of my projects or research helped you, consider starring the repo!

Pinned Loading

  1. engineering-knowledge-rag engineering-knowledge-rag Public

    RAG pipeline over published research in railroad AI & manufacturing — live demo on HuggingFace Spaces

    Python

  2. warehouse-visual-intelligence warehouse-visual-intelligence Public

    Multi-agent AI system for warehouse monitoring using YOLOv8, Google Cloud & AWS

    Python

  3. llm-finetuning-engineering-domain llm-finetuning-engineering-domain Public

    Fine-tuned BERT (94.2% accuracy) + LoRA Mistral-7B on railroad AI domain data — PEFT · QLoRA · HuggingFace Hub

    Jupyter Notebook

  4. cv-manufacturing-defect-detection cv-manufacturing-defect-detection Public

    YOLOv8 real-time surface defect detection for steel manufacturing — NEU dataset · Intel OpenVINO · 93.7% mAP

    Jupyter Notebook

  5. railroad-anomaly-detection-cnn-lstm railroad-anomaly-detection-cnn-lstm Public

    Hybrid CNN-LSTM with Sliding Window for railroad anomaly detection via DAS fiber-optic sensing — Elsevier GEITS 2024

    Jupyter Notebook

  6. Store-Recommendation-System-Atlanta-GA Store-Recommendation-System-Atlanta-GA Public

    Interactive store recommendation system with analytics, clustering, and map visualizations.

    Jupyter Notebook