AI Engineer | Building Production-Scale AI Systems | Open Source Contributor
I specialize in developing and deploying advanced AI systems with a focus on RAG, LLMs, Voice Agents, Speech-to-Text, and Text-to-Speech. Currently engineering AI solutions for Fortune 100 companies at Avirso.
- Contributing to Hugging Face TRL (Transformer Reinforcement Learning) - 16.3k+ ⭐
- Contributing to OpenVoiceChat - A library for creating voice agents (252 ⭐)
- Researching hallucination-free speech-to-text systems
- Developing end-to-end audio models for voice interactions
🗣️ Unhallucinated Faster Whisper - Reducing hallucinations in OpenAI Whisper models when processing audio with human noise. Available on PyPI and actively used in production environments.
🤖 OpenVoiceChat - Open-source library enabling developers to build sophisticated voice agents with integrated RAG capabilities and end-to-end audio processing.
⚖️ CaseLink - AI-powered all-in-one solution for law firms, featuring a legal research assistant with RAG, document generation, and Redis caching for optimized performance.
First Arabic EOU Model - Created the first open-source end-of-utterance (EOU) detection model for the Arabic language, enabling natural conversation flow in voice agents.
⚡ Enterprise-Grade RAG - Modular RAG implementation with NVIDIA NeMo Guardrails, semantic caching, and a Redis vector database.
Mitigating Hallucinations in Speech-to-Text Systems - IEEE 4th International Conference on Computing and Machine Intelligence (ICMI)
AI/ML: PyTorch • TensorFlow • LangChain • LangGraph • CrewAI • Hugging Face
Infrastructure: CUDA • TensorRT • Triton Inference Server • Microsoft Foundry • Google Vertex AI
Databases: Redis • Weaviate • MongoDB • DynamoDB • Google Bigtable
Languages: Python • C++ • C • C# • CUDA • Go
- 3 merged PRs to Hugging Face's TRL repository
- Published research paper at IEEE ICMI conference
- 2x AWS Scholarship recipient for Nanodegrees
- President of Microsoft Learn Student Ambassadors at FAST NUCES Islamabad
- Model Parallelism: Building and Deploying Large Neural Networks - NVIDIA
- Custom ASR for Speech AI - NVIDIA
- Computer Vision for Industrial Inspection - NVIDIA
💡 Passionate about pushing the boundaries of AI, particularly in voice and speech technologies. Open to collaborations on cutting-edge AI projects.



