Data Scientist & Software Engineer with 4.6+ years of experience in Python, predictive modeling, and production-level deployments. Kaggle Top 2% globally and experienced in leading cross-functional teams at Omdena. Strong technical foundation spanning the full ML lifecycle. Successfully delivered production ML systems processing 150K+ data points with 83.5% accuracy, deployed real-time prediction APIs, and built retraining workflows.
- 🌱 I’m currently learning GenAI
- 👯 I’m looking to collaborate on Data Science Projects
-
Senior Software Engineer at SS&C Technologies Apr 2024 - Jan 2026 | Software Engineer Aug 2021 - Mar 2024 | Hyderabad, India
- Developed an automated monitoring framework for FOCG maintenance data pipelines, comparing Postgres vs DB2 tables in 5-minute intervals; ensuring early detection of missing MQ messages and improved data reliability. Reduced manual incident discovery time by 80%.
- Engineered ETL pipelines by integrating datasets across sources, preprocessing them, and ingesting into enterprise DBs with alerts and failure notifications, which reduced manual monitoring by 90%.
- Built an address enrichment pipeline that parses Smarty API responses, applies business logic & updated DataOps tables; enhanced data quality and integrity across millions of address records.
- Automated 50+ critical features validation scenarios simulating production workflows — cutting manual QA efforts by 50%.
-
Omdena - Freelance Data Scientist AQI Calculator - AQI-Calculator.onrender.com | GitHub | Blog | Pandas | Seaborn | Scikit-learn | Flask | Certificate
- Trained predictive models for Air Quality Index (AQI) based on input pollutants, attained 83.5% accuracy (R-squared) on 147,000+ data points.
- Spearheaded a multi-national team of 5 at Omdena, managed Data Processing, EDA, Model Training, & Documentation phases.
- Resolved 30% of missing values using Day-wise mean imputation technique after performing extensive EDA using Pandas & Seaborn.
- Deployed the best model as a Flask app on Render for real-time prediction.
-
AirCast - AirCastAQI.netlify.app | GitHub | Python | Time-Series Forecasting | GitHub Actions | Netlify
- Engineered a fully automated MLOps pipeline using GitHub Actions and a SARIMAX model to continuously ingest Air Quality Index data and retrain daily for the next 5-day AQI forecasts.
- Collected 3.5 years of historical data from CPCB. Analyzed the data - identified trends & seasonalities using Pandas & Seaborn.
- Scheduled a real-time data ingestion Cron Job (GitHub Actions) leveraging a real-time API call to collect new data & retrain the model daily.
- Implemented CI/CD for ML: model validation & deployment triggers - Built monitoring & logging system ensuring 99%+ uptime for prediction service.
-
Practice Abacus Online | https://practiceabacusonline.com/ | Demo video
- A responsive website for practicing Abacus (Math Tool) online & downloading practice sheets for free. An innovative solution that replaces old-school way of learning & practicing Abacus by introducing new ways to hone skills.
- Currently, this website boasts a monthly user base of over 2000+ individuals globally and has garnered commendable feedback for its user-friendly interface and effectiveness.
- Leveraged a comprehensive tech stack including Bootstrap, Vanilla JavaScript, Particle JS, Hover CSS, JsPdf, SpeechSynthesis to deliver a seamless user experience.
- Created 16+ interactive pages with diverse functionalities and customization options, providing a comprehensive and engaging learning experience.
- Future ideas: Set up login mechanism, database integration, analyse student’s performance by creating dashboards.
- Top 2% globally on Kaggle.
- Secured Rank 660 in TCS Codevita among 185,000+
- Top 7% in HackWithInfy.
- Scored 1417/1800 in TCS NQT.
- Data Science:
- Machine Learning • Deep Learning • Predictive Modeling • Regression & Classification • Time Series Forecasting • Exploratory Data Analysis (EDA) • Probability & Statistics • Data Cleaning • Data Visualization • Neural Networks
- Python & ML Libraries:
- NumPy • Pandas • Matplotlib • Seaborn • Plotly • Scikit-learn • TensorFlow • Keras • Statsmodels
- Programming & MLOps:
- Python • SQL (PostgreSQL) • CI/CD Pipelines • GitHub Actions • Model Deployment (Flask, Render, Netlify)
- Familiar With:
- Generative AI • LLMs • AWS • NLP • Agentic AI • RAG • Docker/Kubernetes
- Osmania University
- B.E in CSE
- July 2021 | Hyderabad, India | CGPA: 8.59 / 10
- Sri Chaitanya Junior College
- Maths, Physics & Chemistry
- May 2017 | Hyderabad, India | 983 / 1000
- St. Adams High School
- Mar 2015 | Hyderabad, India | CGPA: 9.7 / 10
- Kaggle: https://kaggle.com/pranaii
- LinkedIn: https://www.linkedin.com/in/bomma-pranay/
- GitHub: https://github.com/Bomma-Pranay
- Email: gullupranay@gmail.com
- Honored by former President of India Mrs. Pratibha Patil in Rashtrapati Bhavan for securing State 1st in Abacus Competition in 2011 & 5 International and National Level competitions.
- Special appreciation award in National Science Day Celebrations held at our college

