Skip to content
View steviecurran's full-sized avatar

Block or report steviecurran

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
steviecurran/README.md

SUMMARY

Data scientist with 20+ years of experience modelling complex, noisy systems — originally in astrophysics, now applied to real-world data problems.

My work focuses on machine learning, statistical inference, and time series analysis, with an emphasis on extracting signal from difficult datasets and supporting decision-making under uncertainty.

INDUSTRY‑ALIGNED PROJECT HIGHLIGHTS

  • Fraud Detection (Machine Learning)

    Developed supervised models to identify rare events in highly imbalanced data, achieving strong precision while maintaining a low alert rate. Focused on threshold optimisation, uncertainty, and real-world trade-offs between detection and operational cost.

  • A/B Testing Toolkit (Statistical Inference)

    Built reusable Python tools for comparing groups using confidence intervals and hypothesis testing. Designed to support practical decision-making in experimentation workflows.

  • Time Series Forecasting Toolkit

    Created an interactive framework for comparing forecasting models (ARIMA, Holt-Winters, Prophet) with built-in backtesting and error analysis to evaluate real-world performance.

  • Deep Learning for Regression

    Implemented neural network models for continuous parameter estimation from high-dimensional data, including full pipelines for preprocessing, training, validation, and uncertainty assessment.

TOOLS & METHODS

Python (pandas, NumPy, scikit-learn, TensorFlow), statistical modelling, machine learning, time series forecasting, hypothesis testing, data visualisation.

CORE SKILLS

Technical Skills Soft Skills Python Other languages Documentation
Data Analysis Team Leadership dash C HTML
Machine Learning Project Management jupyter IDL Latex
Neural Networks Teaching & Supervision matplotlib PHP Markdown
Data Visualisation Science Communication numpy SQL dashboards
Statistical Analysis Public Speaking pandas Shell scripting Office
Scientific Research TV and Radio scikit-learn Pgplot
Simulations International Collaboration tensorflow Gnuplot

Pinned Loading

  1. SQL2pandas SQL2pandas Public

    Translate SQL queries into pandas operations for fast exploratory analysis and data transformation in Python

    Jupyter Notebook

  2. UK-employment UK-employment Public

    Exploratory analysis and time-series visualisation of UK unemployment trends using Python

  3. ab-testing-toolkit ab-testing-toolkit Public

    A/B testing toolkit

    Jupyter Notebook

  4. quasar-distances quasar-distances Public

    Using Machine Learning to Estimatie Galaxy Distances from Multi-Wavelength Data

  5. fraud-detection-ml fraud-detection-ml Public

    Fraud Detection with Machine Learning

    Jupyter Notebook

  6. time-series-toolkit time-series-toolkit Public

    To run and compare time series methods interactively, giving the option to make a forecast or compare a putatative forecast with how the data actually evolved.

    Jupyter Notebook