Gradient Descent from Scratch

Numerical Optimization | From-Scratch Implementation | R

Overview

This project implements a gradient descent optimization algorithm from scratch in R, applied to credit balance prediction on a high-multicollinearity dataset. The objective is algorithm design, not prediction, building a custom iterative solver and validating its mathematical integrity against standard pre-packaged regularized regression.

The implementation draws directly on numerical optimization principles (AM 230) and econometric theory (Econ 217, 294A), combining the gradient calculus of MSE minimization with matrix-form linear regression to build a solver that converges to the same solution as glmnet without using it.

Algorithm

Model: Linear specification Y = Xβ, solved iteratively via gradient descent rather than OLS closed-form.

Loss function:

$$L(\beta) = \frac{1}{n}\sum_{i=1}^{n}(y_i - X_i\beta)^2$$

Gradient (derived analytically):

$$\nabla L(\beta) = -\frac{2}{n}X^T(y - X\beta)$$

Update rule:

$$\beta_{k+1} = \beta_k - \alpha \nabla L(\beta_k)$$

with step size α = 0.01. Both the loss function and gradient are implemented as standalone modular functions with matrix input validation.

Results

Metric	Value
Convergence	4,906 iterations
Final MSE	9,406.32
Accuracy (1 − WAPE)	92.46%

The custom solver converges to coefficients consistent with closed-form OLS on the structured credit dataset, validating the correctness of the gradient derivation and update implementation.

Dataset

Credit Balance dataset (n = 400), designed for high multicollinearity, variables are closely correlated predictors of one another. This makes it well-suited for testing optimizer precision and convergence stability rather than feature selection.

Features: Income, Limit, Cards, Age, Education, Own, Student, Married, Region, Credit Score

Outcome: Balance (credit card balance)

Because the data is structurally aligned, the challenge is navigating the loss surface precisely, not finding the right features.

Repository Structure

├── methods/
│   ├── mse_loss.R        # MSE loss function L(β) = (1/n)Σ(y - Xβ)²
│   └── grad_mse_loss.R   # Negative gradient ∇L(β) = -(2/n)Xᵀ(y - Xβ)
├── analysis/
│   └── grad_descent.Rmd  # Full implementation, convergence analysis, benchmarking
├── output/
│   └── grad_descent.pdf  # Rendered results and convergence plots
└── README.md

Tech Stack

R — tidyverse, ggplot2, dplyr

Core concepts: multivariable calculus, linear algebra, numerical optimization

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Gradient Descent from Scratch

Overview

Algorithm

Results

Dataset

Repository Structure

Tech Stack

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
analysis		analysis
methods		methods
output		output
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

Gradient Descent from Scratch

Overview

Algorithm

Results

Dataset

Repository Structure

Tech Stack

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages