CS336 Assignments Implementation

This repository contains the implementation of assignments for the CS336 course (Spring 2025). The project covers various aspects of Large Language Models (LLMs), from basic Transformer architecture to systems, scaling laws, data processing, and alignment.

Project Structure

The repository is organized into five main assignments:

Assignment 1: Basics
- Implementation of core Transformer components (Attention, BPE, etc.).
- Basics of language modeling.
Assignment 2: Systems
- Distributed training techniques.
- Data Distributed Parallelism (DDP) and optimizer sharding.
Assignment 3: Scaling
- Investigation of scaling laws.
- Compute-optimal training (IsoFLOPs).
Assignment 4: Data
- Data processing pipelines for LLM pre-training.
- Deduplication, PII redaction, and quality filtering.
Assignment 5: Alignment
- Techniques for aligning LLMs with human intent.
- Supervised Fine-Tuning (SFT), DPO, and RLHF (GRPO).

Getting Started

Prerequisites

This project uses uv for dependency management. Ensure you have uv installed.

Installation & Usage

Each assignment is a self-contained Python project. To work on a specific assignment, navigate to its directory and sync dependencies.

Example for Assignment 1:

cd assignment1-basics
uv sync
uv run pytest

Please refer to the README.md within each assignment directory for specific instructions and details.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
assignment1-basics		assignment1-basics
assignment2-systems		assignment2-systems
assignment3-scaling		assignment3-scaling
assignment4-data		assignment4-data
assignment5-alignment		assignment5-alignment
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CS336 Assignments Implementation

Project Structure

Getting Started

Prerequisites

Installation & Usage

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CS336 Assignments Implementation

Project Structure

Getting Started

Prerequisites

Installation & Usage

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages