Sustainability Assessment of a Small Language Model (SLM)

This repository contains the codebase for the case study “Assessing the Sustainability of Language Models”, developed for the course 1210X Quantitative Methods to Assess Sustainability.

In this project, you will train a Small Language Model (SLM) based on a GPT-style Transformer architecture and assess the sustainability impacts of both the training phase and the prompting phase across environmental, economic, and social dimensions.

The goal of the case study is not to optimize model performance, but to understand how computational design and usage choices translate into sustainability impacts.

Repository structure

.
├── data/
│   └── prepare.py        # Dataset preparation (Tiny Shakespeare, character-level)
│
├── src/
│   ├── model.py          # Model architecture
│   ├── train.py          # Training script
│   └── prompt.py         # Inference/ prompting script
│
└── README.md

Create a new environment for this project

To create a new environment, in the project directory run:

conda env create -f env_requirements.yaml

To activate it:

conda activate slm-sustainability

General workflow

Prepare the dataset
Run the model arquitecture
Train the language model
Run inference (prompting)
Quantify sustainability impacts

Each step is described below.

1. Dataset preparation (`data/prepare.py`)

This script prepares the Tiny Shakespeare dataset for character-level language modeling.

It performs the following steps:

Downloads the dataset (if not already present)
Builds a character-level vocabulary
Splits the data into training and validation sets
Generates:
- train.bin
- val.bin
- meta.pkl (vocabulary and encoding metadata)

Run this script once before training:

python data/prepare.py

You are not required to modify this file.

2. Model architecture (`src/model.py`)

This file contains the complete definition of the language model.

Architecture: Transformer, decoder-only
Conceptually similar to GPT-family models (e.g. GPT‑2 / ChatGPT), but much smaller
Includes:
- Token and positional embeddings
- Masked multi-head self-attention
- Feed-forward (MLP) layers
- Residual connections and Layer Normalization

You are not required to modify this file. However, students interested in model design are encouraged to experiment with more complex variants.

In all cases, you should:

Inspect it to understand the model structure
Report the architecture and number of parameters in your sustainability assessment

3. Training (`src/train.py`)

This script trains the Small Language Model from scratch.

Run this script using:

python src/train.py

Tunable parameters

At the top of train.py, you will find a configuration section where you can adjust parameters such as:

Model size
- Number of layers
- Embedding dimension
- Number of attention heads
Training workload
- Batch size
- Number of training iterations
Hardware
- CPU or GPU

These parameters are the main levers you should use for:

Sensitivity analysis
Scenario comparison
Sustainability trade-off evaluation

What you should NOT change

The training loop logic
The loss function
The data loading logic

Where to implement CodeCarbon (training)

You must integrate CodeCarbon in this file to measure:

Energy consumption
CO₂-equivalent emissions during training

4. Inference / Prompting (`src/prompt.py`)

Run this script using:

python src/prompt.py

This script performs inference using a trained model checkpoint.

It:

Loads the trained model
Accepts a text prompt
Generates new tokens autoregressively

Tunable parameters

You may adjust:

Prompt text
Number of generated tokens
Sampling parameters (e.g. temperature, top-k)

These parameters control the inference workload, which is essential for:

Comparing training vs usage impacts
Scaling impacts to realistic deployment scenarios

Where to implement CodeCarbon (inference)

You must also integrate CodeCarbon in this file to measure:

Energy and emissions per prompt
Energy and emissions as a function of generated tokens

Learning objectives

After completing this case study, you should be able to:

Apply life-cycle thinking to digital and computational systems
Understand the structure of modern language models
Quantify environmental impacts of training and inference
Perform sensitivity analysis on computational parameters
Reflect on trade-offs across environmental, economic, and social dimensions
Connect small-scale experiments to large-scale AI deployment

Important notes

Model performance (text quality) is not graded
Transparency, assumptions, and reproducibility are essential
Clearly document all parameter choices and scenarios in your report
Focus on sustainability insights, not deep learning optimization

If you have questions about the code structure or the scope of allowed modifications, refer to the assignment description or contact the course TAs.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sustainability Assessment of a Small Language Model (SLM)

Repository structure

Create a new environment for this project

General workflow

1. Dataset preparation (`data/prepare.py`)

2. Model architecture (`src/model.py`)

3. Training (`src/train.py`)

Tunable parameters

What you should NOT change

Where to implement CodeCarbon (training)

4. Inference / Prompting (`src/prompt.py`)

Tunable parameters

Where to implement CodeCarbon (inference)

Learning objectives

Important notes

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
data		data
src		src
.gitignore		.gitignore
README.md		README.md
env_requirements.yaml		env_requirements.yaml

albedamm/GWPinLanguageModels

Folders and files

Latest commit

History

Repository files navigation

Sustainability Assessment of a Small Language Model (SLM)

Repository structure

Create a new environment for this project

General workflow

1. Dataset preparation (data/prepare.py)

2. Model architecture (src/model.py)

3. Training (src/train.py)

Tunable parameters

What you should NOT change

Where to implement CodeCarbon (training)

4. Inference / Prompting (src/prompt.py)

Tunable parameters

Where to implement CodeCarbon (inference)

Learning objectives

Important notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

1. Dataset preparation (`data/prepare.py`)

2. Model architecture (`src/model.py`)

3. Training (`src/train.py`)

4. Inference / Prompting (`src/prompt.py`)

Packages