Skip to content

bjoernhommel/dppd2025-workshop

Repository files navigation

Introduction to Large Language Modeling

This repository contains material for the DPPD2025 conference workshop Introduction to Large Language Modeling.

Abstract

Recent progress in the field of natural language processing (NLP) has had a transformative impact on research practice and methodology in the behavioral and social sciences. Most prominently, the transformer model architecture proposed by Vaswani et al. (2017) has led to the rise of large language models (LLMs) and drastically advanced both natural language generation and understanding. While numerous studies employ methods that rely on accessible but rather limited means of interacting with modern LLMs (e.g., prompt engineering), this workshop offers a conceptual and practical deep dive into the technical foundations of transformer models. The workshop will cover a) conceptual and historical foundations (e.g., distributional semantics), b) the anatomy of the transformer model architecture (e.g., the attention mechanism), c) model training and inference, d) types of transformer models and their individual use cases, e) applications of transformer models in behavioral science (e.g., decoder models for automatic generation of personality items, encoder models for estimating social desirability at the item level, predicting survey response patterns with sentence transformers, and extracting trait information from natural language), and f) working with the Hugging Face ecosystem. The workshop combines theoretical foundations with hands-on coding examples, allowing participants to bridge conceptual understanding with practical implementation. The goal of this workshop is to empower researchers at all career stages to use state-of-the-art NLP techniques in their own research and help them make informed methodological decisions when employing LLMs.

Instructor(s): Björn E. Hommel

Setup

1. Enable WSL2 (Windows only)

Windows: Open PowerShell as Administrator and run:

wsl --install

Restart your computer, then open WSL2 by running the following command in PowerShell or Command Prompt:

wsl

Follow Linux instructions for all subsequent steps.

2. Install Python ≥3.12

Check if python already is installed:

python3 --version
python --version

If Python 3.12+ is not installed:

Linux (Ubuntu/Debian) & WSL2 on Windows:

sudo apt update
sudo apt install python3.12 python3.12-venv python3-pip

macOS:

# Using Homebrew
brew install python@3.12

3. Install Poetry

curl -sSL https://install.python-poetry.org | python3 -

Add Poetry to PATH:

echo 'export PATH="$HOME/.local/bin:$PATH"' >> ~/.bashrc
source ~/.bashrc

Verify installation:

poetry --version

4. Clone Repository

Click the <> Code dropdown and copy the HTTPS repository url to clipboard.

git clone <repository-url>
cd <repository-name>

5. Install Dependencies

poetry install

Installation may take several minutes to complete.

6. Launch Tutorial

poetry run marimo edit ./notebooks/01-neural-language-model.py

Your browser should open with the marimo notebook interface.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages