
Fine-Tuning LLMs for Multi-Task Predictive Process Monitoring

Paper under review.

Overview

  • This repository contains the code and scripts to fine-tune large language models (LLMs) for multi-task predictive process monitoring (PPM).
  • We use uv to manage our local environment.
  • Tested only on Ubuntu 24.04 using Python 3.12.

Requirements

Install all dependencies with:

uv venv .venv --python 3.12
source .venv/bin/activate
uv pip install -r requirements.txt
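
To quickly verify the environment, the minimal sanity check below should run without errors; it assumes PyTorch and Transformers are among the pinned dependencies in requirements.txt:

# sanity check: core dependencies import and CUDA is visible
import torch
import transformers

print("torch:", torch.__version__)
print("transformers:", transformers.__version__)
print("CUDA available:", torch.cuda.is_available())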

Scripts and Structure

.
├── data/                           # Event logs (automatically downloaded)
├── scripts/                        # Experiment scripts and configs
│   ├── *.sh                        
│   ├── *.txt                       
│   └── *.slurm                     
├── notebooks/                      # Analysis notebooks
├── ppm/                            # Source code
├── luijken_transfer_learning.py    # Competitor training script
├── rebmann_et_al.py                # Narrative-style competitor training script
├── next_event_prediction.py        # Main training script
├── requirements.txt                # Python dependencies
└── README.md                       # This file

Data

We use five public event logs. They are downloaded automatically via SkPM into data/<LOG>/.
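
As a rough illustration, fetching a single log via SkPM looks like the sketch below; the import path and class name are assumptions based on SkPM's event-log interface, so check the SkPM documentation for the exact API. The training script handles the downloads for you.

# illustrative sketch (assumed SkPM API): download and cache one event log
from skpm.event_logs import BPI20PrepaidTravelCosts

log = BPI20PrepaidTravelCosts()  # downloads the log on first use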

Usage

Single experiments

RNN baseline

python next_event_prediction.py \
  --dataset BPI20PrepaidTravelCosts \
  --backbone rnn \
  --embedding_size 32 \
  --hidden_size 128 \
  --lr 0.0005 \
  --batch_size 64 \
  --epochs 25 \
  --categorical_features activity \
  --continuous_features all \
  --categorical_targets activity \
  --continuous_targets remaining_time
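
The categorical_targets and continuous_targets arguments define the multi-task setup: a shared backbone feeding one classification head (next activity) and one regression head (remaining time). The sketch below only illustrates that idea with hypothetical names; the actual model code lives in ppm/.

# illustrative multi-task output heads on top of a shared backbone (hypothetical names)
import torch
import torch.nn as nn

class MultiTaskHead(nn.Module):
    def __init__(self, hidden_size: int, num_activities: int):
        super().__init__()
        self.activity_head = nn.Linear(hidden_size, num_activities)  # categorical target: next activity
        self.time_head = nn.Linear(hidden_size, 1)                   # continuous target: remaining time

    def forward(self, hidden_state: torch.Tensor):
        return self.activity_head(hidden_state), self.time_head(hidden_state)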

LLM fine-tuning

To use LLMs, you need a HuggingFace token. There are a few ways to provide it:

  • Create a .env file in the root of this repository containing HF_TOKEN=<YOUR_TOKEN> (see the sketch after this list)
  • Export an environment variable: export HF_TOKEN="<YOUR_TOKEN>"
  • Hard-code it directly in the code
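
For the .env option, a minimal sketch of how the token can be loaded is shown below; it assumes python-dotenv is installed, and the repository code may already handle this for you:

# minimal sketch: read HF_TOKEN from a .env file in the repository root (assumes python-dotenv)
import os
from dotenv import load_dotenv

load_dotenv()                      # loads variables from .env into the environment
hf_token = os.environ["HF_TOKEN"]  # raises KeyError if the token is missing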

For local debugging, try the tiny setup below: a small r value, the BPI20PrepaidTravelCosts log, and the qwen25-05b backbone. If it does not fit in your GPU memory, keep decreasing batch_size (a batch size of 4 uses less than 2 GB).

python next_event_prediction.py \
  --dataset BPI20PrepaidTravelCosts \
  --backbone qwen25-05b \
  --embedding_size 896 \
  --hidden_size 896 \
  --lr 0.00005 \
  --batch_size 64 \
  --epochs 1 \
  --categorical_features activity \
  --continuous_features all \
  --categorical_targets activity \
  --continuous_targets remaining_time \
  --fine_tuning lora \
  --r 2 \
  --lora_alpha 4
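
For reference, the --fine_tuning lora, --r, and --lora_alpha arguments correspond roughly to a PEFT LoRA configuration like the sketch below. This is an illustration only; the actual wiring lives in next_event_prediction.py and ppm/, and the mapping of qwen25-05b to the Qwen/Qwen2.5-0.5B checkpoint is an assumption.

# illustrative PEFT/LoRA setup matching the arguments above (not the repository's exact code)
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-0.5B")  # assumed checkpoint for qwen25-05b
lora_config = LoraConfig(r=2, lora_alpha=4)                             # small rank/alpha for local debugging
model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()                                      # only a tiny fraction of weights is trainable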

Optionally, pass the --wandb flag to enable logging to Weights & Biases.

Hyperparameter search

We ran the hyperparameter search with Slurm on our HPC cluster. Check scripts/*.sh, scripts/*.txt, and scripts/*.slurm to reproduce our jobs or adapt the configurations to run locally.

Results

All metrics and analysis notebooks are in the notebooks/ folder. Check this notebook for plots that did not fit in the paper.

Contact

For questions or feedback, reach me at rafael.oyamada@kuleuven.be or open an issue here.

About

Parameter-efficient fine-tuning (PEFT) of Large Language Models (LLMs) for Predictive Process Monitoring (PPM).
