SpeedTransformer

Official implementation of SpeedTransformer:

Detecting Transportation Mode Using Dense Smartphone GPS Trajectories and Transformer Models

📄 Read the paper on arXiv
🌐 Project Page

Preparing the Data

Geolife Dataset

The Geolife dataset provides GPS trajectories collected from users. To preprocess this dataset:

Download the Dataset
- Obtain the Geolife GPS trajectory dataset from Microsoft Research.
- Unzip the dataset to a directory on your machine.
Run the Preprocessing Script

Use the data/geolife.py script to process the data. This script utilizes multiprocessing for efficient processing and typically completes in under 20 minutes:
```
python process_geolife.py --data-folder "Geolife Trajectories 1.3/Data" --output-file "geolife.csv"
```
Post-Processing

After preprocessing, run extract_speed_geolife.py to compute additional features like speed and distance:

python extract_speed_geolife.py geolife.csv --output_file geolife_processed.csv

MOBIS Dataset

The MOBIS dataset can be processed using a similar method. The processed MOBIS data can be found here: https://zenodo.org/records/17429944

Running the Models

This repository provides two primary model architectures:

LSTM-based trip classification (models/lstm/).
Transformer-based trip classification (models/transformer/).

Each architecture includes dedicated scripts for training and fine-tuning. The following shell scripts are available:

Shell Scripts Overview

Replication helpers (`models/replication/`)

run_training_experiments.sh – replays the best transformer and LSTM Geolife/Mobis training jobs.
run_gl_finetune_experiments.sh – reproduces the Geolife finetuning winners for both model families.
run_gl_lowshot_finetune_experiments.sh – fine-tunes the MOBIS transformer on 100/200 Geolife trajectories (low-shot).
run_miniprogram_finetune_experiments.sh – regenerates the CarbonClever finetuning leaderboard models.
run_window_sweep_experiments.sh – reruns the top Geolife window sweep configuration.
metrics_gen.py – converts experiment logs into the replication figures and summary table (experiment_summary.csv).

All scripts assume the datasets under data/ and write results back into their respective models/**/experiments/ folders so checkpoints, logs, and metrics line up with the paper tables.

Quick Start Snippet

# Example: reproduce the key training checkpoints
cd /data/A-SpeedTransformer/models/replication
./run_training_experiments.sh

# Then regenerate plots / tables
python metrics_gen.py

Colab Notebook

For an end-to-end, notebook-based replication you can open SpeedTransformer.ipynb directly in Google Colab. Appendix I in the paper lists the expected runtimes and resource notes for that workflow.

License & Contact

This project is licensed under the MIT License. Feel free to open issues or pull requests on GitHub. For questions or contributions, please reach out to Othmane Echchabi.

Citation

If you find this work useful, please cite:

@article{zhang2026speedtransformer, title = {Detecting Transportation Mode Using Dense Smartphone GPS Trajectories and Transformer Models}, author = {Zhang, Yuandong and Echchabi, Othmane and Feng, Tianshu and Zhang, Wenyi and Liao, Hsuai-Kai and Chang, Charles}, journal = {International Journal of Geographical Information Science (IJGIS)}, year = {2026} }

Name		Name	Last commit message	Last commit date
Latest commit History 66 Commits
data		data
models		models
.gitignore		.gitignore
README.md		README.md
SpeedTransformer.ipynb		SpeedTransformer.ipynb
sampling.py		sampling.py
sampling_frequency_histograms.png		sampling_frequency_histograms.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SpeedTransformer

Preparing the Data

Geolife Dataset

MOBIS Dataset

Running the Models

Shell Scripts Overview

Replication helpers (`models/replication/`)

Quick Start Snippet

Colab Notebook

License & Contact

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SpeedTransformer

Preparing the Data

Geolife Dataset

MOBIS Dataset

Running the Models

Shell Scripts Overview

Replication helpers (models/replication/)

Quick Start Snippet

Colab Notebook

License & Contact

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Replication helpers (`models/replication/`)

Packages