Robusttail

Quick start

After installation, from the repo root run a short tail-probability experiment:

cd experiments/run_scripts
PYTHONPATH="$(git rev-parse --show-toplevel)" python exp_tail_probability.py --experiment quick_run

(If not in a git repo, set PYTHONPATH to the project root instead.)

Experiments

Run scripts for tail probability estimation, quantile estimation, and benchmarks.

All commands below assume you are in the run_scripts directory with the project root on PYTHONPATH.

Setup

Add the project root to PYTHONPATH (so imports like droevt, tail_probability resolve).
```
export PYTHONPATH=/path/to/robusttail
```
Use run_scripts as the working directory.
```
cd experiments/run_scripts
```

Example run (tail probability):

PYTHONPATH=/path/to/robusttail python exp_tail_probability.py --experiment thresholds

Scripts

1. `exp_tail_probability.py` — DRO tail probability experiments

Runs distributionally robust (DRO) tail probability estimation on synthetic and real (CMT) data: multiple distributions, threshold percentages, objective functions, and scarce data.

Option	Description
`--experiment quick_run`	Short run (1 rep) for a quick sanity check (default).
`--experiment thresholds`	Vary threshold percentages (single and multi-threshold).
`--experiment percentage_lhs`	Vary left quantile level (0.9–0.99) in the objective function.
`--experiment scarce_data`	Scarce synthetic data (n=30).
`--experiment real_data`	Real CMT earthquake data, multiple regions and thresholds.

How to run:

PYTHONPATH=/path/to/robusttail python exp_tail_probability.py --experiment thresholds

2. `exp_quantile_estimation.py` — DRO quantile estimation

Runs DRO quantile estimation (ellipsoidal ambiguity) on synthetic data: gamma, lognorm, pareto; multiple threshold percentages (single and multi).

How to run:

PYTHONPATH=/path/to/robusttail python exp_quantile_estimation.py

3. `exp_cmt.py` — Real data (CMT) vs EVT

Compares DRO tail probability with EVT-based methods on parsed CMT (earthquake) data. Lets you vary threshold percentage, confidence level, or run bootstrap estimation.

Option	Description
`--function run_different_threshold_percentages`	Vary threshold percentage (default).
`--function run_different_exceedance_level`	Vary exceedance level.
`--function run_different_confidence_levels`	Vary confidence levels.
`--function run_bootstrap_estimation`	Bootstrap estimation.

How to run:

PYTHONPATH=/path/to/robusttail python exp_cmt.py --function run_different_threshold_percentages

4. `exp_bound_support.py` — Bounded-support (Gen Pareto) experiment

Runs DRO tail probability with bounded right endpoint on generalized Pareto data: multiple quantiles (0.95, 0.99, 0.995) and right endpoints.

How to run:

PYTHONPATH=/path/to/robusttail python exp_bound_support.py

5. `exp_benchmark_run.py` — EVT benchmark (synthetic + real)

Runs traditional EVT tail probability methods (POT, POT bootstrap, PL, Bayesian, PWM) on synthetic or real CMT data for comparison with DRO. --exp is required.

Option	Description
`--exp synthetic`	Synthetic data only (gamma, lognorm, pareto, genpareto; n=500; percentiles 0.9, 0.95, 0.99).
`--exp real`	Real CMT data only (multiple regions and left-endpoint thresholds).

How to run:

PYTHONPATH=/path/to/robusttail python exp_benchmark_run.py --exp synthetic
PYTHONPATH=/path/to/robusttail python exp_benchmark_run.py --exp real

Summary

Script	Purpose
`exp_tail_probability.py`	DRO tail probability (synthetic + real); `--experiment` chooses which experiment.
`exp_quantile_estimation.py`	DRO quantile estimation on synthetic data; no args.
`exp_cmt.py`	CMT real data: DRO vs EVT; `--function` chooses which experiment.
`exp_bound_support.py`	Bounded-support (Gen Pareto) tail probability; no args.
`exp_benchmark_run.py`	EVT benchmark (synthetic or real); `--exp` required: `synthetic` or `real`.

Installation

Prerequisites

Conda or Miniconda
Python 3.12
R

Platform support

Supported / tested: macOS (ARM64) and Linux via WSL (Ubuntu/WSL2).
Windows: not supported natively at the moment. Please use WSL.

Environment Setup

Option 1: Using Conda Environment (Recommended)

Create the conda environment from a platform-specific file (recommended):
- Linux / WSL: conda env create -f environment_linux.yml
- macOS (ARM64): conda env create -f environment_osx-arm64.yml
(Optional) Use an explicit lockfile (most reproducible):

This pins exact builds and is best for CI / exact reproduction on the same platform.
```
conda create -n rs --file explicit-linux.txt
```
```
conda create -n rs --file explicit-osx-arm64.txt
```
Activate the environment:
```
conda activate rs
```

Verify installation:

python --version   # Should show Python 3.12
R --version       # Should show R version

Option 2: Manual Installation

If you prefer to install packages manually:

# Create a new conda environment
conda create -n rs python=3.12

# Activate the environment
conda activate rs

# Install core dependencies
conda install -c conda-forge numpy pandas scipy r-base=4.2 rpy2 r-matrix r-mgcv r-ks

# Install Python packages via pip
pip install mosek==11.0.20

R Installation

R is included in the conda environment. For a system-wide R installation, see CRAN.

Mosek Academic Licenses

For academic use, obtain a free license at Mosek Academic Licenses.

After obtaining the license:

Download the license file.
Place it in:
- Linux / macOS: ~/mosek/mosek.lic

Verify installation:

import mosek
print(mosek.Env.getversion())

Verification

Test that everything is installed correctly:

python -c "import numpy, pandas, scipy, mosek, rpy2; print('All packages imported successfully')"

Troubleshooting

R/rpy2 issues: Ensure R is properly installed and R_HOME is set correctly.
Mosek license: Check that the license file is in the correct location and not expired.
Platform-specific issues: Use the appropriate environment file for your platform.

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
droevt		droevt
droevt_tests		droevt_tests
experiments		experiments
.gitignore		.gitignore
README.md		README.md
environment_linux.yml		environment_linux.yml
environment_osx-arm64.yml		environment_osx-arm64.yml
explicit-linux.txt		explicit-linux.txt
explicit-osx-arm64.txt		explicit-osx-arm64.txt
tail_threshold_diagnostics.md		tail_threshold_diagnostics.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Robusttail

Table of contents

Quick start

Experiments

Setup

Scripts

1. `exp_tail_probability.py` — DRO tail probability experiments

2. `exp_quantile_estimation.py` — DRO quantile estimation

3. `exp_cmt.py` — Real data (CMT) vs EVT

4. `exp_bound_support.py` — Bounded-support (Gen Pareto) experiment

5. `exp_benchmark_run.py` — EVT benchmark (synthetic + real)

Summary

Installation

Prerequisites

Platform support

Environment Setup

Option 1: Using Conda Environment (Recommended)

Option 2: Manual Installation

R Installation

Mosek Academic Licenses

Verification

Troubleshooting

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Robusttail

Table of contents

Quick start

Experiments

Setup

Scripts

1. exp_tail_probability.py — DRO tail probability experiments

2. exp_quantile_estimation.py — DRO quantile estimation

3. exp_cmt.py — Real data (CMT) vs EVT

4. exp_bound_support.py — Bounded-support (Gen Pareto) experiment

5. exp_benchmark_run.py — EVT benchmark (synthetic + real)

Summary

Installation

Prerequisites

Platform support

Environment Setup

Option 1: Using Conda Environment (Recommended)

Option 2: Manual Installation

R Installation

Mosek Academic Licenses

Verification

Troubleshooting

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

1. `exp_tail_probability.py` — DRO tail probability experiments

2. `exp_quantile_estimation.py` — DRO quantile estimation

3. `exp_cmt.py` — Real data (CMT) vs EVT

4. `exp_bound_support.py` — Bounded-support (Gen Pareto) experiment

5. `exp_benchmark_run.py` — EVT benchmark (synthetic + real)

Packages