This document shows the actual commands end users run to profile their models. This is the workflow you follow after setting up the environment.
The profiling pipeline is model-agnostic: once you have a `.pte` file, the same commands work for any model.
Setup → Build Runners → Export Model → Run Pipeline → Analyze Results
```bash
# From executorch_sme2_kit/ directory
bash model_profiling/scripts/setup_repo.sh
```

This creates:
- `.venv/` - Python virtual environment
- `executorch/` - ExecuTorch checkout
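As a quick sanity check, both directories should exist at the kit root once the script finishes:

```bash
# Both should now exist at the top of executorch_sme2_kit/
ls -d .venv executorch
```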
```bash
# From executorch_sme2_kit/ directory
bash model_profiling/scripts/build_runners.sh
```

This builds:
- `executorch/cmake-out/mac-arm64/executor_runner` (SME2 ON)
- `executorch/cmake-out/mac-arm64-sme2-off/executor_runner` (SME2 OFF)
- Android runners (if `ANDROID_NDK` is set)
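To confirm the macOS runners were built, check for both binaries:

```bash
# Both binaries should exist after a successful build
ls -l executorch/cmake-out/mac-arm64/executor_runner \
      executorch/cmake-out/mac-arm64-sme2-off/executor_runner
```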
```bash
# From executorch_sme2_kit/ directory
# Activate venv first
source .venv/bin/activate

# Export model
python model_profiling/export/export_model.py \
  --model <model_name> \
  --dtype fp16 \
  --outdir out_<model>/artifacts/

# This creates:
# - out_<model>/artifacts/<model>_xnnpack_fp16.pte
# - out_<model>/artifacts/<model>_xnnpack_fp16.pte.etrecord
```

Model-specific notes:
- For models in `model_profiling/models/`, use the registered name
- For custom models, you may need to add model registration code first
- See the learning path documentation for model onboarding details
```bash
# Copy template
cp model_profiling/configs/templates/mac_template.json \
  model_profiling/configs/my_experiment.json

# Edit the config:
# - Set "model" to your .pte path (e.g., "out_<model>/artifacts/<model>_xnnpack_fp16.pte")
# - Adjust "experiments" array (SME2 on/off, threads, runs, etc.)
# - Set "output_root" (e.g., "out_<model>/runs/mac")
```
macOS:

```bash
# From executorch_sme2_kit/ directory
source .venv/bin/activate
python3 model_profiling/scripts/mac_pipeline.py \
  --config model_profiling/configs/my_experiment.json
```

Android:

```bash
# Ensure device is connected and ANDROID_NDK is set
python3 model_profiling/scripts/android_pipeline.py \
  --config model_profiling/configs/android_experiment.json
```

With options:
```bash
# Run only specific experiments
python3 model_profiling/scripts/mac_pipeline.py \
  --config model_profiling/configs/my_experiment.json \
  --only mac_sme2_on mac_sme2_off

# Re-run analysis only (skip profiling execution)
python3 model_profiling/scripts/mac_pipeline.py \
  --config model_profiling/configs/my_experiment.json \
  --analysis-only

# Verbose output
python3 model_profiling/scripts/mac_pipeline.py \
  --config model_profiling/configs/my_experiment.json \
  --verbose

# Android with remote device
python3 model_profiling/scripts/android_pipeline.py \
  --config model_profiling/configs/android_experiment.json \
  --remote-device 192.168.1.100:5555
```

Note: The pipeline automatically runs analysis after profiling, generating CSV files from ETDump. No separate `analyze_results.py` step is needed.
This creates:
- `out_<model>/runs/<platform>/` - Run output directory
- `out_<model>/runs/<platform>/manifest.json` - Run metadata
- `out_<model>/runs/<platform>/metrics.json` - Timing metrics
- `out_<model>/runs/<platform>/<model_stem>_pipeline_summary.json` - Pipeline summary
- `out_<model>/runs/<platform>/<experiment_name>/` - Per-experiment results:
  - `*.etdump` - ETDump trace files
  - `*_exec_all_runs_timeline.csv` - Timeline CSV (all runs)
  - `*_exec_run0_timeline.csv` - Timeline CSV (run 0)
  - `*_exec_ops_stats.csv` - Operator statistics CSV
  - `*.log` - Runner logs
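To take a quick look at a run's timeline, peek at the CSV directly (file names follow the patterns above; the CSV columns themselves are not documented here):

```bash
# Show the header and first few rows of the run-0 timeline CSV
head -n 5 out_<model>/runs/mac/<experiment_name>/<model_stem>_<experiment>_t<threads>_exec_run0_timeline.csv
```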
Note: The pipeline automatically runs analysis after profiling. You only need to run this manually if:
- You want to re-analyze existing ETDump files
- Analysis failed during pipeline execution
```bash
# From executorch_sme2_kit/ directory
source .venv/bin/activate
python3 model_profiling/scripts/analyze_results.py \
  --run-dir out_<model>/runs/mac
```

This creates:
- `out_<model>/runs/mac/analysis_summary.json` - Operator-level breakdown
- CSV files in the same directory as the ETDump files (if not already generated)
```bash
python model_profiling/scripts/validate_results.py \
  --results out_<model>/runs/mac
```

After setup and build, validate that everything works:
```bash
# From executorch_sme2_kit/ directory
source .venv/bin/activate
python model_profiling/scripts/run_quick_test.py
```

This runs: validate → build → export toy model → pipeline (with automatic analysis) → validate
```
executorch_sme2_kit/
├── .venv/                    # Python environment
├── executorch/               # ExecuTorch checkout
│   └── cmake-out/
│       ├── mac-arm64/executor_runner
│       └── mac-arm64-sme2-off/executor_runner
├── model_profiling/
│   ├── scripts/              # Pipeline scripts
│   ├── export/               # Export script
│   ├── configs/              # Experiment configs
│   └── models/               # Model registry
├── out_<model>/              # Per-model outputs
│   ├── artifacts/            # Exported .pte files
│   └── runs/                 # Profiling results
│       └── mac/
│           ├── manifest.json
│           ├── metrics.json
│           ├── <model_stem>_pipeline_summary.json
│           ├── <model_stem>_pipeline_summary.md
│           ├── analysis_summary.json   # generated automatically by pipeline
│           └── <experiment_name>/      # Per-experiment directory
│               ├── <model_stem>_<experiment>_t<threads>.etdump
│               ├── <model_stem>_<experiment>_t<threads>_exec_all_runs_timeline.csv
│               ├── <model_stem>_<experiment>_t<threads>_exec_run0_timeline.csv
│               ├── <model_stem>_<experiment>_t<threads>_exec_ops_stats.csv
│               └── <model_stem>_<experiment>_t<threads>_latency.log
└── models/                   # Legacy location (if used)
```
- Model-agnostic pipeline: once you have a `.pte`, the same pipeline commands work
- Config-driven experiments: JSON configs define what to run; the scripts execute them
- Output organization: results go under `out_<model>/runs/<platform>/` for clear organization
- Version traceability: runners stay in `executorch/cmake-out/` to track the ExecuTorch version
- Export model once
- Create config with two experiments (SME2 on, SME2 off)
- Run pipeline (a condensed end-to-end sequence follows this list)
- Analyze results to see operator-level differences
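Putting those steps together, and reusing the hypothetical `my_experiment.json` and experiment names from above, the comparison might run like this:

```bash
# End-to-end SME2 comparison (assumes the config sketched earlier)
source .venv/bin/activate

# Export once
python model_profiling/export/export_model.py \
  --model <model_name> \
  --dtype fp16 \
  --outdir out_<model>/artifacts/

# Run only the two SME2 experiments
python3 model_profiling/scripts/mac_pipeline.py \
  --config model_profiling/configs/my_experiment.json \
  --only mac_sme2_on mac_sme2_off
```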
- Export model once
- Create config with `"threads": [1, 2, 4]` in experiments
- Run pipeline
- Compare `metrics.json` across thread counts (see the sketch after this list)
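A quick way to eyeball the numbers (`metrics.json` lives at the run root per the output listing above; its exact schema is not documented here, so this just pretty-prints it for manual comparison):

```bash
# Pretty-print run-level timing metrics
python3 -m json.tool out_<model>/runs/mac/metrics.json

# Skim per-experiment operator stats as aligned columns
column -s, -t < out_<model>/runs/mac/<experiment_name>/<model_stem>_<experiment>_t<threads>_exec_ops_stats.csv | less -S
```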
- Build trace-enabled runners (separate CMake preset)
- Create config pointing to trace-enabled runners
- Run pipeline (note: trace logging impacts timing)
- Analyze ETDump for kernel selection insights
- "executor_runner not found": Run
build_runners.shfirst - "Model not found": Check
.ptepath in config JSON - "No .etdump files": Check runner logs in experiment directory
- "Analysis failed": Ensure
.etrecordfile exists (re-export if needed)