These commands assume:
- Python env:
source /path/to/StructTokenBench/bin/activate - Python:
python - H5 embeddings:
./inference_embed_results/.../embeddings - Data roots:
- Functional:
./pst_evaluation/struct_token_bench_release_data/data/functional/local - Structural:
./pst_evaluation/struct_token_bench_release_data/data/structural - Physicochemical (ATLAS):
./pst_evaluation/struct_token_bench_release_data/data/physicochemical - Sensitivity (Apo/Holo):
./pst_evaluation/struct_token_bench_release_data/data/sensitivity
- Functional:
Adjust paths if your local layout is different.
Optional shorthand:
export EMB_DIR=./inference_embed_results/.../embeddings- StructTokenBench evaluation data (labels/splits used by eval scripts via
--data-root): download and extract into./pst_evaluation/struct_token_bench_release_data/ - PST raw CIF/H5 structures (used to generate embeddings): download and extract into, then run
inference_embed.py(repo root) to produce H5 embeddings for the eval scripts (--h5) - For PST evaluation, embeddings should exclude missing residues (no
-1rows). This matches per-residue labels, and some tasks (e.g., remote homology) do not filter-1rows.
python pst_evaluation/biolip2_binding_eval.py \
--h5 "$EMB_DIR/vq_embed_biolip2_binding_lite_model.h5" \
--data-root ./pst_evaluation/struct_token_bench_release_data/data/functional/localpython pst_evaluation/biolip2_catalytic_eval.py \
--h5 "$EMB_DIR/vq_embed_biolip2_catalytic_lite.h5" \
--data-root ./pst_evaluation/struct_token_bench_release_data/data/functional/localpython pst_evaluation/proteinshake_binding_eval.py \
--h5 "$EMB_DIR/vq_embed_proteinshake_lite.h5" \
--data-root ./pst_evaluation/struct_token_bench_release_data/data/functional/localpython pst_evaluation/interpro_binding_eval.py \
--h5 "$EMB_DIR/vq_embed_interpro_binding_lite.h5" \
--data-root ./pst_evaluation/struct_token_bench_release_data/data/functional/localpython pst_evaluation/interpro_activesite_eval.py \
--h5 "$EMB_DIR/vq_embed_interpro_activesite_lite.h5" \
--data-root ./pst_evaluation/struct_token_bench_release_data/data/functional/localpython pst_evaluation/interpro_conserved_eval.py \
--h5 "$EMB_DIR/vq_embed_conserved_lite.h5" \
--data-root ./pst_evaluation/struct_token_bench_release_data/data/functional/localpython pst_evaluation/interpro_repeats_eval.py \
--h5 "$EMB_DIR/vq_embed_interpro_repeats.h5" \
--data-root ./pst_evaluation/struct_token_bench_release_data/data/functional/localpython pst_evaluation/proteinglue_epitope_region_eval.py \
--h5 "$EMB_DIR/vq_embed_proteinglue_lite.h5" \
--data-root ./pst_evaluation/struct_token_bench_release_data/data/functional/localpython pst_evaluation/atlas_rmsf_eval.py \
--h5 "$EMB_DIR/vq_embed_atlas_lite.h5" \
--data-root ./pst_evaluation/struct_token_bench_release_data/data/physicochemicalpython pst_evaluation/atlas_bfactor_eval.py \
--h5 "$EMB_DIR/vq_embed_atlas_lite.h5" \
--data-root ./pst_evaluation/struct_token_bench_release_data/data/physicochemicalpython pst_evaluation/atlas_neq_eval.py \
--h5 "$EMB_DIR/vq_embed_atlas_lite.h5" \
--data-root ./pst_evaluation/struct_token_bench_release_data/data/physicochemicalpython pst_evaluation/remote_homology_eval.py \
--h5 "$EMB_DIR/vq_embed_remote_homology_train_tst_valid.h5" \
--data-root ./pst_evaluation/struct_token_bench_release_data/data/structural \
--progresspython pst_evaluation/apolo_unsupervised_eval.py \
--h5 "$EMB_DIR/vq_embed_apolo_lite.h5" \
--data-root ./pst_evaluation/struct_token_bench_release_data/data/sensitivity \
--target-field tm_score