Improving oracle by arifimtiaz012 · Pull Request #18 · igreat/agentic-bo

arifimtiaz012 · 2026-03-17T23:18:31Z

Summary

Added new models HGBT and GP.
Refactored model selection criteria from RMSE toward Spearman ranking quality, and added a benchmarking workflow strictly for oracle performance observation (per chosen dataset).

Key Changes

Model Pool

Added:
- HistGradientBoostingRegressor (HGBT)
- GaussianProcessRegressor (GP) with ConstantKernel * RBF + WhiteKernel
All models (including Extra Trees) now use n_estimators=200
Model pool configurable via model_candidates

Model Selection (Ranking-first)

Selection priority:
1. Spearman (top-K%)
2. Spearman (full dataset)
3. RMSE (tiebreaker)
Metrics (via cross_val_predict):
- rmse
- spearman_all
- spearman_top_k
Added top_k_pct (default 3%)

Categorical Handling

Strategy change for missing categorical values: most_frequent → constant("no_value")
Preserves missingness as explicit "no_value" category
Avoids ambiguity on CSV reload

Oracle Metadata

Now stores:
- Per-model metrics (rmse, Spearmans)
- Selected model scores
- top_k_pct, objective, seed, backend_id
Replaces RMSE-only tracking

Utilities

Added _safe_spearman
- Handles small samples (<3)
- Guards against NaNs

CLI Updates

build-oracle:
- --top-k-pct
- --model-candidates

Benchmarking (New)

New script: oracle_benchmark.py

Design

Seed with true targets, then iterate using oracle predictions
Compare all models vs random baseline

Features

Stratified seed sampling (by target quantiles)
Mixed-feature nearest-neighbour lookup
Metrics:
- Convergence (best value per iteration)
- Top-K% recovery rate
- First-hit iteration

Output

Multi-panel PDF (per model + random baseline)
See above image as an example

Note

Code defaults to trying all four implemented models. For development purposes, you may want to run using only ExtraTrees and RandomForest as before if runs feel too slow

arifimtiaz012 added 5 commits March 14, 2026 03:32

Added some benchmark checks that give more info than RMSE alone

6db15f1

Uncertainty/confidence factor considered for oracle points

b941a77

GBT and GP models added. Oracle benchmark improved.

e84ee9d

removing unnecessary comparison code

76c4165

refactor to match main branch

22da884

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improving oracle#18

Improving oracle#18
arifimtiaz012 wants to merge 5 commits into
mainfrom
improve_oracle

arifimtiaz012 commented Mar 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

arifimtiaz012 commented Mar 17, 2026

Summary

Key Changes

Model Pool

Model Selection (Ranking-first)

Categorical Handling

Oracle Metadata

Utilities

CLI Updates

Benchmarking (New)

Note

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant