Releases: IBM/text2sql-eval-toolkit
Releases · IBM/text2sql-eval-toolkit
v1.0.0
Pip-installable text2sql-eval-toolkit library (v1.0.0)
Added
- Pip-installable text2sql-eval-toolkit library with packaged benchmark metadata.
- Curated top-level Python API for evaluation (evaluate_prediction, evaluate_predictions, run_evaluation).
- Execution orchestration helper (run_execution) and benchmark discovery utilities (get_available_benchmarks, get_benchmarks_info, get_benchmark_info).
- Public inference pipelines (LLMSQLGenerationPipeline, AgenticSQLGenerationPipeline) for reproducing baseline and agentic experiments.
- Re-exported low-level SQL comparison and parsing helpers (compare_result_dfs, sql_exact_match, etc.) from unitxt.text2sql_utils.
- Library-focused README examples showing record-level, file-level, and benchmark-level usage.