Skip to content
#

confidence-calibration

Here are 20 public repositories matching this topic...

[ICCV 2025 CVAMD] The official implementation of the paper "Prompt4Trust: A Reinforcement Learning Prompt Augmentation Framework for Clinically-Aligned Confidence Calibration in Multimodal Large Language Models".

  • Updated Dec 11, 2025
  • Python

Here’s a complete Streamlit app scaffold that lets you: Enter your Gemini API key in the sidebar Upload up to four MRI images Invoke Gemini’s advanced image‐analysis (labels, objects, text) View the raw JSON analytics directly in the app

  • Updated Jul 15, 2025
  • Python

yuragi — LLM Confidence Fragility Analyzer. Perturbation-driven hallucination detection with workshop-grade real benchmarks (TruthfulQA n=412 ensemble AUC 0.73, TriviaQA n=200 confidence-inversion AUC 0.75).

  • Updated Apr 20, 2026
  • Python

🔍 Analyze the mathematical reasoning abilities of the Mistral-7B model using diverse prompting techniques on multi-step math problems.

  • Updated Apr 24, 2026
  • HTML

Improve this page

Add a description, image, and links to the confidence-calibration topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the confidence-calibration topic, visit your repo's landing page and select "manage topics."

Learn more