@ASTRAL-Group

ASTRAL Group @ UIUC

ASsured and TRustworthy AI research Lab @ University of Illinois Urbana-Champaign (UIUC), led by Prof. Huan Zhang

Popular repositories

  1. AlphaOne Public

    [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

    Python 88 5

  2. ASTRA Public

    [CVPR 2025] Official implementation for "Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbreaks"

    Python 58 5

  3. data-efficient-llm-rl Public

    Python 38 1

  4. LoRe Public

    When Reasoning Meets Its Laws

    Python 36 3

  5. SVIP Public

    SVIP: Towards Verifiable Inference of Open-Source Large Language Models

    Python 15 1

  6. BDC-mitigation-assessment Public

    [ICML 2025] Official implementation for "The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination"

    Python 15

Repositories

Showing 10 of 13 repositories
  • ASTRAL-Group.github.io
    0 0 0 0 Updated Apr 4, 2026
  • WebAgent_Visual_Attribution
    HTML 6 0 0 0 Updated Apr 3, 2026
  • MonitorBench Public

    Official implementation for "MonitorBench: A Comprehensive Benchmark for Chain-of-Thought Monitorability in Large Language Models" (Under Construction)

    9 0 1 0 Updated Mar 31, 2026
  • ReCAP-Agent Public

    ReCAP-Agent is an open stack for generating, evaluating, and training CAPTCHA-capable GUI agents.

    Python 9 MIT 0 0 0 Updated Mar 26, 2026
  • data-efficient-llm-rl Public
    Python 38 1 0 0 Updated Jan 16, 2026
  • LoRe Public

    When Reasoning Meets Its Laws

    Python 36 3 1 0 Updated Jan 1, 2026
  • LRM_Conta_Detection_Arena Public

    [ICLR 2026] Official implementation for "On the Fragility of Benchmark Contamination Detection in Reasoning Models"

    Jupyter Notebook 11 0 0 0 Updated Oct 9, 2025
  • ZO_Fine_tuner Public
    Python 2 0 0 0 Updated Oct 7, 2025
  • DecepChain Public

    Official implementation for "DecepChain: Inducing Deceptive Reasoning in Large Language Models"

    Python 4 Apache-2.0 0 0 0 Updated Oct 5, 2025
  • ASTRA Public

    [CVPR 2025] Official implementation for "Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbreaks"

    Python 58 5 1 0 Updated Jul 4, 2025
