Extending PrismBench

Comprehensive guide to extending the PrismBench framework with custom components

PrismBench is designed for extensibility. The framework provides three primary extension points (agents, environments, and MCTS phases) that let you customize behavior for your specific evaluation needs.

Extension Overview

Agent Extensions

Create custom LLM agents with specialized prompts, behaviors, and interaction patterns.

Use Cases:

Custom problem generators for domain-specific challenges
Specialized solution validators
Domain-expert evaluators
Multi-modal agents

Learn More →

Environment Extensions

Build custom evaluation environments with specialized workflows and agent orchestration.

Use Cases:

Multi-step problem solving environments
Interactive coding challenges
Domain-specific evaluation workflows
Custom test generation and validation

Learn More →

Phase Extensions

Implement custom MCTS phases with specialized search strategies and evaluation objectives.

Use Cases:

Alternative exploration strategies
Domain-specific scoring functions
Multi-objective optimization phases
Custom convergence criteria
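To make "exploration strategy" and "scoring function" concrete, here is a minimal sketch of UCB1, the standard selection rule used in many MCTS implementations. It is not taken from PrismBench's code; the function names and the `(name, total_value, visits)` tuple layout are assumptions chosen for illustration. A custom phase would typically swap in a different scoring rule at this point.

```python
import math


def ucb1(total_value: float, visits: int, parent_visits: int,
         c: float = math.sqrt(2)) -> float:
    """Standard UCB1 score: mean value plus an exploration bonus."""
    if visits == 0:
        # Unvisited children are always tried first.
        return math.inf
    exploitation = total_value / visits
    exploration = c * math.sqrt(math.log(parent_visits) / visits)
    return exploitation + exploration


def select_child(children, parent_visits: int, c: float = math.sqrt(2)) -> str:
    """Pick the child with the highest UCB1 score.

    `children` is a list of (name, total_value, visits) tuples;
    this layout is hypothetical, not PrismBench's node type.
    """
    best = max(children, key=lambda ch: ucb1(ch[1], ch[2], parent_visits, c))
    return best[0]
```

Setting `c` to 0 turns the rule into pure exploitation, while larger values favor rarely visited nodes; that single constant is often the first thing an alternative exploration strategy changes.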

Learn More →

Architecture Overview

PrismBench uses a registry pattern for all extensions, enabling:

Plugin Architecture: Extensions are automatically discovered and loaded
Decorator-Based Registration: Simple decorators register new components
Runtime Resolution: Components are resolved dynamically at execution time
Configuration-Driven: All behavior is customizable through YAML files

graph TD
    A[PrismBench Core] --> B[Agent Registry]
    A --> C[Environment Registry] 
    A --> D[Phase Registry]
    
    B --> E[Custom Agents]
    C --> F[Custom Environments]
    D --> G[Custom Phases]
    
    E --> H[Agent Configs]
    F --> I[Environment Configs]
    G --> J[Phase Configs]

Quick Start Guide

1. Choose Your Extension Type

Extension Type	Use Case
Agent	Custom prompts, specialized behaviors
Environment	Custom workflows, agent orchestration
Phase	Search strategies, evaluation objectives

2. Development Workflow

Create your extension using the provided templates
Configure behavior through YAML files
Test your extension with the framework
Deploy by placing files in the appropriate directories

3. Extension Integration

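Once registered, custom components are created by name through the framework's factory helpers:

# Extensions are discovered automatically; import the framework entry points
from src.services.llm_interface.src.llm.interface import LLMInterface
from src.services.environment.src.environment.utils import create_environment
from src.services.search.src.mcts.utils import create_phase

# Create your custom components by their registered names.
# `tree`, `environment`, and `config` are placeholders for the search tree,
# environment instance, and phase configuration built elsewhere in your setup.
custom_environment = create_environment("my_custom_environment")
custom_phase = create_phase("my_custom_phase", tree, environment, config)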
Extension Combinations

Extensions can be combined: for example, a custom phase can drive a custom environment that orchestrates custom agents.

Combination Strategies →

Best Practices

Development Guidelines

Follow Naming Conventions: Use descriptive, consistent names
Document Thoroughly: Include docstrings and configuration examples

Performance Considerations

Async Operations: Use async/await for IO-bound operations
Resource Management: Clean up resources (sessions, files) properly
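Both points can be illustrated together with a minimal sketch that uses only the Python standard library. `FakeSession`, `open_session`, and `evaluate_all` are hypothetical stand-ins, not PrismBench classes; a real extension would wrap its own HTTP or LLM client in the same way.

```python
import asyncio
from contextlib import asynccontextmanager


class FakeSession:
    """Hypothetical stand-in for a network session."""

    def __init__(self):
        self.closed = False

    async def fetch(self, prompt: str) -> str:
        await asyncio.sleep(0)  # yield control, as real IO would
        return f"response to {prompt}"

    async def close(self) -> None:
        self.closed = True


@asynccontextmanager
async def open_session():
    """Guarantee the session is closed, even if a request raises."""
    session = FakeSession()
    try:
        yield session
    finally:
        await session.close()


async def evaluate_all(prompts):
    async with open_session() as session:
        # Run the IO-bound requests concurrently instead of one by one.
        results = await asyncio.gather(*(session.fetch(p) for p in prompts))
    return results, session


results, session = asyncio.run(evaluate_all(["a", "b"]))
```

The `finally` clause in `open_session` is what makes the cleanup reliable: the session is closed whether the requests succeed or fail.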

Integration Tips

Configuration Schema: Follow existing YAML patterns
Error Handling: Use framework exception types
State Management: Leverage framework session management

Community Extensions

Contributing Extensions

Follow the Contributing Guide
Test thoroughly with multiple scenarios
Document usage and configuration
Submit a pull request with examples

Extension Gallery

Coming soon

Support & Resources

Documentation

Architecture Overview - Framework design
Configuration Guide - YAML configuration
API Reference - Service APIs

Related Pages

Extension Guides

Custom Agents - Creating specialized agents with custom prompts
Custom Environments - Building custom evaluation environments
Custom MCTS Phases - Implementing custom search strategies
Extension Combinations - Advanced extension strategies

System Understanding

Architecture Overview - Framework design and components
Agent System - Multi-agent architecture
Environment System - Evaluation environments
MCTS Algorithm - Monte Carlo Tree Search

Getting Started

Quick Start - Basic setup and first run
Configuration Overview - Configuration system
Troubleshooting - Common issues and solutions
