PRISM Configuration Guide

Complete guide to configuring PRISM for optimal performance.

Overview
Environment Variables
Worker Configuration
File Patterns
Performance Tuning
Language-Specific Settings
Advanced Options
Examples

Overview

PRISM can be configured at multiple levels:

Environment Variables - Shell-level configuration
Worker Config (wrangler.toml) - Worker behavior and resources
CLI Options - Per-command configuration
File Patterns - What files to index
Runtime Settings - Performance and optimization

Environment Variables

Set these in your shell or shell profile (.bashrc, .zshrc, etc.)

Required Variables

PRISM_URL

Your deployed Cloudflare Worker URL.

export PRISM_URL=https://claudes-friend.your-username.workers.dev

How to find:

Deploy worker: npm run deploy
Copy URL from deployment output
Or check Cloudflare Dashboard > Workers & Pages

Usage:

# Add to shell profile
echo 'export PRISM_URL=https://your-worker.workers.dev' >> ~/.bashrc
source ~/.bashrc

# Verify
echo $PRISM_URL
prism health

Optional Variables

CLOUDFLARE_ACCOUNT_ID

Your Cloudflare account ID.

export CLOUDFLARE_ACCOUNT_ID=your-account-id

How to find:

wrangler whoami
# Or Cloudflare Dashboard > Workers & Pages > Overview

When needed:

Multiple Cloudflare accounts
CI/CD pipelines
Automated deployments

CLOUDFLARE_API_TOKEN

API token for Cloudflare API access.

export CLOUDFLARE_API_TOKEN=your-api-token

How to create:

Cloudflare Dashboard > My Profile > API Tokens
Click "Create Token"
Use "Edit Cloudflare Workers" template
Copy token

Permissions needed:

Workers: Edit
D1: Edit
Vectorize: Edit
KV: Edit

Security:

# Store securely
echo "export CLOUDFLARE_API_TOKEN=xxx" >> ~/.bashrc.private
source ~/.bashrc.private

# Or use password manager
# Or use CI/CD secrets

NODE_ENV

Node.js environment.

export NODE_ENV=production  # or development, test

Values:

production - Production settings
development - Development settings, more logging
test - Test settings, mock data

LOG_LEVEL

Logging verbosity.

export LOG_LEVEL=info  # debug, info, warn, error

Levels:

debug - Everything (very verbose)
info - Normal operations (default)
warn - Warnings only
error - Errors only

Worker Configuration

Configure worker behavior in wrangler.toml.

Basic Settings

# Worker name (appears in Cloudflare Dashboard)
name = "claudes-friend"

# Main entry point
main = "src/worker-vectorize.ts"

# Compatibility date
compatibility_date = "2024-12-01"

# Node.js compatibility
compatibility_flags = ["nodejs_compat_v2"]

Resource Bindings

D1 Database

[[d1_databases]]
binding = "DB"
database_name = "claudes-friend-db"
database_id = "your-database-id"

Create database:

wrangler d1 create claudes-friend-db
# Copy database_id from output

Run migrations:

wrangler d1 execute claudes-friend-db --file=./migrations/002_vector_index.sql

Vectorize Index

[[vectorize]]
binding = "VECTORIZE"
index_name = "claudes-friend-index"
remote = true  # Always use remote

Create index:

wrangler vectorize create claudes-friend-index \
  --dimensions=384 \
  --metric=cosine

Check status:

wrangler vectorize list
wrangler vectorize get claudes-friend-index

KV Namespace

[[kv_namespaces]]
binding = "KV"
id = "your-kv-namespace-id"

Create namespace:

wrangler kv:namespace create PRISM_INDEX
# Copy id from output

Usage:

Embedding cache
Search history
Query suggestions

R2 Bucket

[[r2_buckets]]
binding = "R2"
bucket_name = "claudes-friend-storage"

Create bucket:

wrangler r2 bucket create claudes-friend-storage

Usage:

Store raw files (optional)
Backup index data
Large file storage

Workers AI

[ai]
binding = "AI"
remote = true  # Always use remote

No setup needed - automatically available with Cloudflare account.

Usage:

Generate embeddings
Text analysis
Semantic search

Environment Variables (Worker)

[vars]
# Environment name
ENVIRONMENT = "production"

# Logging level
LOG_LEVEL = "info"

# AI model for embeddings
EMBEDDING_MODEL = "@cf/baai/bge-small-en-v1.5"

# Default limits
MAX_TOKENS = "2048"
TEMPERATURE = "0.7"

# Free tier limits (for monitoring)
MAX_NEURONS_PER_DAY = "10000"
MAX_REQUESTS_PER_DAY = "100000"

Available embedding models:

@cf/baai/bge-small-en-v1.5 - 384d, fast, good quality (recommended)
@cf/baai/bge-base-en-v1.5 - 768d, slower, better quality
@cf/baai/bge-large-en-v1.5 - 1024d, slowest, best quality

Environment-Specific Config

Development

[env.development]
name = "claudes-friend-dev"

[env.development.vars]
ENVIRONMENT = "development"
LOG_LEVEL = "debug"

Usage:

wrangler deploy --env development

Testing

[env.test]
name = "claudes-friend-test"

[env.test.vars]
ENVIRONMENT = "test"
LOG_LEVEL = "error"

Usage:

wrangler deploy --env test
npm test

Production

# Default environment (no [env.xxx] prefix)
name = "claudes-friend"

[vars]
ENVIRONMENT = "production"
LOG_LEVEL = "info"

Usage:

npm run deploy
# Or: wrangler deploy

File Patterns

Included Extensions

Default file extensions that are indexed:

const EXTENSIONS = [
  '.ts',    // TypeScript
  '.tsx',   // TypeScript + JSX
  '.js',    // JavaScript
  '.jsx',   // JavaScript + JSX
  '.py',    // Python
  '.rs',    // Rust
  '.go',    // Go
  '.java',  // Java
  '.c',     // C
  '.cpp',   // C++
  '.h',     // C/C++ headers
  '.cs',    // C#
  '.php',   // PHP
  '.rb',    // Ruby
  '.kt',    // Kotlin
  '.swift', // Swift
  '.sh',    // Shell
  '.bash',  // Bash
  '.zsh',   // Zsh
  '.yaml',  // YAML
  '.yml',   // YAML
  '.json',  // JSON
  '.md',    // Markdown
];

Customize:

# Index only TypeScript
prism index ./src --extensions .ts,.tsx

# Add custom extensions
prism index ./src --extensions .ts,.tsx,.vue,.svelte

Excluded Patterns

Default patterns that are skipped:

const SKIP_PATTERNS = [
  // Test files
  '.test.',
  '.spec.',
  '.mock.',

  // Dependencies
  'node_modules',
  'vendor',
  'bower_components',

  // Build outputs
  'dist',
  'build',
  '.next',
  'out',
  'coverage',

  // Version control
  '.git',
  '.svn',
  '.hg',

  // IDE
  '.vscode',
  '.idea',
  '.DS_Store',
];

Customize:

# Exclude specific patterns
prism index ./src --exclude test,dist,node_modules

# Exclude with glob patterns
prism index ./src --exclude "**/*.test.ts" --exclude "**/*.spec.ts"

.prismignore File

Create a .prismignore file in your project root:

# .prismignore
# Similar to .gitignore

# Dependencies
node_modules/
vendor/

# Build outputs
dist/
build/
*.min.js
*.bundle.js

# Tests
**/*.test.ts
**/*.spec.ts
**/*.mock.ts
coverage/

# Docs
docs/
*.md

# Config
*.config.js
*.config.ts

# Large files
data/
*.csv
*.json

Usage:

# PRISM automatically reads .prismignore
prism index ./src

Performance Tuning

Embedding Cache

Cache embeddings to avoid regenerating for unchanged content.

Configuration:

[[kv_namespaces]]
binding = "KV"
id = "your-kv-namespace-id"

Cache behavior:

Automatic caching based on content hash
TTL: 30 days
Hit rate: 70-80% typical

Clear cache:

# List keys
wrangler kv:key list --binding=KV --namespace-id=your-kv-id

# Delete specific key
wrangler kv:key delete "embedding:hash" --namespace-id=your-kv-id

# Bulk delete (clear cache)
wrangler kv:key list --namespace-id=your-kv-id | \
  jq -r '.[].name' | \
  xargs -I {} wrangler kv:key delete {} --namespace-id=your-kv-id

Incremental Indexing

Skip unchanged files using SHA-256 checksums.

Enable:

prism index ./src --incremental

Performance:

First index: ~8s for 67 files
Incremental: ~0.4s (21x faster)
Only indexes changed files

How it works:

Calculate SHA-256 for each file
Check if checksum exists in database
Skip if unchanged
Index only new/changed files

Batch Size

Control how many files are processed at once.

Default: 10 files per batch

Adjust:

// In prism-cli.js
const BATCH_SIZE = 20;  // Process 20 files at once

Trade-offs:

Larger batches: Faster but more memory
Smaller batches: Slower but less memory

Vectorize Settings

[[vectorize]]
binding = "VECTORIZE"
index_name = "claudes-friend-index"

# Dimensions (must match embedding model)
# bge-small-en-v1.5 = 384
# bge-base-en-v1.5 = 768
# bge-large-en-v1.5 = 1024

# Metric (similarity calculation)
# cosine = cosine similarity (recommended)
# euclidean = euclidean distance
# dot = dot product

Create with options:

wrangler vectorize create my-index \
  --dimensions=384 \
  --metric=cosine

Search Optimization

Limit Results

# Get fewer results for faster response
prism search "auth" --limit 5

# Default is 10, max is 100
prism search "auth" --limit 100

Set Score Threshold

# Only return high-confidence matches
prism search "auth" --min-score 0.7

# Lower threshold for more results
prism search "auth" --min-score 0.5

Use Filters

# Reduce search space
prism search "database" --lang typescript --path src/db/

Language-Specific Settings

TypeScript

# Index TypeScript files
prism index ./src --extensions .ts,.tsx

# Exclude type definitions
prism index ./src --exclude "**/*.d.ts"

# Include tests
prism index ./src  # Tests excluded by default

Recommended:

Include: .ts, .tsx
Exclude: .d.ts (type definitions), .test.ts, .spec.ts

JavaScript

# Index JavaScript files
prism index ./src --extensions .js,.jsx

# Exclude minified
prism index ./src --exclude "**/*.min.js"

Recommended:

Include: .js, .jsx
Exclude: .min.js, .bundle.js, dist/

Python

# Index Python files
prism index ./src --extensions .py

# Exclude virtual env
prism index ./src --exclude venv,__pycache__

Recommended:

Include: .py
Exclude: venv/, __pycache__/, .pyc

Rust

# Index Rust files
prism index ./src --extensions .rs

# Include Cargo.toml
prism index ./src --extensions .rs,.toml

Recommended:

Include: .rs
Exclude: target/

Go

# Index Go files
prism index ./src --extensions .go

# Exclude vendor
prism index ./src --exclude vendor

Recommended:

Include: .go
Exclude: vendor/

Multi-Language Projects

# Index multiple languages
prism index ./src --extensions .ts,.py,.go,.rs

# Exclude language-specific files
prism index ./src \
  --exclude node_modules \
  --exclude venv \
  --exclude target \
  --exclude vendor

Advanced Options

Custom Embedding Model

Change the embedding model for different quality/speed trade-offs.

Edit wrangler.toml:

[vars]
# Fast, good quality (default)
EMBEDDING_MODEL = "@cf/baai/bge-small-en-v1.5"

# Better quality, slower
# EMBEDDING_MODEL = "@cf/baai/bge-base-en-v1.5"

# Best quality, slowest
# EMBEDDING_MODEL = "@cf/baai/bge-large-en-v1.5"

Note: Changing models requires:

Updating Vectorize dimensions
Reindexing all files
More quota usage

Custom Chunking

Adjust how code is split into chunks.

Default: Tree-sitter-based chunking (50 lines per chunk)

Customize (in code):

// src/shared/utils.ts
export const CONFIG = {
  CHUNK_SIZE: 50,        // Lines per chunk
  CHUNK_OVERLAP: 5,      // Overlap between chunks
  MAX_CHUNK_SIZE: 100,   // Maximum lines
};

Trade-offs:

Smaller chunks: More granular, more storage
Larger chunks: More context, less granular

CORS Configuration

Control which origins can access your worker.

Edit src/worker-vectorize.ts:

const ALLOWED_ORIGINS = [
  'http://localhost:3000',
  'http://localhost:8080',
  'https://yourdomain.com',
  'https://app.yourdomain.com',
];

Wildcard (not recommended):

const ALLOWED_ORIGINS = ['*'];

Rate Limiting

Implement custom rate limiting (not built-in).

Example:

// In worker
const rateLimit = async (env: Env, identifier: string) => {
  const key = `ratelimit:${identifier}`;
  const count = await env.KV.get(key);

  if (count && parseInt(count) > 100) {
    throw new Error('Rate limit exceeded');
  }

  await env.KV.put(key, String((parseInt(count || '0') + 1)), {
    expirationTtl: 60  // 1 minute
  });
};

Custom Logging

Structured logging with log levels.

Configuration:

[vars]
LOG_LEVEL = "info"  # debug, info, warn, error

Usage in worker:

import { createLogger } from './shared/utils.js';

const logger = createLogger('MyComponent');

logger.debug('Debug message', { data });
logger.info('Info message');
logger.warn('Warning message');
logger.error('Error message', { error });

Observability

Enable analytics and monitoring.

Edit wrangler.toml:

[observability]
enabled = true

# Log forwarding (optional)
# logpush = true

View analytics:

Cloudflare Dashboard > Workers & Pages > Analytics

Metrics:

Requests per second
Errors
CPU time
Success rate

Examples

Example 1: TypeScript Monorepo

# .prismignore
node_modules/
dist/
coverage/
**/*.test.ts
**/*.spec.ts
*.d.ts

# Index
prism index ./packages --extensions .ts,.tsx --incremental

# Search
prism search "API endpoints" --lang typescript --path packages/api/

Example 2: Python Data Science Project

# .prismignore
venv/
__pycache__/
*.pyc
.ipynb_checkpoints/
data/
*.csv
*.parquet

# Index
prism index ./src --extensions .py --exclude notebooks

# Search
prism search "data preprocessing" --lang python

Example 3: Full-Stack App

# .prismignore
node_modules/
dist/
build/
.next/
coverage/
**/*.test.*
**/*.spec.*

# Index frontend
prism index ./frontend --extensions .ts,.tsx --incremental

# Index backend
prism index ./backend --extensions .py,.ts --incremental

# Search
prism search "authentication" --lang typescript,python

Example 4: Microservices

# Index each service
prism index ./services/auth --path services/auth/
prism index ./services/api --path services/api/
prism index ./services/database --path services/database/

# Search specific service
prism search "user validation" --path services/auth/

# Search all services
prism search "logging"

Example 5: CI/CD Pipeline

# .github/workflows/index.yml
name: Index Codebase

on:
  push:
    branches: [main]

jobs:
  index:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v2

      - name: Setup Node
        uses: actions/setup-node@v2
        with:
          node-version: '18'

      - name: Install PRISM
        run: npm install -g @claudes-friend/prism

      - name: Index codebase
        env:
          PRISM_URL: ${{ secrets.PRISM_URL }}
        run: prism index ./src --incremental

Configuration Checklist

Initial Setup

Worker Setup

Optimization

Testing

Best Practices

Use incremental indexing for faster updates
Exclude test files unless specifically needed
Set appropriate score thresholds (0.7+ recommended)
Monitor quota usage in Cloudflare Dashboard
Cache search results client-side when possible
Use filters to narrow search scope
Version your configuration in git
Document custom settings for your team
Test configuration changes in development first
Monitor performance and adjust as needed

Troubleshooting Configuration

Check Current Configuration

# Environment variables
env | grep PRISM

# Worker configuration
cat wrangler.toml

# Test configuration
prism health
prism stats

Validate Configuration

# Validate wrangler.toml
wrangler deploy --dry-run

# Test bindings
wrangler d1 list
wrangler vectorize list
wrangler kv:namespace list

Reset Configuration

# Clear environment
unset PRISM_URL

# Reset worker config
git checkout wrangler.toml

# Redeploy
npm run deploy

Next Steps

User Guide - USER_GUIDE.md
API Reference - API_REFERENCE.md
Troubleshooting - TROUBLESHOOTING.md
Architecture - docs/architecture/

Last Updated: 2026-01-15 Version: 0.3.2

Need help? Check Troubleshooting or open an issue.

FilesExpand file tree

CONFIGURATION.md

Latest commit

History

CONFIGURATION.md

File metadata and controls

PRISM Configuration Guide

Table of Contents

Overview

Environment Variables

Required Variables

PRISM_URL

Optional Variables

CLOUDFLARE_ACCOUNT_ID

CLOUDFLARE_API_TOKEN

NODE_ENV

LOG_LEVEL

Worker Configuration

Basic Settings

Resource Bindings

D1 Database

Vectorize Index

KV Namespace

R2 Bucket

Workers AI

Environment Variables (Worker)

Environment-Specific Config

Development

Testing

Production

File Patterns

Included Extensions

Excluded Patterns

.prismignore File

Performance Tuning

Embedding Cache

Incremental Indexing

Batch Size

Vectorize Settings

Search Optimization

Limit Results

Set Score Threshold

Use Filters

Language-Specific Settings

TypeScript

JavaScript

Python

Rust

Go

Multi-Language Projects

Advanced Options

Custom Embedding Model

Custom Chunking

CORS Configuration

Rate Limiting

Custom Logging

Observability

Examples

Example 1: TypeScript Monorepo

Example 2: Python Data Science Project

Example 3: Full-Stack App

Example 4: Microservices

Example 5: CI/CD Pipeline

Configuration Checklist

Initial Setup

Worker Setup

Optimization

Testing

Best Practices

Troubleshooting Configuration

Check Current Configuration

Validate Configuration

Reset Configuration

Next Steps