Codescope - Project Context

Project Overview

Codescope is a code structure analysis tool written in Rust, using Tree-sitter for AST parsing. The project follows a phased development approach with v0.1 focusing on CLI-only functionality.

Current Version: v0.1
License: MIT
Design Doc: docs/design-v0.1.md

Architecture

Crate Structure

/crates
  /codescope-core        # Core engine - NO HTTP/UI deps
  /codescope-adapters    # Language adapters (TypeScript/JS/TSX)
  /codescope-cli         # CLI interface (view/scan commands)

Key Principle: codescope-core has ZERO dependency on CLI/HTTP/Frontend. This ensures the core can be reused in v0.2 (MCP server) and v0.3 (Web server) without modification.

Multi-Dimensional Quality Metrics System

Design Philosophy: Instead of a single complexity score, Codescope provides independent, measurable quality metrics similar to ESLint rules. Each metric represents a distinct code quality dimension.

Core Types:

QualityMetric: Individual metric with name, value, threshold, severity, and optional message
Severity: Info | Warning | Error
ModuleIR: Contains metrics: Vec<QualityMetric> instead of a single complexity field
Symbol: Contains metrics: Vec<QualityMetric> for symbol-level analysis, plus structural attributes (loc, cyclomatic_complexity)

Symbol Structure Attributes:

loc: u32 - Lines of code (measured)
cyclomatic_complexity: Option<u32> - Cyclomatic complexity (measured for functions only)
metrics: Vec<QualityMetric> - Quality checks based on the above attributes
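
A minimal sketch of how the types above might be declared. Only the type names and the fields listed above come from this document; the numeric types of value and threshold, and the name, path, and symbols fields, are assumptions added for illustration.

```rust
/// Severity levels as listed under Core Types.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Severity {
    Info,
    Warning,
    Error,
}

/// One quality finding. The u32 types for value/threshold are assumed.
#[derive(Debug, Clone, PartialEq)]
pub struct QualityMetric {
    pub name: String,
    pub value: u32,
    pub threshold: u32,
    pub severity: Severity,
    pub message: Option<String>,
}

/// Symbol-level data: measured attributes plus rule-generated metrics.
#[derive(Debug, Clone, PartialEq)]
pub struct Symbol {
    pub name: String,                       // assumed field
    pub loc: u32,                           // measured
    pub cyclomatic_complexity: Option<u32>, // None for non-function symbols
    pub metrics: Vec<QualityMetric>,        // empty until rules run
}

/// Module-level data.
#[derive(Debug, Clone, PartialEq)]
pub struct ModuleIR {
    pub path: String,                // assumed field
    pub symbols: Vec<Symbol>,        // assumed field
    pub metrics: Vec<QualityMetric>, // empty until rules run
}
```

Keeping loc and cyclomatic_complexity as plain fields, separate from metrics, matches the split described above: measurements are always present, while metrics only appear once a rule has something to report.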

Metric Categories:

Size Metrics:
- file_loc: Code lines (excluding comments/blanks)
- comment_lines: Comment line count
- blank_lines: Empty line count

Structure Metrics:
- function_count: Number of functions
- class_count: Number of classes
- type_definition_count: Types/interfaces/enums
- large_function_count: Functions exceeding threshold

Coupling Metrics:
- fan_out: Number of dependencies
- fan_in: Number of dependents
- import_count: Import statement count

Complexity Metrics:
- cyclomatic_complexity: Measures function complexity based on decision points
- high_complexity_count: Number of functions exceeding complexity threshold

Rule System

Architecture:

pub trait QualityRule: Send + Sync {
    fn name(&self) -> &str;
    fn check_module(&self, module: &ModuleIR) -> Vec<QualityMetric>;
    fn check_symbol(&self, symbol: &Symbol) -> Vec<QualityMetric>;
}

Built-in Rules:

FileSizeRule - File size checks (default: 300 LOC)
FunctionSizeRule - Function size checks (default: 40 LOC)
CouplingRule - Fan-out and import checks (default: 7 deps, 15 imports)
StructureStatsRule - Symbol count checks (default: 20 functions, 30 types per file)
ComplexityRule - Cyclomatic complexity checks (default: 10)
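
A sketch of what one built-in rule could look like against the QualityRule trait. The trait signature and the 40 LOC default come from this document; the struct fields, the metric name "function_loc", the message text, and the simplification that every symbol is treated as a function are assumptions, not the actual implementation.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Severity { Info, Warning, Error }

#[derive(Debug, Clone, PartialEq)]
pub struct QualityMetric {
    pub name: String,
    pub value: u32,
    pub threshold: u32,
    pub severity: Severity,
    pub message: Option<String>,
}

// Trimmed-down stand-ins for the real types (assumed shapes).
pub struct Symbol { pub name: String, pub loc: u32, pub metrics: Vec<QualityMetric> }
pub struct ModuleIR { pub symbols: Vec<Symbol>, pub metrics: Vec<QualityMetric> }

pub trait QualityRule: Send + Sync {
    fn name(&self) -> &str;
    fn check_module(&self, module: &ModuleIR) -> Vec<QualityMetric>;
    fn check_symbol(&self, symbol: &Symbol) -> Vec<QualityMetric>;
}

/// Hypothetical shape for FunctionSizeRule.
pub struct FunctionSizeRule { pub max_function_loc: u32, pub severity: Severity }

impl Default for FunctionSizeRule {
    fn default() -> Self {
        Self { max_function_loc: 40, severity: Severity::Warning }
    }
}

impl QualityRule for FunctionSizeRule {
    fn name(&self) -> &str { "function_size" }

    // This sketch reports nothing at module level.
    fn check_module(&self, _module: &ModuleIR) -> Vec<QualityMetric> { Vec::new() }

    // Emit one metric only when the symbol exceeds the threshold.
    fn check_symbol(&self, symbol: &Symbol) -> Vec<QualityMetric> {
        if symbol.loc <= self.max_function_loc {
            return Vec::new();
        }
        vec![QualityMetric {
            name: "function_loc".to_string(),
            value: symbol.loc,
            threshold: self.max_function_loc,
            severity: self.severity,
            message: Some(format!(
                "{} has {} lines (limit {})",
                symbol.name, symbol.loc, self.max_function_loc
            )),
        }]
    }
}
```

Returning a Vec rather than an Option lets a single rule report several metrics per symbol, which is presumably why the trait is shaped this way.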

Configuration via .codescope.toml:

[rules]
max_file_loc = 300
max_function_loc = 40
max_functions_per_file = 20
max_types_per_file = 30
max_fan_out = 7
max_imports = 15
max_complexity = 10

[rules.severity]
max_file_loc = "Warning"
max_function_loc = "Warning"
max_fan_out = "Warning"
max_complexity = "Warning"

Configuration is loaded from .codescope.toml in the analyzed file's parent directory, falling back to defaults if not found.
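
The fallback to defaults can be pictured as a Default impl that mirrors the [rules] table. This is a sketch only: the struct name RulesConfig and the case-sensitive severity parsing are assumptions, and real TOML deserialization is left out so the example needs only the standard library.

```rust
use std::str::FromStr;

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Severity { Info, Warning, Error }

// Parses the strings used in [rules.severity]; case-sensitive by assumption.
impl FromStr for Severity {
    type Err = String;

    fn from_str(s: &str) -> Result<Self, Self::Err> {
        match s {
            "Info" => Ok(Severity::Info),
            "Warning" => Ok(Severity::Warning),
            "Error" => Ok(Severity::Error),
            other => Err(format!("unknown severity: {other}")),
        }
    }
}

/// Hypothetical in-memory form of the [rules] table.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct RulesConfig {
    pub max_file_loc: u32,
    pub max_function_loc: u32,
    pub max_functions_per_file: u32,
    pub max_types_per_file: u32,
    pub max_fan_out: u32,
    pub max_imports: u32,
    pub max_complexity: u32,
}

// Values match the documented defaults.
impl Default for RulesConfig {
    fn default() -> Self {
        Self {
            max_file_loc: 300,
            max_function_loc: 40,
            max_functions_per_file: 20,
            max_types_per_file: 30,
            max_fan_out: 7,
            max_imports: 15,
            max_complexity: 10,
        }
    }
}
```

With this shape, a partial override reads as RulesConfig { max_complexity: 5, ..RulesConfig::default() }, leaving every other threshold at its default.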

Development Guidelines

Code Style

Functional programming style preferred
Files < 200 lines, functions < 40 lines
No unnecessary try/catch - let errors propagate naturally
Type over interface (TypeScript convention)
Named exports over default exports

Testing

Core logic MUST have unit tests
Test files: tests live in a sibling file, src/module_name_test.rs, which the module includes using the #[cfg(test)] and #[path = "..."] pattern

Tree-sitter Integration

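Important: In tree-sitter 0.24.x, QueryCursor::captures returns a streaming iterator rather than a standard Iterator, so it cannot be used directly in a for loop. Drive it with while let and next():

// Correct iteration pattern
// The StreamingIterator trait must be in scope for next();
// tree-sitter re-exports it from the streaming-iterator crate.
use tree_sitter::{QueryCursor, StreamingIterator};

let mut cursor = QueryCursor::new();
let mut captures = cursor.captures(&query, tree.root_node(), source.as_bytes());

// Each item is a (QueryMatch, capture index) pair
while let Some((m, _)) = captures.next() {
    // Process captures
}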

Crate versions (Cargo.toml dependencies):

tree-sitter = "0.24"
tree-sitter-typescript = "0.23"
tree-sitter-javascript = "0.23"

Adapter Implementation

TypeScript adapter uses node walking (simplified for v0.1):

fn walk_node(&self, node: tree_sitter::Node, source: &str, symbols: &mut Vec<Symbol>) {
    match node.kind() {
        "class_declaration" => extract_symbol(SymbolKind::Class),
        "function_declaration" => extract_symbol(SymbolKind::Function),
        "interface_declaration" => extract_symbol(SymbolKind::Interface),
        "type_alias_declaration" => extract_symbol(SymbolKind::Type),
        "enum_declaration" => extract_symbol(SymbolKind::Enum),
        _ => {}
    }
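    // Recursively walk children
    let mut cursor = node.walk();
    for child in node.children(&mut cursor) {
        self.walk_node(child, source, symbols);
    }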
}

Note: Symbols are extracted with metrics: vec![]. Metrics are populated by the rule system in the CLI layer, not in the adapter.

Cyclomatic Complexity

Calculation: For each function, the adapter calculates cyclomatic complexity from the tree-sitter AST:

fn calculate_complexity(&self, node: tree_sitter::Node, source: &str) -> u32 {
    let mut complexity = 1; // Base complexity
    // Count decision points: if, for, while, switch cases, catch, ternary, &&, ||
    self.count_decision_points(node, source, &mut complexity);
    complexity
}

Decision Points Counted:

Conditional statements: if_statement
Loops: for_statement, for_in_statement, while_statement, do_statement
Switch cases: switch_case
Exception handling: catch_clause
Ternary operators: ternary_expression
Logical operators: &&, || in binary_expression
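
To make the counting rule concrete, here is a sketch that applies the list above to a toy tree instead of a real tree-sitter AST. ToyNode and its constructors are hypothetical stand-ins; only the node kinds and the base complexity of 1 come from this document.

```rust
/// Toy stand-in for a tree-sitter node: a kind, an optional operator, children.
pub struct ToyNode {
    pub kind: &'static str,
    pub operator: Option<&'static str>,
    pub children: Vec<ToyNode>,
}

impl ToyNode {
    pub fn new(kind: &'static str, children: Vec<ToyNode>) -> Self {
        Self { kind, operator: None, children }
    }

    pub fn binary(operator: &'static str, children: Vec<ToyNode>) -> Self {
        Self { kind: "binary_expression", operator: Some(operator), children }
    }
}

/// True for the node kinds listed under "Decision Points Counted".
fn is_decision_point(node: &ToyNode) -> bool {
    match node.kind {
        "if_statement" | "for_statement" | "for_in_statement" | "while_statement"
        | "do_statement" | "switch_case" | "catch_clause" | "ternary_expression" => true,
        // Only short-circuit operators count; arithmetic and comparison do not.
        "binary_expression" => matches!(node.operator, Some("&&") | Some("||")),
        _ => false,
    }
}

/// Depth-first walk that adds 1 for every decision point found.
fn count_decision_points(node: &ToyNode, complexity: &mut u32) {
    if is_decision_point(node) {
        *complexity += 1;
    }
    for child in &node.children {
        count_decision_points(child, complexity);
    }
}

pub fn calculate_complexity(function_body: &ToyNode) -> u32 {
    let mut complexity = 1; // Base complexity
    count_decision_points(function_body, &mut complexity);
    complexity
}
```

For a body equivalent to `if (a && b) { for (...) {} } return a + b;`, the walk finds three decision points (if, &&, for), so the result is 1 + 3 = 4; the + operator is ignored.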

Architecture Flow:

Adapter calculates cyclomatic_complexity during parsing (stored in Symbol)
CLI applies ComplexityRule to check against threshold
Rule generates QualityMetric if threshold exceeded

Building & Running

Build

cargo build --release
# Binary at: target/release/codescope

CLI Usage

# Analyze file (default: table format with quality metrics)
./target/release/codescope path/to/file.ts

# JSON output (matches design doc schema)
./target/release/codescope path/to/file.ts -f json

# Markdown output
./target/release/codescope path/to/file.ts -f md

# Output to file
./target/release/codescope path/to/file.ts -o report.json -f json

# Generate config file
./target/release/codescope init

Batch Analysis

Analyze multiple files and generate CSV reports:

# Build first
cargo build --release

# Install bun (if needed)
curl -fsSL https://bun.sh/install | bash

# Run batch analysis
bun scripts/analyze-batch.ts /path/to/target/directory

# Outputs:
# - analysis-results.json (full JSON)
# - analysis-files.csv (file-level summary)
# - analysis-symbols.csv (symbol-level details)
# - analysis-metrics.csv (quality metrics details)

Note: Analysis output files are git-ignored via .gitignore patterns.

Key Implementation Notes

LOC Counting

Implemented in metrics::count_lines():

Returns LOCStats { code, comment, blank }
Handles // and /* */ comments
Simple line-based scanner (not AST-based)
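
A minimal sketch of a line-based scanner with this shape. The function name and the LOCStats fields come from this document; the usize field types and the classification choices (a line with code followed by a trailing comment counts as code, blank lines inside a block comment count as comment) are assumptions, and comment markers inside string literals are not handled.

```rust
#[derive(Debug, Default, PartialEq, Eq)]
pub struct LOCStats {
    pub code: usize,
    pub comment: usize,
    pub blank: usize,
}

pub fn count_lines(source: &str) -> LOCStats {
    let mut stats = LOCStats::default();
    let mut in_block_comment = false;

    for line in source.lines() {
        let trimmed = line.trim();

        if in_block_comment {
            // Everything inside /* ... */ counts as comment, including blank lines.
            stats.comment += 1;
            if trimmed.contains("*/") {
                in_block_comment = false;
            }
        } else if trimmed.is_empty() {
            stats.blank += 1;
        } else if trimmed.starts_with("//") {
            stats.comment += 1;
        } else if trimmed.starts_with("/*") {
            stats.comment += 1;
            // Stay in comment mode unless the block closes on this same line.
            in_block_comment = !trimmed[2..].contains("*/");
        } else {
            // Code, possibly with a trailing comment.
            stats.code += 1;
        }
    }

    stats
}
```

A scanner like this is cheap and language-agnostic for C-style comments, at the cost of misclassifying edge cases that an AST-based counter would get right.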

Metrics Application Flow

1. Adapter parses file → Returns ModuleIR with:
   - Structural attributes: loc, cyclomatic_complexity (calculated)
   - Empty metrics vectors (to be filled by rules)
2. CLI loads config → Creates RuleRegistry from .codescope.toml
3. CLI applies rules → Populates module.metrics and symbol.metrics based on structural attributes
4. Output formatter → Displays metrics in requested format

Important: Adapters calculate objective structural attributes (loc, cyclomatic_complexity). CLI applies subjective quality rules (thresholds, severity) to generate metrics.
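
The rule-application step of the flow above can be sketched as a registry that walks every rule over the module and its symbols. RuleRegistry, ComplexityRule, and the QualityRule trait are named in this document; the apply method, the struct fields, and the metric name are assumptions for illustration.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Severity { Info, Warning, Error }

#[derive(Debug, Clone, PartialEq)]
pub struct QualityMetric {
    pub name: String,
    pub value: u32,
    pub threshold: u32,
    pub severity: Severity,
    pub message: Option<String>,
}

// Trimmed-down stand-ins for the real types (assumed shapes).
pub struct Symbol {
    pub name: String,
    pub cyclomatic_complexity: Option<u32>,
    pub metrics: Vec<QualityMetric>,
}
pub struct ModuleIR { pub symbols: Vec<Symbol>, pub metrics: Vec<QualityMetric> }

pub trait QualityRule: Send + Sync {
    fn name(&self) -> &str;
    fn check_module(&self, module: &ModuleIR) -> Vec<QualityMetric>;
    fn check_symbol(&self, symbol: &Symbol) -> Vec<QualityMetric>;
}

pub struct ComplexityRule { pub max_complexity: u32 }

impl QualityRule for ComplexityRule {
    fn name(&self) -> &str { "complexity" }

    fn check_module(&self, _module: &ModuleIR) -> Vec<QualityMetric> { Vec::new() }

    // Symbols without a measured complexity (non-functions) are skipped.
    fn check_symbol(&self, symbol: &Symbol) -> Vec<QualityMetric> {
        match symbol.cyclomatic_complexity {
            Some(value) if value > self.max_complexity => vec![QualityMetric {
                name: "cyclomatic_complexity".to_string(),
                value,
                threshold: self.max_complexity,
                severity: Severity::Warning,
                message: None,
            }],
            _ => Vec::new(),
        }
    }
}

pub struct RuleRegistry { rules: Vec<Box<dyn QualityRule>> }

impl RuleRegistry {
    pub fn new(rules: Vec<Box<dyn QualityRule>>) -> Self { Self { rules } }

    /// Hypothetical method: fills the empty metrics vectors left by the adapter.
    pub fn apply(&self, module: &mut ModuleIR) {
        for rule in &self.rules {
            let module_metrics = rule.check_module(module);
            module.metrics.extend(module_metrics);
            for symbol in &mut module.symbols {
                let symbol_metrics = rule.check_symbol(symbol);
                symbol.metrics.extend(symbol_metrics);
            }
        }
    }
}
```

Because rules only read structural attributes and only the registry writes metrics, the adapter output stays the same regardless of which thresholds are configured.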

Config Loading

// Load from .codescope.toml in parent directory, or use defaults
let config_path = args.path.parent().and_then(|p| {
    let config = p.join(".codescope.toml");
    if config.exists() { Some(config) } else { None }
});
let config = Config::load_or_default(config_path.as_deref());
let registry = config.to_rule_registry();

Roadmap Context

v0.1 (current): CLI-only, multi-dimensional metrics, configurable rules, cyclomatic complexity
v0.2 (next): MCP server integration
v0.3: Optional web server + React frontend
v0.4+: Multi-language, call graphs, time-series analysis

Git Workflow

Commit Message Style

Do not include these in commit messages:

🤖 Generated with [Claude Code]
Co-Authored-By: Claude <noreply@anthropic.com>

User's pre-commit hooks may reject commits with these markers. Use --no-verify if needed.

Analysis Output Files

Ignored by git (.gitignore):

analysis-*.csv
analysis-*.json
