Backend wrapper #1

TerrenceZhangX · 2025-09-17T05:06:18Z

This pull request introduces a new minimal FastAPI-based backend service for LLMCompass, providing an HTTP API for submitting and querying kernel simulation tasks. It includes a Dockerized deployment, an in-memory task queue with background workers, modularized simulation logic, and clear extension points for adding new synchronous simulators. The changes also include comprehensive documentation for building, running, and extending the backend.

The most important changes are:

Backend API and Task Management:

Added a new FastAPI application in backend_app/main.py that implements endpoints for health checks, listing supported operations, submitting simulation tasks (with async background processing and optional synchronous wait), and querying task status/results. It uses an in-memory task store and a configurable pool of background worker coroutines for processing simulation tasks.
Implemented a new async scheduler and dispatcher in backend_app/scheduler.py, which offloads blocking simulation work to threads and standardizes result formatting and error handling.

Simulation Logic and Extensibility:

Added backend_app/sim_utils.py with shared helpers for dtype mapping, tensor creation, error formatting, and supported operations listing.
Added backend_app/sync_simulators.py with modular, synchronous simulation implementations for matmul, bmm, layernorm, gelu, and softmax, each using the software/hardware model APIs. Includes a routing function for selecting the appropriate simulator based on the requested operation.

Deployment and Documentation:

Updated the Dockerfile to copy the full application source, install FastAPI and dependencies, set up the conda environment, and launch the API server with Uvicorn. Removed legacy GitHub clone and shell activation logic.
Added a detailed backend_app/README.md with instructions for building/running the backend, API usage examples, environment variables, code structure, and guidelines for adding new simulators.

Dependency Management:

Updated environment.yml to include required Python and pip dependencies (e.g., torch, FastAPI, pytest) and removed unused channels and packages.

Copilot

Pull Request Overview

This pull request introduces a new FastAPI-based backend service for LLMCompass that provides HTTP API endpoints for submitting and querying kernel simulation tasks. The backend features async task processing, Docker deployment, and modular simulation logic with clear extension points.

Adds a FastAPI application with endpoints for health checks, supported operations, task submission, and status querying
Implements async task scheduling with background workers and thread-based simulation execution
Provides modular synchronous simulators for matmul, bmm, layernorm, gelu, and softmax operations

Reviewed Changes

Copilot reviewed 8 out of 9 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
tests/test_api_integration.py	Comprehensive integration tests covering all supported operations with artifact generation
environment.yml	Updated dependencies to include torch, fastapi, and pytest with version pinning
backend_app/sync_simulators.py	Modular synchronous simulation implementations with operation routing
backend_app/sim_utils.py	Shared utilities for dtype mapping, tensor creation, and error handling
backend_app/scheduler.py	Async scheduler for dispatching simulation tasks to threads
backend_app/main.py	FastAPI application with task management and background workers
backend_app/README.md	Comprehensive documentation for building, running, and extending the backend
Dockerfile	Updated container setup for FastAPI deployment

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

backend_app/sync_simulators.py

backend_app/README.md

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot

Pull Request Overview

Copilot reviewed 8 out of 9 changed files in this pull request and generated 2 comments.

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

backend_app/sync_simulators.py

tests/test_api_integration.py

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

zhangt added 5 commits September 16, 2025 08:17

Init commit for docker env and simulator API

d20d70b

Cover all ops in scheduler, simulator and tests

c1fb334

Update readme

9915fab

Loose input dim format to Any

081d905

Fix bug and set default worker to 32

5c2e969

TerrenceZhangX requested a review from Copilot September 17, 2025 05:06

TerrenceZhangX self-assigned this Sep 17, 2025

Copilot AI reviewed Sep 17, 2025

View reviewed changes

TerrenceZhangX and others added 5 commits September 17, 2025 13:20

Update backend_app/sync_simulators.py

3d76873

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update backend_app/sync_simulators.py

51b1f2a

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update backend_app/README.md

5525d15

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

update

8a893f0

Update backend_app/README.md

88fb3d4

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

TerrenceZhangX requested a review from Copilot September 17, 2025 05:25

Copilot AI reviewed Sep 17, 2025

View reviewed changes

backend_app/sync_simulators.py Outdated Show resolved Hide resolved

tests/test_api_integration.py Outdated Show resolved Hide resolved

TerrenceZhangX and others added 5 commits September 17, 2025 13:27

Update backend_app/sync_simulators.py

e2fb8fd

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update tests/test_api_integration.py

f82b085

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Improve readme

d178fbb

Remove redundant information and improve field readability

5b537f3

Remove duplicated kernel name in response and update readme

65dec66

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Backend wrapper #1

Backend wrapper #1

Uh oh!

TerrenceZhangX commented Sep 17, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Backend wrapper #1

Are you sure you want to change the base?

Backend wrapper #1

Uh oh!

Conversation

TerrenceZhangX commented Sep 17, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants