Highlights
- Pro
Pinned Loading
-
DiSpec
DiSpec PublicDiSpec — a from-scratch LLM inference engine: paged attention, continuous batching, CUDA-graph decode, speculative decoding, and prefill/decode disaggregation
Python
-
Parallel-Eagle
Parallel-Eagle PublicA lossless speculative decoder: a parallel multi-token drafter + dynamic tree verification.
Python
-
Schema-Linking-with-SFT
Schema-Linking-with-SFT PublicLoRA fine-tuned Qwen2.5-Coder-1.5B for schema linking in NL-to-SQL: given a question and a database schema, predicts the referenced tables and columns as JSON. Uses self-consistency decoding (k=10)…
Python
-
search-hotstar
search-hotstar PublicSemantic search engine over a streaming catalog — scrapes, enriches, embeds, and indexes titles into Qdrant, then serves vector + lexical hybrid search via FastAPI with a lightweight frontend.
Python
-
-
CSE291g-MIC-winter-2026/humerus_atlas_building
CSE291g-MIC-winter-2026/humerus_atlas_building PublicRepository containing the implementation for Humerus Atlas construction using Implicit Neural Representations (INRs), developed as part of the CSE291G course project (Winter 2026).
Python
If the problem persists, check the GitHub status page or contact support.
