Skip to content

Commit dc72d4f

Browse files
authored
Merge pull request #9 from vikranth22446/feature/update_publications_preble_wuklab
[WukLab] Update blog post with cognify and preble ICLR 2025 publication
2 parents 1b9ccb4 + 3721d08 commit dc72d4f

3 files changed

Lines changed: 9 additions & 3 deletions

File tree

data/projectsData.ts

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -6,6 +6,12 @@ interface Project {
66
}
77

88
const projectsData: Project[] = [
9+
{
10+
title: 'Cognify: A Comprehensive, Multi-Faceted Gen AI Workflow Optimizer',
11+
description: `TL;DR: Building high-quality, cost-effective generative AI applications is challenging due to the absence of systematic methods for tuning, testing, and optimization. We introduce Cognify, a tool that automatically enhances generation quality and reduces costs for generative AI workflows, including those written with LangChain, DSPy, and annotated Python. Built on a novel foundation of hierarchical, workflow-level optimization, Cognify delivers up to a 48% improvement in generation quality and up to 9x cost reduction. Cognify is publicly available at https://github.com/GenseeAI/cognify.`,
12+
imgSrc: '/static/images/projects/cognify.jpg',
13+
href: 'https://mlsys.wuklab.io/posts/cognify/',
14+
},
915
{
1016
title: 'Efficient LLM Scheduling by Learning to Rank',
1117
description: `Traditional Large Language Model (LLM) serving systems use first-come-first-serve (FCFS) scheduling, leading to delays when longer requests block shorter ones. We introduced a learning-to-rank method to predict output length rankings, enabling a Shortest Job First-like policy and reducing chatbot latency by 6.9x under high load compared to FCFS.`,
Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,8 +1,8 @@
11
---
22
title: 'Preble: Efficient Distributed Prompt Scheduling for LLM Serving'
3-
date: '2024-05-17'
4-
tags: ['PREPRINT']
3+
date: '2025-04-24'
4+
tags: ['ICLR 2025']
55
draft: false
66
authors: ['Vikranth Srivatsa', 'Zijian He', 'Reyna Abhyankar', 'Dongming Li', 'Yiying Zhang']
7-
canonicalUrl: https://escholarship.org/uc/item/1bm0k1w0#main
7+
canonicalUrl: https://arxiv.org/abs/2407.00023
88
---
33.8 KB
Loading

0 commit comments

Comments
 (0)