WIP: ADD: Batch request (Non-streaming) for OpenAI Plugin #66
thameem-abbas wants to merge 7 commits into openshift-psap:main
Conversation
@sjmonson @dagrayvid @npalaska

Thanks @thameem-abbas 👍 Initial tests with changes from this PR work well. I'll add more in the afternoon.
I still don't like this approach. It duplicates too much code; that is fine for a one-off, but to merge this it would be preferable to have all request methods take a list of queries and return a list of Results (see later comment).
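One way to read the reviewer's suggestion is a single request method that always takes a list of queries and returns a list of Results, making the single-request case a batch of size 1. The sketch below is illustrative only; `Result`'s fields and the `PluginSketch` class are hypothetical stand-ins, not llm-load-test's actual types.

```python
from dataclasses import dataclass


@dataclass
class Result:
    # hypothetical fields for illustration; the real Result class differs
    query: str
    output: str


class PluginSketch:
    """Sketch of the proposed interface: list of queries in, list of
    Results out, so no separate request_batch_http() is needed."""

    def request_http(self, queries, user_id, test_end_time: float = 0):
        results = []
        for q in queries:
            # a real plugin would issue the HTTP call for q here
            results.append(Result(query=q, output=""))
        return results
```

With this shape, callers pass `[query]` for the non-batched path and a longer list for batching, and `request_batch_http()` disappears entirely.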
    return result

def request_batch_http(self, queries, user_id, test_end_time: float = 0):
Changing the interface of request_func() based on calling args is bad practice. See above comment.
    # if timeout passes, queue.Empty will be thrown
    # User should continue to poll for inputs
  if plugin_type == "openai_plugin":
      plugin = openai_plugin.OpenAIPlugin(
-         config.get("plugin_options")
+         config.get("plugin_options"), batch_size
Again, this should be handled in a way that the plugin does not need to know the batch size in advance. However, if you must... just set config["plugin_options"] = batch_size to avoid changing the interface.
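One plausible reading of the reviewer's workaround is to fold the batch size into the existing options dict rather than overwriting it, so the plugin's constructor signature stays unchanged. The config shape and the `OpenAIPluginSketch` class below are assumptions for illustration, not the real llm-load-test schema or plugin.

```python
# Hypothetical config shape; the real llm-load-test config schema may differ.
config = {"plugin_options": {"model_name": "test-model"}}
batch_size = 4

# Fold batch_size into the existing options dict so the plugin's
# __init__(options) signature stays unchanged.
config["plugin_options"]["batch_size"] = batch_size


class OpenAIPluginSketch:
    def __init__(self, options):
        # the plugin reads batch_size from its own options, defaulting to 1
        self.batch_size = options.get("batch_size", 1)


plugin = OpenAIPluginSketch(config.get("plugin_options"))
```

This keeps the constructor call site identical to what is already on main; only the config dict gains an extra key.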
Agreeing on change of condition. Co-authored-by: Samuel Monson <smonson@irbash.net>
Don't know what was running through my head when I wrote that. Thanks Co-authored-by: Samuel Monson <smonson@irbash.net>
Fix redundant annotation Co-authored-by: Samuel Monson <smonson@irbash.net>
Has this work been picked back up / re-reviewed?

@rgreenberg1 This hasn't been picked up for a while now. It's out of sync with main by quite a bit. We can prioritize this if there is a need for it to land in llm-load-test.
Adds non-streaming batch request support in OpenAI plugin.