LangGraph simple example #167

Merged

benjibc merged 5 commits into main from langgraph_simple_example

Sep 10, 2025

Contributor

benjibc commented Sep 9, 2025

Simple database example
Chat example

benjibc added 2 commits

September 9, 2025 20:36


LangGraph simple example

854cb5c

simplify further

af8ac5c

benjibc requested review from Copilot, dphuang2 and xzrderek and removed request for xzrderek

September 9, 2025 20:48

Copilot AI reviewed

Copilot AI left a comment

Pull Request Overview

This PR introduces LangGraph support to the eval_protocol library by adding a new rollout processor and example implementations. The changes enable evaluation of LangGraph-based applications through a dedicated processor that handles conversion between eval_protocol and LangChain message formats.

Adds LangGraphRolloutProcessor for processing LangGraph-based evaluations
Implements database query example using Chinook database with LangGraph
Provides simple chat example with Fireworks model integration
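
As a rough illustration of the message conversion the overview describes, the sketch below maps chat messages with OpenAI-style roles to LangChain's message type names (`human`, `ai`, `system`, `tool`) and back. It uses plain dicts instead of the real `Message` and LangChain classes, and `ep_to_lc` / `lc_to_ep` are hypothetical names, not the functions added in this PR.

```python
from typing import Dict, List

# OpenAI-style role -> LangChain message type name.
ROLE_TO_LC_TYPE: Dict[str, str] = {
    "user": "human",
    "assistant": "ai",
    "system": "system",
    "tool": "tool",
}
# Inverse mapping for converting graph output back into role-based messages.
LC_TYPE_TO_ROLE: Dict[str, str] = {v: k for k, v in ROLE_TO_LC_TYPE.items()}


def ep_to_lc(messages: List[dict]) -> List[dict]:
    """Convert role/content dicts into type/content dicts (hypothetical helper)."""
    return [
        {"type": ROLE_TO_LC_TYPE[m["role"]], "content": m.get("content") or ""}
        for m in messages
    ]


def lc_to_ep(messages: List[dict]) -> List[dict]:
    """Convert type/content dicts back into role/content dicts (hypothetical helper)."""
    return [
        {"role": LC_TYPE_TO_ROLE[m["type"]], "content": m.get("content") or ""}
        for m in messages
    ]
```

An unknown role raises `KeyError` here, which is one possible choice; a real adapter might instead fall back to a generic message type.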

Reviewed Changes

Copilot reviewed 11 out of 11 changed files in this pull request and generated 3 comments.

| File | Description |
| --- | --- |
| eval_protocol/pytest/default_langchain_rollout_processor.py | Complete rewrite to support LangGraph with message conversion and error handling |
| eval_protocol/adapters/langchain.py | Simplified message serialization and added EP to LC conversion function |
| tests/pytest/test_langgraph_processor.py | Unit tests for the new LangGraph processor functionality |
| tests/chinook/langgraph/test_langgraph_chinook.py | Integration test using Chinook database with LangGraph |
| tests/chinook/langgraph/graph.py | LangGraph implementation for database queries |
| examples/langgraph/test_langgraph_rollout.py | Example evaluation test using LangGraph |
| examples/langgraph/simple_graph.py | Simple LangGraph implementation with Fireworks integration |
| examples/langgraph/data/simple_prompts.jsonl | Test data for LangGraph examples |
| requirements-dev.txt | Added LangGraph and LangChain dependencies |
| eval_protocol/pytest/handle_persist_flow.py | Fixed dataset name generation for Fireworks API compatibility |
| eval_protocol/adapters/bigquery.py | Consolidated exception handling and removed unused import |

eval_protocol/adapters/langchain.py (outdated, resolved)

tests/chinook/langgraph/test_langgraph_chinook.py (outdated, resolved)

examples/langgraph/simple_graph.py (outdated, resolved)

dphuang2 reviewed

requirements-dev.txt (resolved)

dphuang2 reviewed

tests/chinook/langgraph/test_langgraph_chinook.py (outdated, resolved)

dphuang2 reviewed

tests/chinook/langgraph/test_langgraph_chinook.py

+              @pytest.mark.skipif(os.getenv("FIREWORKS_API_KEY") in (None, ""), reason="FIREWORKS_API_KEY not set")
+              @evaluation_test(
+                  input_messages=[[[Message(role="user", content="What is the total number of tracks in the database?")]]],
+                  completion_params=[{"model": "accounts/fireworks/models/kimi-k2-instruct", "provider": "fireworks"}],

Collaborator

dphuang2 Sep 9, 2025

"provider" is a Pydantic thing. Is it necessary here?

Contributor Author

benjibc Sep 10, 2025

```
        model,
        model_provider=model_provider,
        temperature=temperature,
        reasoning_effort=reasoning_effort,
    )
```

It is a LangGraph thing.
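
To make the exchange above concrete, here is a minimal sketch, assuming the processor forwards each `completion_params` entry to a model constructor that takes the model name positionally and the provider as `model_provider`. `split_completion_params` is a hypothetical helper, not code from this PR.

```python
from typing import Any, Dict, Tuple


def split_completion_params(params: Dict[str, Any]) -> Tuple[str, Dict[str, Any]]:
    """Return (model, kwargs), renaming "provider" to "model_provider"."""
    kwargs = dict(params)  # copy so the caller's dict is left untouched
    model = kwargs.pop("model")
    if "provider" in kwargs:
        kwargs["model_provider"] = kwargs.pop("provider")
    return model, kwargs


model, kwargs = split_completion_params(
    {"model": "accounts/fireworks/models/kimi-k2-instruct", "provider": "fireworks"}
)
print(model, kwargs)
```

The returned pair could then be passed on as `constructor(model, **kwargs)`, which is why the test keeps `"provider"` in `completion_params` under this assumption.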

dphuang2 reviewed

tests/chinook/langgraph/test_langgraph_chinook.py

Comment on lines +56 to +60

+                  rollout_processor=LangGraphRolloutProcessor(
+                      graph_factory=lambda _: build_graph(),
+                      build_graph_kwargs=build_graph_kwargs,
+                      input_key="messages",
+                      output_key="messages",

Collaborator

dphuang2 Sep 9, 2025

Reading up on LangGraph to understand this; it's confusing at first.

dphuang2 reviewed

tests/chinook/langgraph/test_langgraph_chinook.py (outdated, resolved)

dphuang2 reviewed

tests/chinook/langgraph/test_langgraph_chinook.py (outdated, resolved)

dphuang2 reviewed

tests/chinook/langgraph/test_langgraph_chinook.py

Comment on lines +57 to +58

		graph_factory=lambda _: build_graph(),
		build_graph_kwargs=build_graph_kwargs,

Collaborator

dphuang2 Sep 9, 2025

do these need to be two separate things? why not just have an agent factory:

Callable[[RolloutProcessorConfig], Graph]
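
A minimal sketch of the single-factory shape suggested above, under stated assumptions: `RolloutProcessorConfig` and `Graph` are stand-in dataclasses defined here for illustration (the real types and their fields may differ), and `build_graph` taking a `model` argument is hypothetical.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict


@dataclass
class RolloutProcessorConfig:
    """Stand-in for the real config; this field is an assumption."""
    completion_params: Dict[str, Any] = field(default_factory=dict)


@dataclass
class Graph:
    """Stand-in for a compiled LangGraph graph."""
    model: str


# The proposed single hook: one callable from config to graph.
GraphFactory = Callable[[RolloutProcessorConfig], Graph]


def build_graph(model: str) -> Graph:
    """Hypothetical graph builder parameterized by model name."""
    return Graph(model=model)


def graph_factory(config: RolloutProcessorConfig) -> Graph:
    # The factory derives its build arguments from the config itself,
    # so no separate build_graph_kwargs hook is needed.
    return build_graph(model=config.completion_params["model"])
```

With this shape the processor would call `graph_factory(config)` once per rollout, and any per-run customization lives inside the factory.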

dphuang2 reviewed

tests/chinook/langgraph/test_langgraph_chinook.py (outdated, resolved)

Contributor

xzrderek commented Sep 9, 2025

I think we need to track row.tools; I just made a similar change for the pydantic processor: #168

benjibc added 3 commits

September 10, 2025 00:21


update the test coverage, added tool call example

2d915d1

tests(langgraph): skip when optional deps missing; chore(pyproject): add langgraph/langgraph_tools extras; relax extras to >= versions

ae67124

update lock

f942611

benjibc merged commit 8101180 into main

7 checks passed

benjibc deleted the langgraph_simple_example branch

September 10, 2025 07:29
