refactor(tools): Refactor tool provider system to prepare for MCP support #23

abrookins · 2025-12-08T19:34:42Z

Problem(s)

The current tool interface requires a Python class (a "tool provider") to define available tools and give them categories. For example, a GitHubIssuesToolProvider might define a search_tickets() tool and categorize it as an issues tool.

This works for hand-coded tool wrappers, but it poses problems if we later allow the user to add any MCP server. Doing so would require the user to write at least one custom tool provider for that MCP server. However, a previous design choice made providers 1:1 with tool categories, so the user would need one tool provider per category of tools the MCP server exposes. We should instead have one tool provider (ideally constructed automatically) per MCP server, not several tool providers, one for each tool category.

There was also some overlapping functionality between our tool schema abstractions and LangChain and some oddities worth cleaning up.

Solution

This PR breaks apart and remixes some of the tool provider-related code, such that a tool provider can provide tools for multiple categories. We also lean into relying on LangChain's tool abstractions, reducing the boilerplate necessary for tool schemas.
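To make the shift concrete, here is a minimal sketch of one provider exposing tools in more than one category, with tool names bound to methods automatically. It uses only the standard library; `Capability`, `SketchTool`, `ExampleMCPProvider`, and the tool names are hypothetical stand-ins chosen for illustration, not the classes this PR actually adds.

```python
import asyncio
from dataclasses import dataclass
from enum import Enum
from typing import Any, Awaitable, Callable, Dict, List


class Capability(Enum):
    # Hypothetical categories, for illustration only.
    ISSUES = "issues"
    METRICS = "metrics"


@dataclass
class SketchTool:
    name: str
    capability: Capability
    invoke: Callable[[Dict[str, Any]], Awaitable[Any]]


class ExampleMCPProvider:
    """One provider serving tools from several categories."""

    # Tool name -> category; each name matches a method below.
    schemas = {
        "search_tickets": Capability.ISSUES,
        "query_metrics": Capability.METRICS,
    }

    async def search_tickets(self, query: str) -> str:
        return f"tickets matching {query!r}"

    async def query_metrics(self, expr: str) -> str:
        return f"metrics for {expr!r}"

    def tools(self) -> List[SketchTool]:
        built = []
        for name, capability in self.schemas.items():
            method = getattr(self, name)

            # The default argument pins the current method for this tool.
            async def _invoke(args: Dict[str, Any], _method=method) -> Any:
                return await _method(**(args or {}))

            built.append(SketchTool(name, capability, _invoke))
        return built


tools = ExampleMCPProvider().tools()
by_name = {tool.name: tool for tool in tools}
result = asyncio.run(by_name["search_tickets"].invoke({"query": "timeout"}))
categories = sorted(tool.capability.value for tool in tools)
```

In this sketch a single provider instance yields tools in both the `issues` and `metrics` categories, which is the property an automatically constructed MCP provider would need.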

Changes

Introduce Tool model combining schema, metadata, and invoke callable
Move ToolCapability, ToolMetadata, and ToolDefinition to tools/models.py
Add automatic wiring in ToolProvider.tools() that binds tool names to methods
Update all providers to use create_tool_schemas() + method pattern
Standardize provider_name as @property across all providers
Rename redis_cli provider to redis_command
Update documentation and tests to match new patterns
Fix pre-existing linting issues (N806, N814, E402)


Copilot

Pull request overview

This PR refactors the tool provider system to prepare for MCP (Model Context Protocol) support by introducing a unified Tool model that combines schema, metadata, and execution callable. The refactoring standardizes patterns across all tool providers and includes cleanup of pre-existing linting issues.

Key changes:

Introduced Tool model combining ToolMetadata, ToolDefinition schema, and invoke callable
Moved ToolCapability, ToolMetadata, and ToolDefinition from tool_definition.py and protocols.py to new models.py
Implemented automatic method wiring in ToolProvider.tools() that binds tool names directly to provider methods, eliminating the need for manual resolve_tool_call() implementations
Standardized provider_name as @property across all providers
Renamed redis_cli provider to redis_command for clarity
Replaced defensive getattr() calls with direct attribute access throughout the codebase

Reviewed changes

Copilot reviewed 48 out of 49 changed files in this pull request and generated 15 comments.


File	Description
`redis_sre_agent/tools/models.py`	New file consolidating Tool, ToolCapability, ToolMetadata, ToolDefinition, and SystemHost models
`redis_sre_agent/tools/protocols.py`	Refactored ToolProvider base class with automatic method wiring via tools() and removed weakref pattern for _manager
`redis_sre_agent/tools/tool_definition.py`	File deleted; content moved to models.py
`redis_sre_agent/tools/manager.py`	Updated to work with Tool objects instead of ToolDefinition schemas; added _tool_by_name lookup
`redis_sre_agent/tools/*/provider.py`	All providers updated to remove resolve_tool_call(), add capability to schemas, and use @property for provider_name
`tests/unit/tools/`	Tests updated to use tool.invoke() instead of resolve_tool_call() and removed tests for unknown operation errors
`tests/integration/tools/`	Integration tests updated to call tool.invoke() and verify capability metadata
`redis_sre_agent/core/`	Multiple files updated replacing getattr() with direct attribute access for better type safety
`redis_sre_agent/agent/`	Agent code updated to use new Tool model and build_adapters_for_tooldefs signature
`pyproject.toml`	Migrated dev-dependencies to PEP 735 dependency-groups format
`docs/how-to/tool-providers.md`	Documentation updated showing new provider pattern without resolve_tool_call()


Copilot · 2025-12-08T19:39:48Z

redis_sre_agent/tools/models.py

+class Tool(BaseModel):
+    """Concrete tool object combining schema, metadata, and executor.
+
+    Attributes:
+        metadata: :class:`ToolMetadata` describing the tool for routing.
+        schema: The :class:`ToolDefinition` shown to the LLM (stored as ``Any``
+            here to avoid import cycles).
+        invoke: Async callable taking a single ``Dict[str, Any]`` of arguments
+            and returning the tool result.
+    """
+
+    metadata: ToolMetadata
+    schema: Any
+    invoke: Any


The Tool class uses Any for both schema and invoke fields without proper type hints or validation. This can lead to runtime errors if incorrect types are passed. Consider using proper typing with Protocol or more specific types:

from typing import Awaitable, Callable, Union

class Tool(BaseModel):
    """Concrete tool object combining schema, metadata, and executor."""

    metadata: ToolMetadata
    schema: ToolDefinition  # More specific type instead of Any
    invoke: Callable[[Dict[str, Any]], Awaitable[Any]]  # Proper callable signature

    class Config:
        arbitrary_types_allowed = True  # Allow non-Pydantic types like callables

Copilot · 2025-12-08T19:39:48Z

redis_sre_agent/tools/protocols.py

+            async def _invoke(args: Dict[str, Any], _method=method) -> Any:
+                """Invoke the bound provider method with keyword args."""
+                return await _method(**(args or {}))


[nitpick] The closure in _invoke captures _method with a default argument, but the loop variable method changes on each iteration. This could lead to all tools invoking the last method in the loop. The current implementation with _method=method is correct, but the comment should clarify this is intentional to avoid late binding issues.
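For readers unfamiliar with the late binding issue mentioned here, the following standalone sketch contrasts a closure that reads the loop variable when called with one that pins it through a default argument. The function names are made up for the illustration and are not part of the PR.

```python
from typing import Callable, List


def late_bound(names: List[str]) -> List[Callable[[], str]]:
    funcs = []
    for name in names:

        def call() -> str:
            # `name` is looked up when call() runs, after the loop has ended.
            return name

        funcs.append(call)
    return funcs


def early_bound(names: List[str]) -> List[Callable[[], str]]:
    funcs = []
    for name in names:

        def call(_name: str = name) -> str:
            # The default value is evaluated once, at definition time.
            return _name

        funcs.append(call)
    return funcs


late = [f() for f in late_bound(["a", "b", "c"])]
early = [f() for f in early_bound(["a", "b", "c"])]
```

Here `late` comes out as three copies of the last name, while `early` keeps each name, which is why `_method=method` in `_invoke` is the safe form.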

Copilot · 2025-12-08T19:39:48Z

redis_sre_agent/tools/protocols.py

+            # Use getattr for legitimate metaprogramming - dynamically binding methods
+            method = getattr(self, op_name, None) if op_name else None


[nitpick] Using getattr for legitimate metaprogramming is fine, but the comment "Use getattr for legitimate metaprogramming - dynamically binding methods" should be removed as it's defensive and unnecessary. The usage here is straightforward method lookup by name.

Copilot · 2025-12-08T19:39:48Z

redis_sre_agent/tools/protocols.py

+            # Last resort: treat the whole name as the operation.
+            return tool_name


The resolve_operation method has a fallback that treats the whole tool name as the operation (return tool_name). This could mask errors where tool names don't follow the expected pattern. Consider logging a warning or raising an error instead to catch misconfigurations early.
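A minimal sketch of the "log a warning on fallback" option follows. The naming convention used here (`<provider_name>_<operation>`) is an assumption made for the example; the real tool name pattern is not shown in this thread, and this `resolve_operation` is a free function rather than the provider method under review.

```python
import logging

logger = logging.getLogger("tool_sketch")


def resolve_operation(tool_name: str, provider_name: str) -> str:
    """Return the operation part of a tool name (assumed '<provider>_<op>')."""
    prefix = f"{provider_name}_"
    if tool_name.startswith(prefix) and len(tool_name) > len(prefix):
        return tool_name[len(prefix):]

    # Fallback: keep the old behavior, but make the mismatch visible.
    logger.warning(
        "Tool name %r does not match prefix %r; using the full name as the operation",
        tool_name,
        prefix,
    )
    return tool_name


matched = resolve_operation("redis_command_info", "redis_command")
fallback = resolve_operation("unexpected", "redis_command")
```

Logging rather than raising keeps existing callers working while still surfacing misconfigured names; raising would be the stricter alternative.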

Copilot · 2025-12-08T19:39:49Z

redis_sre_agent/tools/protocols.py

+    # Back-reference to the manager (set by ToolManager on load)
+    _manager: Optional["ToolManager"] = None


The code removes the _manager weak reference pattern and replaces it with a direct reference. This could potentially create reference cycles that prevent garbage collection. If ToolProvider instances hold a reference to ToolManager and ToolManager holds references to providers in _providers and _routing_table, this creates a cycle. The weak reference pattern was likely intentional to avoid this.

Consider keeping the weak reference pattern or documenting why it's safe to remove.
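For context, a weak back-reference can be sketched as below. `Manager` and `Provider` are simplified stand-ins, not the project's `ToolManager` and `ToolProvider`. Note that CPython's cycle collector can still reclaim plain reference cycles, so a direct reference mainly delays cleanup rather than leaking outright.

```python
import gc
import weakref
from typing import List, Optional


class Manager:
    def __init__(self) -> None:
        self.providers: List["Provider"] = []

    def load(self, provider: "Provider") -> None:
        self.providers.append(provider)
        provider.attach(self)


class Provider:
    def __init__(self) -> None:
        self._manager_ref: Optional[weakref.ReferenceType] = None

    def attach(self, manager: Manager) -> None:
        # A weak reference does not keep the manager alive.
        self._manager_ref = weakref.ref(manager)

    @property
    def manager(self) -> Optional[Manager]:
        return self._manager_ref() if self._manager_ref else None


manager = Manager()
provider = Provider()
manager.load(provider)
alive_before = provider.manager is manager

del manager
gc.collect()
alive_after = provider.manager is not None
```

With the weak reference, the provider sees `None` once the manager is gone, so callers must handle that case explicitly.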

Copilot · 2025-12-08T19:39:50Z

redis_sre_agent/observability/llm_metrics.py

+                        completion or usage.get("completion_tokens") or usage.get("output_tokens")
+                    )
+                    total = total or usage.get("total_tokens")
+        except AttributeError:


'except' clause does nothing but pass and there is no explanatory comment.

Suggested change

except AttributeError:

except AttributeError:

# It's expected that some response objects may not have response_metadata.

Copilot · 2025-12-08T19:39:51Z

redis_sre_agent/agent/langgraph_agent.py

+                            resp = e.response  # type: ignore[attr-defined]
+                            if resp:
+                                status = resp.status_code
+                        except AttributeError:


'except' clause does nothing but pass and there is no explanatory comment.

Copilot · 2025-12-08T19:39:51Z

redis_sre_agent/agent/langgraph_agent.py

+                                except AttributeError:
+                                    try:
+                                        body = resp.content
+                                    except AttributeError:


'except' clause does nothing but pass and there is no explanatory comment.

Suggested change

except AttributeError:

except AttributeError:

# Safe to ignore: response object does not have 'content' attribute

Copilot · 2025-12-08T19:39:51Z

redis_sre_agent/agent/langgraph_agent.py

+                                        pass
+                                if isinstance(body, (bytes, bytearray)):
+                                    body = body.decode("utf-8", errors="ignore")
+                        except AttributeError:


'except' clause does nothing but pass and there is no explanatory comment.

Suggested change

except AttributeError:

except AttributeError:

# Attribute missing on response object; safe to ignore

Copilot · 2025-12-08T19:39:51Z

redis_sre_agent/tools/metrics/prometheus/provider.py

+        if last_err:
+            logger.debug(f"_wait_for_targets encountered: {last_err}")
+
+        pass


Unnecessary 'pass' statement.

- Add warning logging when resolve_operation falls back to full tool name
- Document that resolve_operation is for tracing/status, not routing
- Add proper types to Tool class (Callable, ToolDefinition) with ConfigDict
- Remove unnecessary pass statement in prometheus provider
- Clarify _invoke closure default arg captures method to avoid late binding
- Add explanatory comments for AttributeError handlers probing response formats
- Simplify getattr comment in tools() method

Copilot

Pull request overview

Copilot reviewed 48 out of 49 changed files in this pull request and generated 10 comments.

Comments suppressed due to low confidence (1)

redis_sre_agent/tools/models.py:7

The module 'redis_sre_agent.tools.models' imports itself.


redis_sre_agent/tools/models.py

Copilot · 2025-12-08T23:14:06Z

redis_sre_agent/tools/protocols.py

            if not op:
                return None
-            method = getattr(self, op, None)
+            method = self.__dict__.get(op) or type(self).__dict__.get(op)


Using __dict__.get() and type(self).__dict__.get() for attribute lookup is unusual and error-prone. The first call self.__dict__.get(op) will only find instance attributes, missing class-level methods. The second call type(self).__dict__.get(op) will miss inherited methods from parent classes. This should use getattr(self, op, None) to properly traverse the MRO (Method Resolution Order) and find both instance and class attributes correctly.
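The MRO point can be checked with a small standalone example. The class names are hypothetical; the sketch only shows how the two lookups behave for an inherited method.

```python
class BaseProvider:
    def info(self) -> str:
        return "info from base"


class ChildProvider(BaseProvider):
    pass


child = ChildProvider()

# Neither dictionary contains the inherited method.
via_instance_dict = child.__dict__.get("info")
via_class_dict = type(child).__dict__.get("info")

# getattr walks the MRO and returns a bound method.
via_getattr = getattr(child, "info", None)
called = via_getattr() if via_getattr else None
```

Even when `type(self).__dict__` does contain the name, it returns the plain function rather than a bound method, so calling it would still need `self` passed by hand.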

Copilot · 2025-12-08T23:14:07Z

redis_sre_agent/tools/protocols.py

+            method = self.__dict__.get(op) or type(self).__dict__.get(op)
            if not method:
                return None
-            template = getattr(method, "_status_update_template", None)
+            template = method._status_update_template if method else None


Accessing _status_update_template with a conditional expression on line 412 is inconsistent with line 409. If method is None (from line 410), this will raise an AttributeError. The access pattern should be consistent: either use getattr or handle None properly.

Copilot · 2025-12-08T23:14:07Z

redis_sre_agent/tools/manager.py

            module_path, class_name = provider_path.rsplit(".", 1)
            module = __import__(module_path, fromlist=[class_name])
-            cls._provider_class_cache[provider_path] = getattr(module, class_name)
+            cls._provider_class_cache[provider_path] = module.__dict__[class_name]


Using module.__dict__[class_name] instead of getattr(module, class_name) removes the ability to handle missing attributes gracefully. If the class doesn't exist in the module, this will raise a KeyError instead of the expected AttributeError, potentially breaking error handling code that catches AttributeError.
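The difference in exception type is easy to reproduce with a throwaway module object. `example_module` and `error_name` are names invented for this sketch.

```python
import types
from typing import Callable

module = types.ModuleType("example_module")


def error_name(func: Callable[[], object]) -> str:
    """Run func and report the name of the exception it raises, if any."""
    try:
        func()
    except Exception as exc:
        return type(exc).__name__
    return "no error"


dict_error = error_name(lambda: module.__dict__["Missing"])
getattr_error = error_name(lambda: getattr(module, "Missing"))
```

Any caller that wraps the lookup in `except AttributeError` would therefore stop catching the missing-class case after the switch to `__dict__`.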

Copilot · 2025-12-08T23:14:07Z

redis_sre_agent/cli/main.py

        module_path, attr = target.split(":", 1)
        mod = importlib.import_module(module_path)
-        return getattr(mod, attr)
+        return mod.__dict__[attr]


Using mod.__dict__[attr] (line 39) instead of getattr(mod, attr) will raise KeyError instead of AttributeError when the attribute doesn't exist. This breaks the expected exception contract and could cause issues with error handling that specifically catches AttributeError.

Suggested change

return mod.__dict__[attr]

return getattr(mod, attr)

redis_sre_agent/tools/models.py

Copilot · 2025-12-08T23:14:08Z

redis_sre_agent/tools/logs/loki/provider.py

+                    ic = self.instance_config
+                    if ic and ic.prefer_streams:


Direct attribute access without checking if instance_config exists. While the try/except block provides some protection, line 382 assumes ic (which is self.instance_config) is not None when checking ic.prefer_streams. If instance_config_model is set but _load_instance_extension_config() returned None, this could cause an AttributeError. Should check ic is not None before accessing its attributes.

redis_sre_agent/core/threads.py

Copilot · 2025-12-08T23:14:08Z

redis_sre_agent/core/tasks.py

+                    "created_at",
+                    "updated_at",
+                ]:
+                    row[k] = res.__dict__.get(k)


Same issue: accessing res.__dict__.get(k) (line 418) assumes res has a __dict__ attribute. Should use getattr(res, k, None) for safer attribute access that works with all Python objects.

Suggested change

row[k] = res.__dict__.get(k)

row[k] = getattr(res, k, None)
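One case where `__dict__` is missing is a class that declares `__slots__`. The sketch below uses a hypothetical `SlottedResult` class to show that `getattr` with a default still works there; whether `res` in tasks.py can ever be such an object is not established in this thread.

```python
class SlottedResult:
    # Instances of this class have no per-instance __dict__.
    __slots__ = ("created_at", "updated_at")

    def __init__(self, created_at: str, updated_at: str) -> None:
        self.created_at = created_at
        self.updated_at = updated_at


res = SlottedResult("2025-12-08", "2025-12-09")
has_dict = hasattr(res, "__dict__")

row = {}
for k in ["created_at", "updated_at", "missing"]:
    # getattr works with or without __dict__ and tolerates absent names.
    row[k] = getattr(res, k, None)
```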

redis_sre_agent/api/app.py


abrookins requested a review from Copilot December 8, 2025 19:34

Copilot started reviewing on behalf of abrookins December 8, 2025 19:35

Copilot AI reviewed Dec 8, 2025


abrookins requested a review from Copilot December 8, 2025 23:09

Copilot started reviewing on behalf of abrookins December 8, 2025 23:10

Copilot AI reviewed Dec 8, 2025


abrookins added 7 commits December 8, 2025 15:19

refactor: Rename default_requires_instance to requires_redis_instance

c256e8b

Simplify the property to return bool instead of Optional[bool]. Base class returns whether redis_instance was provided; subclasses override to return True or False explicitly.

Incorporate review feedback

7b6eff6

Lint

e97da35

fix: Resolve runtime errors in agent execution

7707e87

- Rename Tool.schema to Tool.definition to avoid shadowing BaseModel attribute
- Use getattr for llm.model and llm.temperature to handle ChatOpenAI which uses model_name

refactor: Remove redundant try/except around opentelemetry imports

d7efba2

Use module-level tracer instead of defensive inline imports

refactor: Remove redundant try/except around opentelemetry imports

3d1013d

Use module-level tracer instead of defensive inline imports in knowledge_agent.py, runbook_generator.py, and knowledge_base.py. Also update test for two ChatOpenAI instances in SRELangGraphAgent.

fix: Correct mock patch paths for safety_fact_corrector tests

938cec5

Patch at usage location (langgraph_agent) not definition location
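As a generic illustration of why the patch target matters, the sketch below builds two in-memory modules and patches the same function at each location. The module names `definition_mod` and `usage_mod` and the function `correct` are invented for the example and do not correspond to the project's files.

```python
import sys
import types
from unittest.mock import patch

# Module that defines the function.
definition_mod = types.ModuleType("definition_mod")
exec("def correct(text):\n    return text.upper()\n", definition_mod.__dict__)
sys.modules["definition_mod"] = definition_mod

# Module that imports the function by name and uses it.
usage_mod = types.ModuleType("usage_mod")
exec(
    "from definition_mod import correct\n"
    "\n"
    "def run(text):\n"
    "    return correct(text)\n",
    usage_mod.__dict__,
)
sys.modules["usage_mod"] = usage_mod

# Patching the definition leaves usage_mod's own reference untouched.
with patch("definition_mod.correct", return_value="mocked"):
    patched_at_definition = usage_mod.run("hi")

# Patching where the name is looked up affects the call.
with patch("usage_mod.correct", return_value="mocked"):
    patched_at_usage = usage_mod.run("hi")
```

Because `from ... import` copies the reference into the importing module, only the second patch changes what `run` calls.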

abrookins merged commit 1a1be72 into main Dec 9, 2025
5 checks passed

abrookins changed the title ~~refactor(tools): Refactor tool provider system for MCP support~~ refactor(tools): Refactor tool provider system to prepare for MCP support Dec 9, 2025

