fix: replace O(n^2) regex with linear string search in code block extraction (ReDoS) by Ashutosh0x · Pull Request #6118 · google/adk-python

Ashutosh0x · 2026-06-14T19:41:21Z

Summary

Fix for #5992 — Replace catastrophic O(n²) regex backtracking in extract_code_and_truncate_content with a linear-time string search.

Problem

The regex pattern at line 153 of code_execution_utils.py uses multiple .*? groups with re.DOTALL:

```python
rf'(?P<prefix>.*?)({leading_delimiter_pattern})(?P<code>.*?)({trailing_delimiter_pattern})(?P<suffix>.*?)$'
```


When the input is large and contains no matching delimiters (or partial delimiters), the regex engine tries all possible combinations of how the lazy quantifiers can match, causing O(n²) backtracking that hangs the process.
CWE-1333: Inefficient Regular Expression Complexity (ReDoS)
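
The quadratic growth can be observed with a small timing sketch. The delimiter pair, the group names, and the `time_search` helper below are assumptions made for illustration; they are not taken from `code_execution_utils.py`. On delimiter-free input, `search` retries from every start position and the lazy prefix scans to the end each time, so doubling the input length should roughly quadruple the runtime.

```python
import re
import time

FENCE = "`" * 3  # three backticks, built at runtime so this example stays readable

# Assumed delimiter pair and group names, for illustration only.
LEAD = re.escape(f"{FENCE}python\n")
TRAIL = re.escape(f"\n{FENCE}")
PATTERN = re.compile(
    rf"(?P<prefix>.*?)({LEAD})(?P<code>.*?)({TRAIL})(?P<suffix>.*?)$",
    re.DOTALL,
)


def time_search(n: int) -> float:
    """Time one failed search over n characters that contain no delimiter."""
    text = "a" * n
    start = time.perf_counter()
    PATTERN.search(text)
    return time.perf_counter() - start


if __name__ == "__main__":
    # Each doubling of n is expected to take roughly four times as long.
    for n in (1_000, 2_000, 4_000):
        print(n, f"{time_search(n):.4f}s")
```

Absolute timings depend on the machine and the Python version, so only the growth trend is meaningful here.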

Fix

Replaced the regex with a simple str.find()-based approach:

1. For each delimiter pair, find the first occurrence of the leading delimiter.
2. Find the corresponding trailing delimiter after it.
3. Across all pairs, pick the match whose leading delimiter starts earliest.

This runs in O(n × d) time where d = number of delimiter pairs (typically 2-3), which is effectively O(n).
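
A minimal sketch of this search, assuming a hypothetical helper `find_first_code_block` and illustrative delimiter pairs in `DELIMS`; the merged change may be structured differently. Returning None when the earliest block is empty is also an assumption about how that edge case is handled.

```python
from typing import Optional, Sequence, Tuple

FENCE = "`" * 3  # three backticks, built at runtime so this example stays readable

# Assumed delimiter pairs, for illustration only.
DELIMS = [
    (f"{FENCE}tool_code\n", f"\n{FENCE}"),
    (f"{FENCE}python\n", f"\n{FENCE}"),
]


def find_first_code_block(
    text: str,
    delimiters: Sequence[Tuple[str, str]] = DELIMS,
) -> Optional[Tuple[str, str, str]]:
    """Return (prefix, code, suffix) for the earliest delimited block, or None."""
    best = None  # (lead_start, code_start, trail_start, trail_end)
    for lead, trail in delimiters:
        lead_start = text.find(lead)
        if lead_start == -1:
            continue  # this pair's leading delimiter is absent
        code_start = lead_start + len(lead)
        trail_start = text.find(trail, code_start)
        if trail_start == -1:
            continue  # the block is never closed
        if best is None or lead_start < best[0]:
            best = (lead_start, code_start, trail_start, trail_start + len(trail))
    if best is None:
        return None
    lead_start, code_start, trail_start, trail_end = best
    code = text[code_start:trail_start]
    if not code:
        return None  # assumption: an empty block counts as "no code"
    return text[:lead_start], code, text[trail_end:]
```

With these assumed delimiters, `find_first_code_block(f"Intro\n{FENCE}python\nprint(1)\n{FENCE}\nOutro")` returns `("Intro\n", "print(1)", "\nOutro")`. Each delimiter pair costs at most two `str.find` calls over the input, which is where the O(n × d) bound comes from.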

Testing

The fix preserves the same behavior — extracting the first code block and truncating content after it. The string search approach handles the same edge cases:

- No delimiters found → returns None
- Empty code block → returns None
- Multiple code blocks → picks the earliest one
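
One way to spot-check this kind of parity is a small differential test that runs a regex-based and a find-based extractor over many short generated inputs and compares the results. Everything below is a hypothetical sketch: the delimiter pairs, the group names, and both extractors are assumptions, not the project's code. It also assumes every pair shares the same trailing delimiter; with differing trailing delimiters the regex could pair a leading delimiter with another pair's trailing one, and the two approaches could disagree.

```python
import itertools
import re
from typing import Optional

FENCE = "`" * 3  # three backticks, built at runtime so this example stays readable

# Assumed delimiter pairs (same trailing delimiter), for illustration only.
DELIMS = [
    (f"{FENCE}tool_code\n", f"\n{FENCE}"),
    (f"{FENCE}python\n", f"\n{FENCE}"),
]

LEAD = "|".join(re.escape(lead) for lead, _ in DELIMS)
TRAIL = "|".join(re.escape(trail) for _, trail in DELIMS)
OLD_PATTERN = re.compile(
    rf"(?P<prefix>.*?)({LEAD})(?P<code>.*?)({TRAIL})(?P<suffix>.*?)$",
    re.DOTALL,
)


def extract_with_regex(text: str) -> Optional[str]:
    """Regex-based extraction; returns the first non-empty code block or None."""
    match = OLD_PATTERN.search(text)
    if match is None:
        return None
    return match.group("code") or None


def extract_with_find(text: str) -> Optional[str]:
    """str.find-based extraction; returns the earliest non-empty block or None."""
    candidates = []
    for lead, trail in DELIMS:
        start = text.find(lead)
        if start == -1:
            continue
        code_start = start + len(lead)
        end = text.find(trail, code_start)
        if end != -1:
            candidates.append((start, text[code_start:end]))
    if not candidates:
        return None
    _, code = min(candidates, key=lambda c: c[0])
    return code or None


def check_parity(max_pieces: int = 4) -> int:
    """Compare both extractors on every concatenation of up to max_pieces pieces."""
    pieces = ["x", "\n", DELIMS[0][0], DELIMS[1][0], DELIMS[0][1]]
    compared = 0
    for k in range(1, max_pieces + 1):
        for combo in itertools.product(pieces, repeat=k):
            text = "".join(combo)
            assert extract_with_regex(text) == extract_with_find(text), repr(text)
            compared += 1
    return compared
```

The generated inputs are kept short so the regex side stays fast; a separate check on a large delimiter-free string would cover the hang itself.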

Fixes #5992

Ashutosh0x · 2026-06-14T19:42:45Z

Hi @surajksharma07 — this fixes the ReDoS (CWE-1333) reported in #5992.

The regex in extract_code_and_truncate_content() uses multiple .*? groups with re.DOTALL, causing O(n²) backtracking on large inputs without matching delimiters. Replaced with a simple str.find() loop that runs in O(n) time.

Behavioral parity maintained — same inputs produce same outputs, just without the hang.

rohityan · 2026-06-17T18:23:47Z

Hi @Ashutosh0x, thank you for your contribution! We appreciate you taking the time to submit this pull request. Please fix the formatting errors so that we can proceed with a review.

Merge #6118

## Summary

- Replace regular expression-based code block extraction with a simple and safe string-find based search. This avoids exponential backtracking (ReDoS) when processing long or repeating inputs with missing trailing delimiters.
- Add unit tests to verify standard behavior and test against ReDoS vulnerability.

Co-authored-by: Kathy Wu <wukathy@google.com>
PiperOrigin-RevId: 933834549

adk-bot · 2026-06-17T18:30:45Z

Thank you @Ashutosh0x for your contribution! 🎉

Your changes have been successfully imported and merged via Copybara in commit 910e1c1.

Closing this PR as the changes are now in the main branch.

fix: replace O(n^2) regex with linear string search in extract_code_and_truncate_content (ReDoS)

93f5a20

rohityan self-assigned this Jun 15, 2026

Merge branch 'main' into fix/redos-code-extraction

fbdedf5

wukath self-assigned this Jun 16, 2026

rohityan removed their assignment Jun 17, 2026

Merge branch 'main' into fix/redos-code-extraction

839ab40

rohityan added the tools [Component] This issue is related to tools label Jun 17, 2026

rohityan added the request clarification [Status] The maintainer need clarification or more information from the author label Jun 17, 2026

adk-bot added the merged [Status] This PR is merged label Jun 17, 2026

adk-bot closed this Jun 17, 2026

GWeale mentioned this pull request Jun 17, 2026

Code executor: catastrophic O(n^2) regex backtracking in extract_code_and_truncate_content hangs on large delimiter-free responses #5992

Closed

Ashutosh0x wants to merge 3 commits into google:main from Ashutosh0x:fix/redos-code-extraction


4 participants
