fix(ghidra): implement flow-insensitive block discovery by sashwathsubra · Pull Request #2990 · mandiant/capa

sashwathsubra · 2026-04-03T09:47:01Z

This is a focused PR to fix the Ghidra function truncation bug.
Related Issue
Fixes #2989


gemini-code-assist

Code Review

This pull request introduces several bug fixes and performance optimizations, most notably a flow-insensitive block iteration for Ghidra to prevent function truncation and various safety checks in Ghidra extractors. However, the newly introduced _RuleFeatureIndex contains a critical bug that breaks Substring and Regex feature matching and causes a performance regression by re-indexing rules during every match call. A copy-paste error was also identified in the changelog.

gemini-code-assist · 2026-04-03T09:49:43Z

capa/engine.py

-#         inspect(match_details)
-#
-# aliased here so that the type can be documented and xref'd.
+class _RuleFeatureIndex:


The _RuleFeatureIndex implementation has a critical correctness bug: it breaks rules that use Substring or Regex features. These features match against String features in the FeatureSet via partial or pattern matching. However, get_candidates performs an exact lookup (feature in self.features). Since the extracted features in the FeatureSet are String objects and the indexed features are Substring/Regex objects, they will never match exactly, causing these rules to be incorrectly filtered out and never evaluated.
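To illustrate the failure mode described above, here is a minimal sketch using toy stand-ins for the feature types. The `String`, `Substring`, and `Regex` classes and both helper functions are hypothetical and written only for this example; they are not capa's actual classes or the PR's code. The point is that an exact membership test can never pair a pattern feature with the `String` it would match.

```python
import re
from dataclasses import dataclass


# Toy stand-ins for feature types (hypothetical, not capa's real classes).
@dataclass(frozen=True)
class String:
    value: str


@dataclass(frozen=True)
class Substring:
    value: str

    def matches(self, s: String) -> bool:
        return self.value in s.value


@dataclass(frozen=True)
class Regex:
    pattern: str

    def matches(self, s: String) -> bool:
        return re.search(self.pattern, s.value) is not None


def exact_candidates(rule_features, extracted):
    """Exact lookup: keep a rule only if one of its features is literally in the set."""
    return {
        name
        for name, feats in rule_features.items()
        if any(f in extracted for f in feats)
    }


def pattern_aware_candidates(rule_features, extracted):
    """Also test pattern features against every extracted String."""
    strings = [f for f in extracted if isinstance(f, String)]
    out = set()
    for name, feats in rule_features.items():
        for f in feats:
            if isinstance(f, (Substring, Regex)):
                hit = any(f.matches(s) for s in strings)
            else:
                hit = f in extracted
            if hit:
                out.add(name)
                break
    return out


rules = {
    "exact rule": [String("cmd.exe")],
    "substring rule": [Substring("cmd")],
    "regex rule": [Regex(r"\.exe$")],
}
extracted = {String("cmd.exe")}

exact_hits = exact_candidates(rules, extracted)          # pattern rules are dropped
aware_hits = pattern_aware_candidates(rules, extracted)  # pattern rules survive
```

In this sketch the exact lookup keeps only the rule that uses a plain `String`, while the pattern-aware variant keeps all three. A simpler alternative, under the same assumptions, would be to treat every rule containing a pattern feature as an unconditional candidate.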

gemini-code-assist · 2026-04-03T09:49:43Z

capa/engine.py

+    index = _RuleFeatureIndex(rules)
+    candidates = index.get_candidates(features)


Instantiating _RuleFeatureIndex(rules) inside the match function introduces a significant performance regression. The match function is called for every scope (file, function, basic block) during analysis. Re-indexing the entire rule set on every call is computationally expensive ($O(Rules \times Features)$) and likely far outweighs the benefit of skipping rule.evaluate calls, especially since evaluate is already optimized for short-circuiting. The index should be constructed once and reused, or this change should be reverted.
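The "construct once and reuse" suggestion can be sketched as follows. `RuleIndex`, `match_rebuilding`, and `match_reusing` are hypothetical names invented for this illustration, with features modeled as plain strings; this is not the PR's `_RuleFeatureIndex` or capa's `match` signature. The sketch only counts how often the index is built in each style.

```python
from collections import defaultdict


class RuleIndex:
    """Toy inverted index: feature -> names of rules that mention it."""

    builds = 0  # how many times an index has been constructed

    def __init__(self, rules):
        RuleIndex.builds += 1
        self.by_feature = defaultdict(set)
        for name, feats in rules.items():
            for f in feats:
                self.by_feature[f].add(name)

    def get_candidates(self, features):
        out = set()
        for f in features:
            # .get avoids inserting empty entries into the defaultdict
            out |= self.by_feature.get(f, set())
        return out


def match_rebuilding(rules, features):
    # Pattern flagged in the review: the index is rebuilt on every call.
    return RuleIndex(rules).get_candidates(features)


def match_reusing(index, features):
    # Alternative: the caller builds the index once and passes it in.
    return index.get_candidates(features)


rules = {"a": {"api:CreateFile"}, "b": {"api:connect"}}
scopes = [{"api:CreateFile"}, {"api:connect"}, {"api:other"}]

RuleIndex.builds = 0
rebuilt = [match_rebuilding(rules, s) for s in scopes]
builds_when_rebuilding = RuleIndex.builds  # one build per scope

RuleIndex.builds = 0
index = RuleIndex(rules)
reused = [match_reusing(index, s) for s in scopes]
builds_when_reusing = RuleIndex.builds  # a single build
```

Both styles return the same candidates; only the number of index constructions differs, growing with the number of scopes in the first style and staying at one in the second.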

CHANGELOG.md

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

mike-hunhoff · 2026-04-03T15:53:36Z

@sashwathsubra This pull request contains unrelated changes to the rules engine and cache. It also contains unrelated formatting changes. Please remove all unrelated changes and post a screenshot of all tests passing locally before we give this a review.

fix(ghidra): implement flow-insensitive block discovery to prevent function truncation

2f74093

gemini-code-assist bot reviewed Apr 3, 2026


sashwathsubra and others added 2 commits April 3, 2026 19:25

Update CHANGELOG.md

f0aa795

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Merge branch 'master' into fix/ghidra-function-truncation

bf05bec


sashwathsubra wants to merge 3 commits into mandiant:master from sashwathsubra:fix/ghidra-function-truncation


3 participants
