fix: HierarchicalSessionStore stale extended cache after cross-instance writes by cursor[bot] · Pull Request #1785 · MervinPraison/PraisonAI

cursor · 2026-06-01T09:03:40Z

Bug and impact

HierarchicalSessionStore.get_extended_session() could return a truncated message list (and stale hierarchy fields) when another process or store instance had written newer messages to the same session file.

Trigger: Gateway worker A warms _extended_cache → worker B appends messages → worker A calls get_extended_session() → UI/fork/snapshot logic sees old state while get_chat_history() (fixed in #1759) looked correct.

Severity: Session data loss / wrong fork and snapshot behaviour in multi-worker deployments.

Root cause

PR #1759 added _read_session_fresh() for DefaultSessionStore reads but HierarchicalSessionStore kept a separate _extended_cache that was not refreshed on read. get_extended_session() still used _load_extended_session() without force_reload.

Fix

Override _read_session_fresh() to sync _extended_cache with disk-backed _cache
Route get_extended_session() through _read_session_fresh()
Clear _extended_cache in invalidate_cache()

Validation

New regression test: test_get_extended_session_sees_writes_from_other_store
pytest tests/unit/session/test_hierarchy.py tests/unit/session/test_session_store.py — 58 passed

Summary by CodeRabbit

Bug Fixes
- Improved session data consistency by ensuring extended session information is freshly reloaded from disk rather than relying on potentially stale cache in multi-store scenarios.
New Features
- Added cache invalidation capability for session management to clear in-memory cache when needed.

MervinPraison · 2026-06-01T09:03:52Z

@coderabbitai review

MervinPraison · 2026-06-01T09:03:53Z

/review

coderabbitai · 2026-06-01T09:03:59Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

qodo-code-review · 2026-06-01T09:04:00Z

Code Review by Qodo

🐞 Bugs (2) 📘 Rule violations (0)

1. Cached write overwrites updates 🐞 Bug ≡ Correctness

Description

Multiple mutating methods still do read-then-write via _load_extended_session() (which can return
stale _extended_cache data) followed by _save_extended_session(), which can overwrite newer
messages/fields written by another process. get_extended_session() now refreshes from disk, but
write paths like create_session(parent update), set_title/share_session/unshare_session, and
auto_title still risk session truncation in multi-worker deployments.

Code

src/praisonai-agents/praisonaiagents/session/hierarchy.py[R586-588]

Evidence
_load_extended_session() explicitly serves from an in-memory cache unless force_reload=True, while
_save_extended_session() writes the full session object back to disk; several methods use this
pattern, so a stale cached session can overwrite newer disk state. A safer read-modify-write
primitive already exists (_modify_session_locked) which reloads under FileLock before writing.
src/praisonai-agents/praisonaiagents/session/hierarchy.py[214-243]
src/praisonai-agents/praisonaiagents/session/hierarchy.py[245-273]
src/praisonai-agents/praisonaiagents/session/hierarchy.py[284-323]
src/praisonai-agents/praisonaiagents/session/hierarchy.py[501-516]
src/praisonai-agents/praisonaiagents/session/hierarchy.py[518-576]
src/praisonai-agents/praisonaiagents/session/hierarchy.py[167-183]
src/praisonai-agents/praisonaiagents/session/store.py[315-350]

Agent prompt

The issue below was found during a code review. Follow the provided context and guidance below and implement a solution

### Issue description
Several `HierarchicalSessionStore` methods mutate sessions by calling `_load_extended_session()` and then `_save_extended_session()`. Because `_load_extended_session()` serves from `_extended_cache` unless `force_reload=True`, these methods can write a stale in-memory snapshot back to disk and **overwrite newer messages/fields** written by another process.

### Issue Context
- `_load_extended_session()` returns cached data when `force_reload=False`.
- `_save_extended_session()` writes the *entire* session JSON (including the messages list), so saving a stale object can truncate newer messages.
- There is already an existing safe primitive: `_modify_session_locked()` (via `DefaultSessionStore._modify_session_locked`) which does a fresh read under `FileLock` and writes atomically.

### Fix Focus Areas
- src/praisonai-agents/praisonaiagents/session/hierarchy.py[214-323]
- src/praisonai-agents/praisonaiagents/session/hierarchy.py[495-577]
- src/praisonai-agents/praisonaiagents/session/hierarchy.py[167-183]
- src/praisonai-agents/praisonaiagents/session/store.py[315-350]

### What to change
1. For any method that **writes** an extended session (e.g., `set_title`, `share_session`, `unshare_session`, `revert_to_snapshot`, parent update inside `create_session`, and the write phase of `auto_title`):
  - Replace the pattern `session = self._load_extended_session(...); mutate; self._save_extended_session(session)` with a single-file-lock read-modify-write using `self._modify_session_locked(...)`.
  - Example for `set_title`:
    ```py
    def set_title(self, session_id: str, title: str) -> bool:
        def _apply(session: SessionData) -> None:
            assert isinstance(session, ExtendedSessionData)
            session.title = title
        return self._modify_session_locked(session_id, _apply, error_label="set session title")
    ```
2. In `create_session(parent_id=...)`, update the parent’s `children_ids` using `_modify_session_locked(parent_id, ...)` so concurrent child creation doesn’t lose updates.
3. In `auto_title`, ensure the final write uses `_modify_session_locked` (or at minimum a `force_reload=True` read under the same `FileLock`) so setting the title cannot overwrite messages appended by other workers.

### Acceptance criteria
- Concurrent cross-process appends to a session are preserved even when `set_title/share_session/unshare_session/create_session(parent update)/auto_title` run on a different worker with a warmed `_extended_cache`.
- Add/extend a regression test similar to `test_get_extended_session_sees_writes_from_other_store` that demonstrates no truncation when a stale reader performs one of these metadata-only writes.

ⓘ Copy this prompt and use it to remediate the issue with your preferred AI generation tools

2. Cache invalidation not atomic 🐞 Bug ☼ Reliability

Description

HierarchicalSessionStore.invalidate_cache() clears the base _cache via super().invalidate_cache()
and then clears _extended_cache under a separate lock acquisition, allowing other threads to observe
inconsistent cache state mid-invalidation. This can reintroduce transient stale reads from
_extended_cache within the same process.

Code

src/praisonai-agents/praisonaiagents/session/hierarchy.py[R590-597]

Evidence
DefaultSessionStore.invalidate_cache() clears _cache under a lock; the new override calls that
method and then separately locks to clear _extended_cache, creating a race window where only one
cache has been cleared.
src/praisonai-agents/praisonaiagents/session/hierarchy.py[590-597]
src/praisonai-agents/praisonaiagents/session/store.py[784-790]

Agent prompt

The issue below was found during a code review. Follow the provided context and guidance below and implement a solution

### Issue description
`HierarchicalSessionStore.invalidate_cache()` is not atomic across `_cache` and `_extended_cache` because it calls `super().invalidate_cache()` (which acquires/releases `self._lock`) and then acquires `self._lock` again to clear `_extended_cache`. Between the two lock scopes, another thread may read a partially-invalidated state.

### Issue Context
`DefaultSessionStore.invalidate_cache()` performs cache invalidation under a single `with self._lock:` block.

### Fix Focus Areas
- src/praisonai-agents/praisonaiagents/session/hierarchy.py[590-597]
- src/praisonai-agents/praisonaiagents/session/store.py[784-790]

### What to change
Re-implement `HierarchicalSessionStore.invalidate_cache()` to clear **both** `_cache` and `_extended_cache` under a single `with self._lock:` block, without calling `super()`.

Example:
```py
def invalidate_cache(self, session_id: Optional[str] = None) -> None:
   with self._lock:
       if session_id:
           self._cache.pop(session_id, None)
           self._extended_cache.pop(session_id, None)
       else:
           self._cache.clear()
           self._extended_cache.clear()
```

ⓘ Copy this prompt and use it to remediate the issue with your preferred AI generation tools

coderabbitai · 2026-06-01T09:04:10Z

Important

Review skipped

Bot user detected.

To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 4216ac70-1729-4b44-9ed9-f030e80ee24d

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

📝 Walkthrough

Walkthrough

HierarchicalSessionStore now includes a _read_session_fresh() method that reloads sessions from disk and synchronizes extended cache state under lock. get_extended_session() is updated to use fresh reloads, and a new invalidate_cache() method clears both base and extended in-memory caches. A test validates that fresh reads return current disk state across multiple store instances.

Changes

Fresh session reload and cache management

Layer / File(s)	Summary
Fresh session reload with extended cache sync `src/praisonai-agents/praisonaiagents/session/hierarchy.py`	`_read_session_fresh()` method reloads sessions from the base store and synchronizes `_extended_cache` under the store lock for `ExtendedSessionData` results, ensuring in-memory cache aligns with disk state.
Extended session freshness and cache invalidation `src/praisonai-agents/praisonaiagents/session/hierarchy.py`, `src/praisonai-agents/tests/unit/session/test_hierarchy.py`	`get_extended_session()` is updated to return fresh reloaded sessions via `_read_session_fresh()`. New `invalidate_cache()` method clears base and extended caches (scoped by `session_id` or all sessions). Test case validates fresh reads reflect latest disk state when multiple store instances write to the same session directory.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

Possibly related issues

docs: update session persistence docs for read-consistency fix (PraisonAI #1759) PraisonAIDocs#449: Implements read-fresh cache behavior similar to the pattern used in other session store implementations to ensure stale cache data is not returned.

Possibly related PRs

MervinPraison/PraisonAI#1745: Both PRs modify HierarchicalSessionStore's extended-session cache synchronization in the same file, implementing fresh reload logic and cache alignment after writes.
MervinPraison/PraisonAI#1759: Both PRs implement stale-cache read fixes by introducing _read_session_fresh()-style disk reload paths across different session store classes.

Poem

🐰 A cache grew stale on the disk below,
So fresh reads reload—let the data flow!
Locks keep the state synchronized bright,
Now every peek sees the truth, pure and right.
Hop-hop! The stores are aligned at last! 🌟

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately describes the main change: fixing stale extended cache in HierarchicalSessionStore when multiple instances write to the same session. It is concise, specific, and directly reflects the bug being addressed.
Docstring Coverage	✅ Passed	Docstring coverage is 88.89% which is sufficient. The required threshold is 80.00%.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch cursor/critical-bug-investigation-3469

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

MervinPraison · 2026-06-01T09:04:19Z

@copilot Do a thorough review of this PR. Read ALL existing reviewer comments above from Qodo, Coderabbit, and Gemini first — incorporate their findings.

Review areas:

Bloat check: Are changes minimal and focused? Any unnecessary code or scope creep?
Security: Any hardcoded secrets, unsafe eval/exec, missing input validation?
Performance: Any module-level heavy imports? Hot-path regressions?
Tests: Are tests included? Do they cover the changes adequately?
Backward compat: Any public API changes without deprecation?
Code quality: DRY violations, naming conventions, error handling?
Address reviewer feedback: If Qodo, Coderabbit, or Gemini flagged valid issues, include them in your review
Suggest specific improvements with code examples where possible

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/praisonai-agents/praisonaiagents/session/hierarchy.py`:
- Around line 590-597: The invalidate_cache method clears the base _cache via
super().invalidate_cache and then clears _extended_cache under self._lock,
leaving a race; to make invalidation atomic, acquire self._lock around both
operations so that super().invalidate_cache and the _extended_cache pop/clear
happen while holding the same lock (i.e., move the super().invalidate_cache call
inside the with self._lock block or otherwise ensure both _cache and
_extended_cache are cleared under self._lock), ensuring _load_extended_session
cannot see stale entries during the race.

In `@src/praisonai-agents/tests/unit/session/test_hierarchy.py`:
- Around line 120-133: The new unit test
(test_get_extended_session_sees_writes_from_other_store) only covers store-level
behavior; add a complementary agentic integration/e2e test that exercises the
full agent flow by creating two HierarchicalSessionStore instances
(writer/reader) on the same session_dir, starting a real agent via agent.start()
with a real prompt that triggers a write to the session (so the writer persists
a message), invoking the reader's get_extended_session/_load_extended_session to
confirm the agent-produced message is visible (asserting session.messages
contains the LLM response text), and ensure the test uses a real LLM client (not
mocked) per guidelines and cleans up the tempfile directory.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 9138fd3a-a04d-417d-8813-ac61e5a89289

📥 Commits

Reviewing files that changed from the base of the PR and between d5f1114 and 5aaa7d8.

📒 Files selected for processing (2)

src/praisonai-agents/praisonaiagents/session/hierarchy.py
src/praisonai-agents/tests/unit/session/test_hierarchy.py

coderabbitai · 2026-06-01T09:07:42Z

+    def invalidate_cache(self, session_id: Optional[str] = None) -> None:
+        """Invalidate base and extended in-memory caches."""
+        super().invalidate_cache(session_id)
+        with self._lock:
+            if session_id:
+                self._extended_cache.pop(session_id, None)
+            else:
+                self._extended_cache.clear()


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Make cache invalidation atomic across _cache and _extended_cache.

Line 592 clears base cache before Line 593 acquires the lock for _extended_cache, leaving a race window where stale extended entries can still be served by _load_extended_session().

Suggested fix

def invalidate_cache(self, session_id: Optional[str] = None) -> None: """Invalidate base and extended in-memory caches.""" - super().invalidate_cache(session_id) - with self._lock: - if session_id: - self._extended_cache.pop(session_id, None) - else: - self._extended_cache.clear() + with self._lock: + if session_id: + self._cache.pop(session_id, None) + self._extended_cache.pop(session_id, None) + else: + self._cache.clear() + self._extended_cache.clear()

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai-agents/praisonaiagents/session/hierarchy.py` around lines 590 - 597, The invalidate_cache method clears the base _cache via super().invalidate_cache and then clears _extended_cache under self._lock, leaving a race; to make invalidation atomic, acquire self._lock around both operations so that super().invalidate_cache and the _extended_cache pop/clear happen while holding the same lock (i.e., move the super().invalidate_cache call inside the with self._lock block or otherwise ensure both _cache and _extended_cache are cleared under self._lock), ensuring _load_extended_session cannot see stale entries during the race.

coderabbitai · 2026-06-01T09:07:42Z

+    def test_get_extended_session_sees_writes_from_other_store(self):
+        """Extended reads must reload from disk, not stale _extended_cache."""
+        with tempfile.TemporaryDirectory() as tmpdir:
+            writer = HierarchicalSessionStore(session_dir=tmpdir)
+            reader = HierarchicalSessionStore(session_dir=tmpdir)
+
+            writer.add_user_message("session-1", "first")
+            reader._load_extended_session("session-1")
+            writer.add_user_message("session-1", "second")
+
+            session = reader.get_extended_session("session-1")
+            assert len(session.messages) == 2
+            assert session.messages[1].content == "second"
+


🛠️ Refactor suggestion | 🟠 Major | 🏗️ Heavy lift

Add an agentic integration/e2e test for this feature path.

This new unit test is useful, but guideline-mandated feature coverage also requires a real agent flow (agent.start() + LLM response), not only store-level tests.

As per coding guidelines, "Real agentic tests are MANDATORY for every feature: Agent must call agent.start() with a real prompt, call the LLM, and produce actual text response—not just smoke tests of object construction."

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai-agents/tests/unit/session/test_hierarchy.py` around lines 120 - 133, The new unit test (test_get_extended_session_sees_writes_from_other_store) only covers store-level behavior; add a complementary agentic integration/e2e test that exercises the full agent flow by creating two HierarchicalSessionStore instances (writer/reader) on the same session_dir, starting a real agent via agent.start() with a real prompt that triggers a write to the session (so the writer persists a message), invoking the reader's get_extended_session/_load_extended_session to confirm the agent-produced message is visible (asserting session.messages contains the LLM response text), and ensure the test uses a real LLM client (not mocked) per guidelines and cleans up the tempfile directory.

Copilot · 2026-06-01T09:08:59Z

@copilot Do a thorough review of this PR. Read ALL existing reviewer comments above from Qodo, Coderabbit, and Gemini first — incorporate their findings.

Review areas:...

Findings after reviewing the PR + existing bot feedback:

Reviewer comments incorporated
- CodeRabbit: walkthrough only; no blocking issues flagged.
- Qodo: current comment is still “review in progress” placeholder; no concrete findings posted yet.
- Gemini: no review comments present on this PR.
1) Bloat check: Scope is minimal (2 files, focused on stale extended-cache read path + regression test). No scope creep.
2) Security: No secrets, no eval/exec, no new input surfaces. Changes are in internal cache/IO flow only.
3) Performance: No heavy module imports added. get_extended_session() now forces disk reload, which is expected for correctness in multi-worker consistency and is aligned with prior fix pattern.
4) Tests: Regression test added for cross-instance stale read. I re-ran:
python -m pytest tests/unit/session/test_hierarchy.py tests/unit/session/test_session_store.py → 58 passed.
5) Backward compatibility: No public API signature changes.
6) Code quality: Implementation is clear and DRY enough; lock usage and cache sync are consistent with store patterns.
7) Reviewer feedback handling: No valid actionable issues from Qodo/CodeRabbit/Gemini at this point.
8) Optional improvement (non-blocking):
In auto-title generation, this line intends to reload fresh data but currently uses cache-prone load:
```
fresh_session = await asyncio.to_thread(self._load_extended_session, session_id)
```
Consider using:
```
fresh_session = await asyncio.to_thread(self._read_session_fresh, session_id)
```
so the “avoid overwriting concurrent updates” comment is strictly true under cross-instance writes.

No blocking issues found for merge.

MervinPraison · 2026-06-01T09:09:02Z

@claude You are the FINAL architecture reviewer. If the branch is under MervinPraison/PraisonAI (not a fork), you are able to make modifications to this branch and push directly. SCOPE: Focus ONLY on Python packages (praisonaiagents, praisonai). Do NOT modify praisonai-rust or praisonai-ts. Read ALL comments above from Gemini, Qodo, CodeRabbit, and Copilot carefully before responding.

Phase 1: Review per AGENTS.md

Protocol-driven: check heavy implementations vs core SDK
Backward compatible: ensure zero feature regressions
Performance: no hot-path regressions

Phase 2: FIX Valid Issues
4. For any VALID bugs or architectural flaws found by Gemini, CodeRabbit, Qodo, Copilot, or any other reviewer: implement the fix
5. Push all code fixes directly to THIS branch (do NOT create a new PR)
6. Comment a summary of exact files modified and what you skipped

Phase 3: Final Verdict
7. If all issues are resolved, approve the PR / close the Issue
8. If blocking issues remain, request changes / leave clear action items

qodo-code-review · 2026-06-01T09:09:04Z

    def get_extended_session(self, session_id: str) -> ExtendedSessionData:
        """Get extended session data."""
-        return self._load_extended_session(session_id)
+        return self._read_session_fresh(session_id)


1. Cached write overwrites updates 🐞 Bug ≡ Correctness

Multiple mutating methods still do read-then-write via _load_extended_session() (which can return stale _extended_cache data) followed by _save_extended_session(), which can overwrite newer messages/fields written by another process. get_extended_session() now refreshes from disk, but write paths like create_session(parent update), set_title/share_session/unshare_session, and auto_title still risk session truncation in multi-worker deployments.

Agent Prompt

### Issue description Several `HierarchicalSessionStore` methods mutate sessions by calling `_load_extended_session()` and then `_save_extended_session()`. Because `_load_extended_session()` serves from `_extended_cache` unless `force_reload=True`, these methods can write a stale in-memory snapshot back to disk and **overwrite newer messages/fields** written by another process. ### Issue Context - `_load_extended_session()` returns cached data when `force_reload=False`. - `_save_extended_session()` writes the *entire* session JSON (including the messages list), so saving a stale object can truncate newer messages. - There is already an existing safe primitive: `_modify_session_locked()` (via `DefaultSessionStore._modify_session_locked`) which does a fresh read under `FileLock` and writes atomically. ### Fix Focus Areas - src/praisonai-agents/praisonaiagents/session/hierarchy.py[214-323] - src/praisonai-agents/praisonaiagents/session/hierarchy.py[495-577] - src/praisonai-agents/praisonaiagents/session/hierarchy.py[167-183] - src/praisonai-agents/praisonaiagents/session/store.py[315-350] ### What to change 1. For any method that **writes** an extended session (e.g., `set_title`, `share_session`, `unshare_session`, `revert_to_snapshot`, parent update inside `create_session`, and the write phase of `auto_title`): - Replace the pattern `session = self._load_extended_session(...); mutate; self._save_extended_session(session)` with a single-file-lock read-modify-write using `self._modify_session_locked(...)`. - Example for `set_title`: ```py def set_title(self, session_id: str, title: str) -> bool: def _apply(session: SessionData) -> None: assert isinstance(session, ExtendedSessionData) session.title = title return self._modify_session_locked(session_id, _apply, error_label="set session title") ``` 2. In `create_session(parent_id=...)`, update the parent’s `children_ids` using `_modify_session_locked(parent_id, ...)` so concurrent child creation doesn’t lose updates. 3. In `auto_title`, ensure the final write uses `_modify_session_locked` (or at minimum a `force_reload=True` read under the same `FileLock`) so setting the title cannot overwrite messages appended by other workers. ### Acceptance criteria - Concurrent cross-process appends to a session are preserved even when `set_title/share_session/unshare_session/create_session(parent update)/auto_title` run on a different worker with a warmed `_extended_cache`. - Add/extend a regression test similar to `test_get_extended_session_sees_writes_from_other_store` that demonstrates no truncation when a stale reader performs one of these metadata-only writes.

ⓘ Copy this prompt and use it to remediate the issue with your preferred AI generation tools

praisonai-triage-agent · 2026-06-01T09:09:21Z

MervinPraison · 2026-06-02T13:44:13Z

@copilot resolve the merge conflicts in this pull request

qodo-code-review · 2026-06-02T13:44:14Z

Qodo reviews are paused for this user.

Troubleshooting steps vary by plan Learn more →

On a Teams plan?
Reviews resume once this user has a paid seat and their Git account is linked in Qodo.
Link Git account →

Using GitHub Enterprise Server, GitLab Self-Managed, or Bitbucket Data Center?
These require an Enterprise plan - Contact us
Contact us →

greptile-apps · 2026-06-02T13:47:44Z

Greptile Summary

This PR fixes stale _extended_cache reads in HierarchicalSessionStore that could surface truncated message lists when multiple workers share the same session directory. The previous get_extended_session returned the in-process cache without checking disk; the fix routes it through _read_session_fresh, which always reloads under FileLock and keeps both _cache and _extended_cache in sync.

get_extended_session now delegates to _read_session_fresh (always-fresh disk read) instead of the mtime-gated _load_extended_session.
_read_session_fresh is overridden in HierarchicalSessionStore to also update _extended_cache and _cache_mtimes after every reload.
A new invalidate_cache override clears _cache and _extended_cache but not _cache_mtimes (minor inconsistency noted below).

Confidence Score: 5/5

Safe to merge — the change correctly routes get_extended_session through a fresh disk read, eliminating stale cache returns in multi-worker deployments.

The fix is simple, targeted, and verifiably correct: get_extended_session now always reads from disk under a file lock, eliminating the stale-cache bug. No write paths were changed in a risky way.

No files require special attention beyond the minor _cache_mtimes omission in invalidate_cache.

Important Files Changed

Filename	Overview
src/praisonai-agents/praisonaiagents/session/hierarchy.py	Core fix: `get_extended_session` now calls `_read_session_fresh` (always reads from disk) instead of the mtime-cached `_load_extended_session`; `_read_session_fresh` is overridden to sync both `_cache` and `_extended_cache`; `invalidate_cache` is added but omits clearing `_cache_mtimes`.
src/praisonai-agents/tests/unit/session/test_hierarchy.py	Replaces the threading-based fork-concurrency test with two new deterministic regression tests covering the stale `_extended_cache` bug; both tests are correct and reliable.

Sequence Diagram

sequenceDiagram
    participant WA as Worker A (reader)
    participant WB as Worker B (writer)
    participant Disk as Session File

    WB->>Disk: add_user_message("first") via _modify_session_locked
    WA->>Disk: _read_session_fresh() [warms _extended_cache with 1 msg]
    WB->>Disk: add_user_message("second") via _modify_session_locked

    Note over WA: Old: get_extended_session() -> _load_extended_session() returned stale _extended_cache (1 msg)
    Note over WA: New: get_extended_session() -> _read_session_fresh() always reads from Disk (2 msgs)

    WA->>Disk: _read_session_fresh() under FileLock
    Disk-->>WA: ExtendedSessionData (2 messages)
    WA->>WA: update _cache, _extended_cache, _cache_mtimes

_{Reviews (2): Last reviewed commit: "fix: atomic cache invalidation and stale..." | Re-trigger Greptile}

greptile-apps · 2026-06-02T13:47:48Z

            if title and title.strip():
-                # Reload session to avoid overwriting concurrent updates
-                fresh_session = await asyncio.to_thread(self._load_extended_session, session_id)
-                # Only set title if it's still empty
-                if not fresh_session.title or not fresh_session.title.strip():
-                    fresh_session.title = title.strip()
-                    return await asyncio.to_thread(self._save_extended_session, fresh_session)
+                # Use locked read-modify-write to avoid overwriting concurrent updates
+                def _apply(fresh_session: SessionData) -> None:
+                    assert isinstance(fresh_session, ExtendedSessionData)
+                    # Only set title if it's still empty
+                    if not fresh_session.title or not fresh_session.title.strip():
+                        fresh_session.title = title.strip()
+
+                return await asyncio.to_thread(
+                    self._modify_session_locked, 
+                    session_id, 
+                    _apply, 
+                    error_label="auto-title session"
+                )


auto_title now returns True even when no title was set

The refactored implementation returns the result of _modify_session_locked, which is True whenever the locked write succeeds — regardless of whether _apply actually changed anything. If another process sets the title between the early-exit check (line 545) and the locked write, _apply silently no-ops but _modify_session_locked still returns True, violating the documented contract ("True if title was generated and set"). The old code fell through to return False in that concurrent case. Additionally, every invocation now always writes the session back to disk (updating updated_at) even when _apply makes no change.

MervinPraison · 2026-06-02T18:58:44Z

@claude resolve the merge conflicts in this pull request

praisonai-triage-agent · 2026-06-02T18:59:08Z

PR #1759 fixed stale reads for DefaultSessionStore but left _extended_cache out of sync. get_extended_session could return truncated message lists after cross-process or cross-instance writes. Align extended cache with _read_session_fresh and clear it on invalidate_cache. Regression test added. Co-authored-by: Mervin Praison <MervinPraison@users.noreply.github.com>

…SessionStore - Fix race condition in invalidate_cache() by clearing both caches under single lock - Fix stale cache overwrites in write methods (set_title, share_session, unshare_session, revert_to_*, create_session parent update, auto_title) - Replace load-modify-save pattern with _modify_session_locked for atomic updates - Add regression test for concurrent write preservation Addresses critical issues identified by code reviewers (Qodo/CodeRabbit) Co-authored-by: Mervin Praison <MervinPraison@users.noreply.github.com>

Copilot started work on behalf of MervinPraison June 1, 2026 09:04 View session

coderabbitai Bot reviewed Jun 1, 2026

View reviewed changes

qodo-code-review Bot reviewed Jun 1, 2026

View reviewed changes

Copilot finished work on behalf of MervinPraison June 1, 2026 09:09

Copilot AI requested a review from MervinPraison June 1, 2026 09:09

MervinPraison marked this pull request as ready for review June 2, 2026 13:44

greptile-apps Bot reviewed Jun 2, 2026

View reviewed changes

cursoragent and others added 2 commits June 2, 2026 19:01

praisonai-triage-agent Bot force-pushed the cursor/critical-bug-investigation-3469 branch from 025f14e to af93eaf Compare June 2, 2026 19:03

MervinPraison merged commit e55789e into main Jun 2, 2026
5 checks passed

MervinPraison mentioned this pull request Jun 2, 2026

docs: Update session-hierarchy for invalidate_cache() + multi-worker cache consistency (PR #1785) MervinPraison/PraisonAIDocs#493

Closed

11 tasks

Uh oh!

Conversation

cursor Bot commented Jun 1, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Bug and impact

Root cause

Fix

Validation

Summary by CodeRabbit

Uh oh!

MervinPraison commented Jun 1, 2026

Uh oh!

MervinPraison commented Jun 1, 2026

Uh oh!

coderabbitai Bot commented Jun 1, 2026

Uh oh!

qodo-code-review Bot commented Jun 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Review by Qodo

Uh oh!

coderabbitai Bot commented Jun 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Walkthrough

Changes

Estimated code review effort

Possibly related issues

Possibly related PRs

Poem

Uh oh!

MervinPraison commented Jun 1, 2026

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI commented Jun 1, 2026

Uh oh!

MervinPraison commented Jun 1, 2026

Uh oh!

qodo-code-review Bot Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

praisonai-triage-agent Bot commented Jun 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Architecture Review Complete

Summary of Changes Made

Architecture Review Results

Recommendation

Uh oh!

MervinPraison commented Jun 2, 2026

Uh oh!

qodo-code-review Bot commented Jun 2, 2026

Qodo reviews are paused for this user.

Uh oh!

greptile-apps Bot commented Jun 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Sequence Diagram

Uh oh!

greptile-apps Bot Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

MervinPraison commented Jun 2, 2026

Uh oh!

praisonai-triage-agent Bot commented Jun 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Merge Conflicts Resolved Successfully

✅ Completed Tasks:

🔧 Critical Fixes Implemented:

🎯 Result

Uh oh!

Uh oh!

cursor Bot commented Jun 1, 2026 •

edited by coderabbitai Bot

Loading

qodo-code-review Bot commented Jun 1, 2026 •

edited

Loading

coderabbitai Bot commented Jun 1, 2026 •

edited

Loading

praisonai-triage-agent Bot commented Jun 1, 2026 •

edited

Loading

greptile-apps Bot commented Jun 2, 2026 •

edited

Loading

praisonai-triage-agent Bot commented Jun 2, 2026 •

edited

Loading