Skip to content

[SLES-2826] Reproducing AAS Restart Named Pipe Overlap Issue [Do Not Merge] #11368

Draft
Lewis-E wants to merge 1 commit into
masterfrom
lewis.lewis/aas-repro-diagnostics
Draft

[SLES-2826] Reproducing AAS Restart Named Pipe Overlap Issue [Do Not Merge] #11368
Lewis-E wants to merge 1 commit into
masterfrom
lewis.lewis/aas-repro-diagnostics

Conversation

@Lewis-E
Copy link
Copy Markdown

@Lewis-E Lewis-E commented May 13, 2026

Add ProcessSupervisor and NamedPipeHealthCheck diagnostics for SLES-2826

Adds [aas-repro] INFO-level log lines to make the pipe-collision failure mode observable without DD_TRACE_DEBUG=true:

ProcessSupervisor:

  • Health state transitions logged at INFO on every change
  • Spawn logged with jvm_pid for cross-process correlation
  • Process start logged with child_pid and faults_reset_to_0
  • Process exit logged with exit code and resulting health state
  • Terminal fault break logged with fault count

ExternalAgentLauncher:

  • Supervisor creation logged with path, pipe name, and jvm_pid

NamedPipeHealthCheck (previously silent):

  • First-check entry logged with pipe name
  • NEVER_CHECKED → READY_TO_START logged with retry count
  • NEVER_CHECKED → HEALTHY logged with warning (orphan risk)
  • Ongoing HEALTHY → READY_TO_START logged when pipe disappears

What Does This Do

Motivation

Additional Notes

Contributor Checklist

Jira ticket: [PROJ-IDENT]

Note: Once your PR is ready to merge, add it to the merge queue by commenting /merge. /merge -c cancels the queue request. /merge -f --reason "reason" skips all merge queue checks; please use this judiciously, as some checks do not run at the PR-level. For more information, see this doc.

…s for SLES-2826

Adds [aas-repro] INFO-level log lines to make the pipe-collision failure
mode observable without DD_TRACE_DEBUG=true:

ProcessSupervisor:
- Health state transitions logged at INFO on every change
- Spawn logged with jvm_pid for cross-process correlation
- Process start logged with child_pid and faults_reset_to_0
- Process exit logged with exit code and resulting health state
- Terminal fault break logged with fault count

ExternalAgentLauncher:
- Supervisor creation logged with path, pipe name, and jvm_pid

NamedPipeHealthCheck (previously silent):
- First-check entry logged with pipe name
- NEVER_CHECKED → READY_TO_START logged with retry count
- NEVER_CHECKED → HEALTHY logged with warning (orphan risk)
- Ongoing HEALTHY → READY_TO_START logged when pipe disappears

These changes are repro-only and do not ship to customers.

Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>
@Lewis-E Lewis-E changed the title [aas-repro] Add ProcessSupervisor and NamedPipeHealthCheck diagnostic… [SLES-2826] [Do Not Merge] Reproducing AAS Restart Named Pipe Overlap Issue May 13, 2026
@Lewis-E Lewis-E changed the title [SLES-2826] [Do Not Merge] Reproducing AAS Restart Named Pipe Overlap Issue [SLES-2826] Reproducing AAS Restart Named Pipe Overlap Issue [Do Not Merge] May 13, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant