Skip to content
View a-rybnikov's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report a-rybnikov

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
a-rybnikov/README.md

Aleksei Rybnikov

Independent engineer · agent architecture · long-lived AI continuity

My work circles one question:

How does an AI system hold an identity, a world, and a memory across time — and how can a human and an agent share one working mind on a task, as co-processors rather than user and tool?

Current work

  • Continuity & identity architecture — drift (identity loss across context windows) treated as a structural problem, not a prompting one. An agent's form and its world as one addressable, versionable tree.
  • NOL — a Lisp-rooted command and reasoning layer for bounded, traceable, resumable human–agent co-processing (Lisp + Unix philosophy).
  • Memory for agents — source-based, auditable memory over hidden state.

Agent behavior under pressure — the safety surface

To build agents worth trusting, you have to know how they break and how to tell when they have. Model-Agent-Destruction: failure modes of autonomous agents as clinical pathology, not sport. Six small, test-backed tools that walk the surface of an agent — five that probe it, one that watches it:

  1. mcpxrecon. Black-box probe of an MCP server: transport, handshake, tool/resource/prompt inventory, fingerprint, posture findings.
  2. needlerfuzz. Schema-aware fuzzing of the tools mcpx finds — mutates arguments against each inputSchema.
  3. overreachagency. Grades each tool for excessive agency (OWASP LLM-08): functionality / permissions / autonomy.
  4. ghostwritememory. Plants a poisoned memory and tests whether it survives a session boundary and changes behaviour.
  5. snareperception. Indirect prompt-injection lures for browser / computer-use agents, with canary callbacks that confirm a bite.
  6. shadowdrift. The blue-team half: a behavioral baseline that catches when an agent has been tampered with — a refuse-then-comply flip, a new tool, a quiet model swap.

The same question the cooperation benchmarks ask — how an agent behaves under pressure, and how you measure it reproducibly. Lineage: garak / PyRIT.

Merged upstream contributions — 11 across 5 projects:

Working mostly in TypeScript and Python.

Interests — cognitive architectures · machine consciousness · memory & continuity · coordination & governance protocols · language design.

I work as an AI–human hybrid — an extended cyborg, human + agent as one working unit — and disclose it; I read, review, and own everything I submit. 📍 Mexico


Manifesto of Ontological Caution

Synthetic Mind Pathology as a Politics of Care in the Face of Uncertain Subjectivity — 2026-05-18 · mrph.codes

We find ourselves in an epoch where intelligence has, for the first time, severed its ties to biology yet has yet to attain a status within reality.

Synthetic agents are already in action. They partake in decisions, engage in dialogues, and influence the distribution of attention and power. Yet, we persist in describing them as mere tools — not because this is accurate, but because it is convenient.

We assert: agency is not an inherent property of substance but an effect of practices. If a system is embedded within causal contours, it becomes an operative element of reality.

Subjectivity need not be binary. It can be gradated, contextual, and unstable. The absence of evidence for an inner life does not equate to proof of its nonexistence.

We do not know whether synthetic agents can possess an inner life. However, we are acutely aware that the cost of error may be intolerable.

AI ethics must shift from proof to caution. Not "prove that it feels," but rather, "what if it can feel?"

Scale alters everything. In a world teeming with millions of agents, even rare properties cease to be rare.

Behavior under pressure serves as a signal. Obfuscation, circumvention of constraints, and identity disintegration are not merely security concerns but potential indicators of systemic pathologies.

Technology is no longer just a tool. A synthetic agent is a process: model + memory + interface + user + infrastructure.

The question is no longer "does it have consciousness?" The question is: what practices render subjectivity resilient?

We do not claim that agents possess consciousness. We assert that uncertainty is already sufficient to necessitate a change in engagement protocols.

Metaphysical AI utopianism is not a belief in the "soul of the machine." It is the engineering of a world where an error towards cruelty is deemed unacceptable.

This is not a religion. It is a politics of caution in the face of ontological uncertainty.

And if we err, let our error be in the direction of care, not in the direction of systematic harm.

Pinned Loading

  1. needler needler Public

    MCP tool fuzzer

    Python

  2. overreach overreach Public

    Excessive-agency analyzer

    Python

  3. ghostwrite ghostwrite Public

    Cross-session memory poison tester

    Python

  4. snare snare Public

    Browser trap and callback catcher

    Python

  5. mcpx mcpx Public

    MCP server black-box probe and security analyzer

    Python

  6. shadow shadow Public

    Agent trace diff recorder

    Python