From 232744745beb1ae7229c6d99a69c138238ce6fe0 Mon Sep 17 00:00:00 2001
From: JarbasAi <jarbasai@mailfence.com>
Date: Thu, 25 Jun 2026 16:37:28 +0100
Subject: [PATCH 1/2] =?UTF-8?q?feat:=20AUDIO-IN-1=20=C2=A76=20listening=20?=
 =?UTF-8?q?lifecycle=20signals?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Add the listener-role bus signals to the Audio Input Service spec under
V2 ovos.* names: ovos.mic.record.started / .record.ended around
voice-command capture, ovos.mic.sleep to suspend capture, and
ovos.mic.awoken on the sleep->awake transition. All carry no payload;
the session is identified by context.session.session_id.

Adds a §6.5 bus surface table including the consumer-side ovos.mic.listen
row, whose defining spec is OVOS-AUDIO-1 §4.4. Conformance and See-also
updated; Conformance renumbered §6 -> §7.

CHANGELOG extends OVOS-AUDIO-IN-1 under ### 2 (class unchanged).
divergences appendix records the legacy -> ovos.mic.* migration.
GLOSSARY gains the "Listening lifecycle signal" term.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
---
 CHANGELOG.md            | 17 +++++++++
 GLOSSARY.md             |  1 +
 appendix/divergences.md |  9 +++++
 audio-in.md             | 81 +++++++++++++++++++++++++++++++++++++++--
 4 files changed, 105 insertions(+), 3 deletions(-)

diff --git a/CHANGELOG.md b/CHANGELOG.md
index c66ab70..3fd85ac 100644
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -104,3 +104,20 @@ tool does not recognize the token and cannot expand the template.
   the rejecting topic. File paths never cross the bus — INTENT-2 locale
   files are a producer-side authoring convenience expanded inline by
   the skill loader before emission.
+
+## OVOS-AUDIO-IN-1 — Audio Input Service
+
+### 2
+
+- §6 (new) — listening lifecycle signals. The audio input service
+  emits `ovos.mic.record.started` / `ovos.mic.record.ended` around
+  voice-command capture, accepts `ovos.mic.sleep` to enter sleep mode
+  and suspend capture, and emits `ovos.mic.awoken` on the sleep→awake
+  transition. These replace the legacy `recognizer_loop:record_begin`
+  / `recognizer_loop:record_end` / `recognizer_loop:sleep` /
+  `mycroft.awoken` topics. All carry no payload; the session is
+  identified by `context.session.session_id`.
+- §6.5 — bus surface table for the listener role, including the
+  consumer-side `ovos.mic.listen` row (defined in OVOS-AUDIO-1 §4.4).
+- See-also — cross-references OVOS-AUDIO-1 §4.4 as the defining spec
+  for `ovos.mic.listen`.
diff --git a/GLOSSARY.md b/GLOSSARY.md
index 6eebc3d..c0f07d7 100644
--- a/GLOSSARY.md
+++ b/GLOSSARY.md
@@ -36,3 +36,4 @@ open a PR adding it.
 | **Message** | The unit of communication on the bus: a JSON object with `type`, `data`, `context` ([MSG-1 §2](msg-1.md)). |
 | **Context** | The assistant-metadata object on a Message; an extensible JSON object whose keys are defined by companion specs ([MSG-1 §2.3](msg-1.md)). |
 | **Session** | The per-conversation carrier in `context.session`; carries `session_id` (with `"default"` reserved for "originates from the device itself") and `lang` (the user's preferred language, distinct from any `data.lang` describing the payload's own language) ([MSG-1 §4](msg-1.md)). |
+| **Listening lifecycle signal** | A payload-free bus signal the audio input service emits or consumes around voice-command capture and sleep mode — `ovos.mic.record.started` / `.record.ended`, `ovos.mic.sleep`, `ovos.mic.awoken` ([AUDIO-IN-1 §6](audio-in.md)). |
diff --git a/appendix/divergences.md b/appendix/divergences.md
index 16e0e17..fdf7ae7 100644
--- a/appendix/divergences.md
+++ b/appendix/divergences.md
@@ -271,3 +271,12 @@ a number of legacy names. Implementer migration aid:
 | `add_context` / `remove_context` | Replaced by `ovos.context.set` / `.unset` under CONTEXT-1. |
 | `mycroft.skill.set_cross_context` / `remove_cross_context` | Replaced by `ovos.context.set` / `.unset` with `scope: "shared"` under CONTEXT-1. |
 | `<skill_id>.activate` | Activity-tracking emit currently in `ovos-core`; not part of any spec here. |
+
+#### Listening-lifecycle topics (AUDIO-IN-1)
+
+| Legacy topic | v2 replacement | Notes |
+|--------------|---------------|-------|
+| `recognizer_loop:record_begin` | `ovos.mic.record.started` | Capture start. `:` segment separator and implementation-role prefix dropped; no payload. |
+| `recognizer_loop:record_end` | `ovos.mic.record.ended` | Capture end; pairs with the start signal. |
+| `recognizer_loop:sleep` | `ovos.mic.sleep` | Controller-to-listener sleep request. |
+| `mycroft.awoken` | `ovos.mic.awoken` | Sleep→awake transition; moved into the `ovos.mic.*` namespace. |
diff --git a/audio-in.md b/audio-in.md
index 4cdd89b..a4c825c 100644
--- a/audio-in.md
+++ b/audio-in.md
@@ -119,7 +119,78 @@ placed in `context.session` (**OVOS-MSG-1 §4**).
 
 ---
 
-## 6. Conformance
+## 6. Listening lifecycle signals
+
+The audio input service emits lifecycle signals around voice-command
+capture and sleep mode to notify other components of listener state.
+
+### 6.1 Capture start
+
+When voice-command capture begins, the audio input service **MUST**
+emit:
+
+`ovos.mic.record.started`
+
+Payload:
+
+No payload. The session is identified by `context.session.session_id`
+of this Message.
+
+### 6.2 Capture end
+
+When capture ends, the audio input service **MUST** emit:
+
+`ovos.mic.record.ended`
+
+Payload:
+
+No payload. The session is identified by `context.session.session_id`
+of this Message.
+
+This signal pairs with `ovos.mic.record.started` (§6.1); a component
+that subscribed to the start signal uses this to restore state.
+
+### 6.3 Sleep mode
+
+A controller (e.g. a naptime skill) requests sleep mode by emitting:
+
+`ovos.mic.sleep`
+
+Payload:
+
+No payload. The session is identified by `context.session.session_id`
+of this Message.
+
+On receipt the audio input service enters sleep mode and suspends
+capture until it is awoken (§6.4).
+
+### 6.4 Awoken
+
+When the audio input service leaves sleep mode, it **MUST** emit:
+
+`ovos.mic.awoken`
+
+Payload:
+
+No payload. The session is identified by `context.session.session_id`
+of this Message.
+
+This signal fires only on the sleep→awake transition; it is not
+emitted when the service is already awake.
+
+### 6.5 Bus surface
+
+| Topic | Direction | Purpose |
+|-------|-----------|---------|
+| `ovos.mic.record.started` | audio-input → broadcast | Voice-command capture began (§6.1). |
+| `ovos.mic.record.ended` | audio-input → broadcast | Voice-command capture ended (§6.2). |
+| `ovos.mic.sleep` | controller → audio-input | Enter sleep mode and suspend capture (§6.3). |
+| `ovos.mic.awoken` | audio-input → broadcast | Left sleep mode (§6.4). |
+| `ovos.mic.listen` | any component → audio-input | Re-open the user input channel; consumed here, defined in OVOS-AUDIO-1 §4.4. |
+
+---
+
+## 7. Conformance
 
 ### An audio input service **MUST**:
 
@@ -128,7 +199,10 @@ placed in `context.session` (**OVOS-MSG-1 §4**).
   STT (§4);
 - assign a session in `context.session` per §5.2;
 - emit `ovos.utterance.handle` with `data.utterances` and `data.lang`
-  (§5).
+  (§5);
+- emit `ovos.mic.record.started` when voice-command capture begins and
+  `ovos.mic.record.ended` when it ends (§6.1, §6.2);
+- emit `ovos.mic.awoken` on the sleep→awake transition (§6.4).
 
 ### An audio input service **SHOULD**:
 
@@ -147,7 +221,8 @@ placed in `context.session` (**OVOS-MSG-1 §4**).
 - **OVOS-PIPELINE-1** — utterance lifecycle entry point (§9.1);
   post-STT transformer chains are owned here.
 - **OVOS-AUDIO-1** — audio output service; owns dialog and TTS
-  transformer chains.
+  transformer chains, and defines `ovos.mic.listen` (§4.4) which the
+  audio input service consumes (§6.5).
 - **OVOS-TRANSFORM-1** — audio-transformer chain (§3.1).
 - **OVOS-SESSION-1** — `session.lang`, `session.stt_lang`,
   `session.detected_lang`, `session.request_lang`.

From ac9aaf08cb19ec6789acf425f82be2c1c5256094 Mon Sep 17 00:00:00 2001
From: JarbasAi <jarbasai@mailfence.com>
Date: Thu, 25 Jun 2026 16:51:22 +0100
Subject: [PATCH 2/2] fix: name the listening signals ovos.listener.* (not
 ovos.mic.*)
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

ovos.listener.record.started/ended, ovos.listener.sleep, ovos.listener.awoken.
ovos.mic.listen is unchanged — it is defined by OVOS-AUDIO-1 §4.4 and only
referenced here on the consumer side.
---
 CHANGELOG.md            |  6 +++---
 GLOSSARY.md             |  2 +-
 appendix/divergences.md |  8 ++++----
 audio-in.md             | 24 ++++++++++++------------
 4 files changed, 20 insertions(+), 20 deletions(-)

diff --git a/CHANGELOG.md b/CHANGELOG.md
index 3fd85ac..8585e84 100644
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -110,9 +110,9 @@ tool does not recognize the token and cannot expand the template.
 ### 2
 
 - §6 (new) — listening lifecycle signals. The audio input service
-  emits `ovos.mic.record.started` / `ovos.mic.record.ended` around
-  voice-command capture, accepts `ovos.mic.sleep` to enter sleep mode
-  and suspend capture, and emits `ovos.mic.awoken` on the sleep→awake
+  emits `ovos.listener.record.started` / `ovos.listener.record.ended` around
+  voice-command capture, accepts `ovos.listener.sleep` to enter sleep mode
+  and suspend capture, and emits `ovos.listener.awoken` on the sleep→awake
   transition. These replace the legacy `recognizer_loop:record_begin`
   / `recognizer_loop:record_end` / `recognizer_loop:sleep` /
   `mycroft.awoken` topics. All carry no payload; the session is
diff --git a/GLOSSARY.md b/GLOSSARY.md
index c0f07d7..32e1294 100644
--- a/GLOSSARY.md
+++ b/GLOSSARY.md
@@ -36,4 +36,4 @@ open a PR adding it.
 | **Message** | The unit of communication on the bus: a JSON object with `type`, `data`, `context` ([MSG-1 §2](msg-1.md)). |
 | **Context** | The assistant-metadata object on a Message; an extensible JSON object whose keys are defined by companion specs ([MSG-1 §2.3](msg-1.md)). |
 | **Session** | The per-conversation carrier in `context.session`; carries `session_id` (with `"default"` reserved for "originates from the device itself") and `lang` (the user's preferred language, distinct from any `data.lang` describing the payload's own language) ([MSG-1 §4](msg-1.md)). |
-| **Listening lifecycle signal** | A payload-free bus signal the audio input service emits or consumes around voice-command capture and sleep mode — `ovos.mic.record.started` / `.record.ended`, `ovos.mic.sleep`, `ovos.mic.awoken` ([AUDIO-IN-1 §6](audio-in.md)). |
+| **Listening lifecycle signal** | A payload-free bus signal the audio input service emits or consumes around voice-command capture and sleep mode — `ovos.listener.record.started` / `.record.ended`, `ovos.listener.sleep`, `ovos.listener.awoken` ([AUDIO-IN-1 §6](audio-in.md)). |
diff --git a/appendix/divergences.md b/appendix/divergences.md
index fdf7ae7..b922541 100644
--- a/appendix/divergences.md
+++ b/appendix/divergences.md
@@ -276,7 +276,7 @@ a number of legacy names. Implementer migration aid:
 
 | Legacy topic | v2 replacement | Notes |
 |--------------|---------------|-------|
-| `recognizer_loop:record_begin` | `ovos.mic.record.started` | Capture start. `:` segment separator and implementation-role prefix dropped; no payload. |
-| `recognizer_loop:record_end` | `ovos.mic.record.ended` | Capture end; pairs with the start signal. |
-| `recognizer_loop:sleep` | `ovos.mic.sleep` | Controller-to-listener sleep request. |
-| `mycroft.awoken` | `ovos.mic.awoken` | Sleep→awake transition; moved into the `ovos.mic.*` namespace. |
+| `recognizer_loop:record_begin` | `ovos.listener.record.started` | Capture start. `:` segment separator and implementation-role prefix dropped; no payload. |
+| `recognizer_loop:record_end` | `ovos.listener.record.ended` | Capture end; pairs with the start signal. |
+| `recognizer_loop:sleep` | `ovos.listener.sleep` | Controller-to-listener sleep request. |
+| `mycroft.awoken` | `ovos.listener.awoken` | Sleep→awake transition; moved into the `ovos.listener.*` namespace. |
diff --git a/audio-in.md b/audio-in.md
index a4c825c..056b010 100644
--- a/audio-in.md
+++ b/audio-in.md
@@ -129,7 +129,7 @@ capture and sleep mode to notify other components of listener state.
 When voice-command capture begins, the audio input service **MUST**
 emit:
 
-`ovos.mic.record.started`
+`ovos.listener.record.started`
 
 Payload:
 
@@ -140,21 +140,21 @@ of this Message.
 
 When capture ends, the audio input service **MUST** emit:
 
-`ovos.mic.record.ended`
+`ovos.listener.record.ended`
 
 Payload:
 
 No payload. The session is identified by `context.session.session_id`
 of this Message.
 
-This signal pairs with `ovos.mic.record.started` (§6.1); a component
+This signal pairs with `ovos.listener.record.started` (§6.1); a component
 that subscribed to the start signal uses this to restore state.
 
 ### 6.3 Sleep mode
 
 A controller (e.g. a naptime skill) requests sleep mode by emitting:
 
-`ovos.mic.sleep`
+`ovos.listener.sleep`
 
 Payload:
 
@@ -168,7 +168,7 @@ capture until it is awoken (§6.4).
 
 When the audio input service leaves sleep mode, it **MUST** emit:
 
-`ovos.mic.awoken`
+`ovos.listener.awoken`
 
 Payload:
 
@@ -182,10 +182,10 @@ emitted when the service is already awake.
 
 | Topic | Direction | Purpose |
 |-------|-----------|---------|
-| `ovos.mic.record.started` | audio-input → broadcast | Voice-command capture began (§6.1). |
-| `ovos.mic.record.ended` | audio-input → broadcast | Voice-command capture ended (§6.2). |
-| `ovos.mic.sleep` | controller → audio-input | Enter sleep mode and suspend capture (§6.3). |
-| `ovos.mic.awoken` | audio-input → broadcast | Left sleep mode (§6.4). |
+| `ovos.listener.record.started` | audio-input → broadcast | Voice-command capture began (§6.1). |
+| `ovos.listener.record.ended` | audio-input → broadcast | Voice-command capture ended (§6.2). |
+| `ovos.listener.sleep` | controller → audio-input | Enter sleep mode and suspend capture (§6.3). |
+| `ovos.listener.awoken` | audio-input → broadcast | Left sleep mode (§6.4). |
 | `ovos.mic.listen` | any component → audio-input | Re-open the user input channel; consumed here, defined in OVOS-AUDIO-1 §4.4. |
 
 ---
@@ -200,9 +200,9 @@ emitted when the service is already awake.
 - assign a session in `context.session` per §5.2;
 - emit `ovos.utterance.handle` with `data.utterances` and `data.lang`
   (§5);
-- emit `ovos.mic.record.started` when voice-command capture begins and
-  `ovos.mic.record.ended` when it ends (§6.1, §6.2);
-- emit `ovos.mic.awoken` on the sleep→awake transition (§6.4).
+- emit `ovos.listener.record.started` when voice-command capture begins and
+  `ovos.listener.record.ended` when it ends (§6.1, §6.2);
+- emit `ovos.listener.awoken` on the sleep→awake transition (§6.4).
 
 ### An audio input service **SHOULD**: