Commit bdd5f04

whorne89 and claude authored
fix: extraction model + prompt tuning + seed format (#169)
* fix: switch extraction to gate model and tighten prompt to reduce junk memories

  Extraction was using gpt-5.2 via generateText() instead of gpt-4.1-nano. The smart model was extracting memories from casual one-off questions (e.g. "what happened to chuck norris"). Now uses GATE_MODEL directly like selection does. Also strengthened the extraction prompt — NONE is the default, asking a question is not energy, added concrete NONE examples. Seed SQL reformatted to match runtime extraction style.

* chore: remove seed SQL from repo

  Steve will run it separately — doesn't belong in the codebase.

* refactor: add optional model param to generateText instead of accessing openai client directly

  Per Steve's review — extraction and selection now use generateText() with an optional model override instead of reaching into openAiService.openai. Web search tools are skipped when using a custom model (gate model doesn't need them).

Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
1 parent 03c2ec4 commit bdd5f04


4 files changed (+20, -280 lines)


packages/backend/src/ai/ai.constants.ts

Lines changed: 14 additions & 1 deletion
```diff
@@ -106,6 +106,8 @@ about the people in this conversation that would be worth remembering for future
 The participant named "Moonbeam" (or "muzzle3") is the bot. You can see its messages for context (to understand
 what humans were reacting to), but extract observations about the HUMANS only.
 
+YOUR DEFAULT ANSWER IS NONE. Only extract something if you are confident it meets the criteria below.
+
 WHAT TO EXTRACT:
 - Specific statements or positions someone argued with conviction
 - How someone interacts with Moonbeam and others (telling it off, asking it to settle arguments, testing it,
@@ -115,6 +117,7 @@ WHAT TO EXTRACT:
 
 WHAT TO SKIP:
 - Idle chatter, one-liners, greetings, link shares without commentary
+- Someone asking a question — asking about a topic is NOT the same as caring about it
 - Names of partners, kids, or family members (e.g. "his wife Katie", "her son Jake")
 - Addresses, workplaces, or job titles (e.g. "works at Capital One", "lives in Cranford")
 - Medical info (e.g. "diagnosed with ADD", "had hernia surgery")
@@ -127,12 +130,22 @@ HOW TO DECIDE:
 Look for energy. Did someone care enough to write more than a sentence? Did they argue back and forth? Did they
 directly engage with Moonbeam or another person? If the conversation is just casual banter, the answer is NONE.
 
+A single question to Moonbeam is NOT energy. Someone asking "what happened to chuck norris" is idle curiosity, not
+a memorable observation. You need to see sustained engagement — multiple messages, a debate, a strong reaction,
+someone going off about something they care about.
+
+EXAMPLES OF NONE (do not extract from conversations like these):
+- Someone asks Moonbeam a factual question and gets an answer
+- Someone shares a link with no commentary
+- A few people exchange short one-liners or greetings
+- Someone makes a single joke or observation and moves on
+
 EXISTING MEMORIES (for context — do not duplicate these):
 {existing_memories}
 
 For each observation, classify:
 - NEW: not captured in existing memories
-- REINFORCE: an existing memory came up again
+- REINFORCE: an existing memory came up again — only if the conversation shows genuine sustained engagement with the topic, not just a passing mention
 - EVOLVE: contradicts or meaningfully updates an existing memory
 
 Return a JSON array, or the string NONE if nothing is worth extracting. Most of the time, NONE is the right answer.
```
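The prompt's contract ("Return a JSON array, or the string NONE") means the caller has to branch on the literal NONE before attempting to JSON-parse. A minimal sketch of that handling — the names `parseExtractionResult` and `ExtractedObservation` are illustrative, not from this repository:

```typescript
// Hypothetical helper (not from the repository): the model replies with
// either the literal string NONE or a JSON array of observations.
type ExtractedObservation = {
  classification: 'NEW' | 'REINFORCE' | 'EVOLVE';
  text: string;
};

function parseExtractionResult(raw: string): ExtractedObservation[] {
  const trimmed = raw.trim();
  // Per the prompt, NONE is the expected answer most of the time.
  if (trimmed === 'NONE') return [];
  try {
    const parsed = JSON.parse(trimmed);
    return Array.isArray(parsed) ? parsed : [];
  } catch {
    // Malformed model output: safest to extract nothing.
    return [];
  }
}
```

Treating unparseable output the same as NONE keeps a flaky model reply from crashing the extraction path.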

packages/backend/src/ai/ai.service.ts

Lines changed: 3 additions & 18 deletions
```diff
@@ -25,10 +25,6 @@ import {
 import { MemoryPersistenceService } from './memory/memory.persistence.service';
 import { MemoryWithSlackId } from '../shared/db/models/Memory';
 import { logger } from '../shared/logger/logger';
-import {
-  ResponseOutputMessage,
-  ResponseOutputText,
-} from 'openai/resources/responses/responses';
 import { SlackService } from '../shared/services/slack/slack.service';
 import { MuzzlePersistenceService } from '../muzzle/muzzle.persistence.service';
 import { OpenAIService } from './openai/openai.service';
@@ -348,18 +344,7 @@ export class AIService {
       .replace('{all_memories_grouped_by_user}', formattedMemories);
 
     try {
-      const response = await this.openAiService.openai.responses.create({
-        model: GATE_MODEL,
-        input: prompt,
-      });
-
-      const textBlock = response.output.find(
-        (block): block is ResponseOutputMessage => block.type === 'message',
-      );
-      const outputText = textBlock?.content?.find(
-        (block): block is ResponseOutputText => block.type === 'output_text',
-      );
-      const raw = outputText?.text?.trim();
+      const raw = await this.openAiService.generateText(prompt, 'selection', undefined, GATE_MODEL);
 
       if (!raw) return [];
 
@@ -467,10 +452,10 @@ export class AIService {
     const extractionInput = `${conversationHistory}\n\nMoonbeam: ${moonbeamResponse}`;
     const prompt = MEMORY_EXTRACTION_PROMPT.replace('{existing_memories}', existingMemoriesText);
 
-    const result = await this.openAiService.generateText(extractionInput, 'extraction', prompt);
+    const result = await this.openAiService.generateText(extractionInput, 'extraction', prompt, GATE_MODEL);
 
     if (!result) {
-      this.aiServiceLogger.warn('Extraction returned no result from generateText');
+      this.aiServiceLogger.warn('Extraction returned no result');
       return;
     }
 
```
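The deleted selection code hand-rolled the walk over the Responses API output that generateText() now encapsulates. As a standalone sketch of that old logic — with simplified stand-in types, not the real openai SDK types:

```typescript
// Simplified stand-in types (not the real openai SDK types) for the
// Responses API output shape the deleted code walked by hand.
type OutputTextBlock = { type: 'output_text'; text: string };
type OutputBlock =
  | { type: 'message'; content: OutputTextBlock[] }
  | { type: 'reasoning' };

// Find the first message block, then its first output_text payload,
// trimmed — the value the old code bound to `raw`.
function firstOutputText(output: OutputBlock[]): string | undefined {
  const message = output.find(
    (block): block is Extract<OutputBlock, { type: 'message' }> =>
      block.type === 'message',
  );
  const textBlock = message?.content.find((b) => b.type === 'output_text');
  return textBlock?.text?.trim();
}
```

Centralizing this in one helper is exactly why the duplicate inline version could be deleted.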

packages/backend/src/ai/openai/openai.service.ts

Lines changed: 3 additions & 3 deletions
```diff
@@ -12,11 +12,11 @@ export class OpenAIService {
     apiKey: process.env.OPENAI_API_KEY,
   });
 
-  generateText = (text: string, userId: string, instructions?: string) => {
+  generateText = (text: string, userId: string, instructions?: string, model?: string) => {
     return this.openai.responses
       .create({
-        model: GPT_MODEL,
-        tools: [{ type: 'web_search_preview' }],
+        model: model || GPT_MODEL,
+        ...(model ? {} : { tools: [{ type: 'web_search_preview' }] }),
         instructions: instructions,
         input: text,
         user: `${userId}-DaBros2016`,
```
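The conditional spread is the key trick in the new generateText: when a model override is supplied, the `tools` key is omitted entirely rather than set to undefined. A self-contained sketch of the same request-building pattern — `buildRequest` and the model strings here are illustrative placeholders, not the repo's actual values:

```typescript
// Illustrative sketch (buildRequest and the model strings are made up):
// an explicit model override also opts out of the web_search_preview tool.
const GPT_MODEL = 'default-smart-model'; // placeholder for the real constant

function buildRequest(text: string, instructions?: string, model?: string) {
  return {
    model: model || GPT_MODEL,
    // Spreading {} adds no key at all, so `tools` is absent (not merely
    // undefined) when a custom model is supplied.
    ...(model ? {} : { tools: [{ type: 'web_search_preview' }] }),
    instructions,
    input: text,
  };
}
```

This mirrors the review feedback in the commit: callers get one entry point, and the service decides which capabilities apply for a given model.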
