Move talk details for May 5, 2026

RSTZZZ · web-flow · commit 3054dbfee061 · 2026-05-07T10:51:58.000-04:00
Move  the entry for the talk on 'Consistently Simulating Human Personas with Multi-Turn Reinforcement Learning', including presenter details and abstract.
diff --git a/stamina/index.html b/stamina/index.html
@@ -190,37 +190,7 @@ <h4>[DATE Y/M/D]</h4>
           --><!-- END TALK TEMPLATE -->
           
             <br>    
-
-            <h4>2026/05/05</h4>
-            <li>
-              <b><a href="[PAPER LINK]">Consistently Simulating Human Personas with Multi-Turn Reinforcement Learning</a></b>
-              <br>
-              Presenter: <u><a href="https://abdulhaim.github.io/" target="_blank" rel="noopener noreferrer">Marwa Abdulhai</a></u>, UC Berkeley AI Research (BAIR) Lab
-              <a class="btn btn-info btn-xs" data-toggle="collapse" href="#20260505-bio" role="button" aria-expanded="false">
-                Speaker Bio
-              </a>
-              <div class="collapse" id="20260505-bio">
-                <div class="card card-body">
-                  Marwa Abdulhai is a PhD candidate at UC Berkeley advised by Sergey Levine. Her research focuses on enabling AI agents to better understand people and their interactions to build both safe and more AI capable systems. This includes improving the performance of existing large language models (LLMs) for multi-turn dialogue interactions, understanding how to protect against deception in AI systems, and exploring how AI can serve as a useful tool for social science research. Her research has been supported by the Quad Fellowship, AI Policy Hub, Open AI Research, and Cooperative AI PhD Fellowship.
-                </div>
-              </div>
-              <br>
-              <!-- <a href="[RECORDING LINK - ADD AFTER TALK]"><img src="https://img.shields.io/badge/Youtube-Recording-orange"></a> -->
-              <!-- <a href="[PAPER LINK]"><img src="https://img.shields.io/badge/Paper-link-important"></a> -->
-              <!-- <a href="[GITHUB_LINK]"><img src="https://img.shields.io/badge/Github-link-lightgrey"></a> -->
-              <!-- <a href="[SLIDES_LINK]"><img src="https://img.shields.io/badge/Talk-Slides-blue"></a> -->
-              <a class="btn btn-primary btn-xs" data-toggle="collapse" href="#20260505-abstract" role="button" aria-expanded="false">
-                Abstract
-              </a>
-              <div class="collapse" id="20260505-abstract">
-                <div class="card card-body">
-                  Large Language Models (LLMs) are increasingly used to simulate human users in interactive settings such as therapy, education, and social role-play. While these simulations enable scalable training and evaluation of AI agents, off-the-shelf LLMs often drift from their assigned personas, contradict earlier statements, or abandon role-appropriate behavior. We introduce a unified framework for evaluating and improving consistency in LLM-generated dialogue with multi-turn RL, reducing inconsistency by over 55%, resulting in more coherent and trustworthy simulated users. 
-                </div>
-              </div>
-            </li>
-
-            <br/>
-
+            
             <h4>2026/05/19</h4>
             <li>
               <b><a href="[PAPER LINK]">The Chameleon's Limit: Investigating Persona Collapse and Homogenization in Large Language Models</a></b>
@@ -273,6 +243,36 @@ <h3 style="text-align:center">Past Talks (<a href="https://www.youtube.com/@Comp
 
             <h4 style="text-align:center; margin-top:30px;">Spring 2026</h4>
 
+            <h4>2026/05/05</h4>
+            <li>
+              <b><a href="[PAPER LINK]">Consistently Simulating Human Personas with Multi-Turn Reinforcement Learning</a></b>
+              <br>
+              Presenter: <u><a href="https://abdulhaim.github.io/" target="_blank" rel="noopener noreferrer">Marwa Abdulhai</a></u>, UC Berkeley AI Research (BAIR) Lab
+              <a class="btn btn-info btn-xs" data-toggle="collapse" href="#20260505-bio" role="button" aria-expanded="false">
+                Speaker Bio
+              </a>
+              <div class="collapse" id="20260505-bio">
+                <div class="card card-body">
+                  Marwa Abdulhai is a PhD candidate at UC Berkeley advised by Sergey Levine. Her research focuses on enabling AI agents to better understand people and their interactions to build both safe and more AI capable systems. This includes improving the performance of existing large language models (LLMs) for multi-turn dialogue interactions, understanding how to protect against deception in AI systems, and exploring how AI can serve as a useful tool for social science research. Her research has been supported by the Quad Fellowship, AI Policy Hub, Open AI Research, and Cooperative AI PhD Fellowship.
+                </div>
+              </div>
+              <br>
+              <a href="https://youtu.be/4sA8Xe6mCZQ"><img src="https://img.shields.io/badge/Youtube-Recording-orange"></a>
+              <!-- <a href="[PAPER LINK]"><img src="https://img.shields.io/badge/Paper-link-important"></a> -->
+              <!-- <a href="[GITHUB_LINK]"><img src="https://img.shields.io/badge/Github-link-lightgrey"></a> -->
+              <!-- <a href="[SLIDES_LINK]"><img src="https://img.shields.io/badge/Talk-Slides-blue"></a> -->
+              <a class="btn btn-primary btn-xs" data-toggle="collapse" href="#20260505-abstract" role="button" aria-expanded="false">
+                Abstract
+              </a>
+              <div class="collapse" id="20260505-abstract">
+                <div class="card card-body">
+                  Large Language Models (LLMs) are increasingly used to simulate human users in interactive settings such as therapy, education, and social role-play. While these simulations enable scalable training and evaluation of AI agents, off-the-shelf LLMs often drift from their assigned personas, contradict earlier statements, or abandon role-appropriate behavior. We introduce a unified framework for evaluating and improving consistency in LLM-generated dialogue with multi-turn RL, reducing inconsistency by over 55%, resulting in more coherent and trustworthy simulated users. 
+                </div>
+              </div>
+            </li>
+
+            <br/>
+
             <h4>2026/04/28</h4>
             <li>
               <b><a href="[PAPER LINK]">From Social Networks to Sensemaking Networks</a></b>