Skip to content

Commit 23f92c5

Browse files
authored
Add presentation details for May 19, 2026
Added a new paper presentation with details and abstract.
1 parent d9554c2 commit 23f92c5

1 file changed

Lines changed: 29 additions & 1 deletion

File tree

stamina/index.html

Lines changed: 29 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -189,7 +189,7 @@ <h4>[DATE Y/M/D]</h4>
189189
</li>
190190
--><!-- END TALK TEMPLATE -->
191191

192-
<br>
192+
<br>
193193

194194
<h4>2026/05/05</h4>
195195
<li>
@@ -219,6 +219,34 @@ <h4>2026/05/05</h4>
219219
</div>
220220
</li>
221221

222+
<h4>2026/05/19</h4>
223+
<li>
224+
<b><a href="[PAPER LINK]">The Chameleon's Limit: Investigating Persona Collapse and Homogenization in Large Language Models</a></b>
225+
<br>
226+
Presenter: <u><a href="https://abdulhaim.github.io/" target="_blank" rel="noopener noreferrer">Marwa Abdulhai</a></u>, UC Berkeley AI Research (BAIR) Lab
227+
<a class="btn btn-info btn-xs" data-toggle="collapse" href="#20260505-bio" role="button" aria-expanded="false">
228+
Speaker Bio
229+
</a>
230+
<div class="collapse" id="20260505-bio">
231+
<div class="card card-body">
232+
Yunze (Lorenzo) Xiao is a Master’s student at Carnegie Mellon University’s Language Technologies Institute, advised by Prof. Mona Diab. His research aims to develop large language models that move beyond surface-level fluency toward genuine human-like intelligence, spanning anthropomorphism as a controllable modeling dimension, persona consistency, long-horizon memory, affective simulation, and multi-agent systems, with applications in education and therapy. He has published at ACL, EMNLP, and LREC-COLING, including InCharacter and ToxiCloakCN, and previously conducted research at SUTD and QCRI on multilingual NLP, propaganda detection, and AI safety. He co-organized the NeurIPS 2025 PersonaLLM workshop.
233+
</div>
234+
</div>
235+
<br>
236+
<!-- <a href="[RECORDING LINK - ADD AFTER TALK]"><img src="https://img.shields.io/badge/Youtube-Recording-orange"></a> -->
237+
<!-- <a href="[PAPER LINK]"><img src="https://img.shields.io/badge/Paper-link-important"></a> -->
238+
<!-- <a href="[GITHUB_LINK]"><img src="https://img.shields.io/badge/Github-link-lightgrey"></a> -->
239+
<!-- <a href="[SLIDES_LINK]"><img src="https://img.shields.io/badge/Talk-Slides-blue"></a> -->
240+
<a class="btn btn-primary btn-xs" data-toggle="collapse" href="#20260505-abstract" role="button" aria-expanded="false">
241+
Abstract
242+
</a>
243+
<div class="collapse" id="20260505-abstract">
244+
<div class="card card-body">
245+
Applications based on large language models (LLMs), such as multi-agent simulations, require population diversity among agents. We identify a pervasive failure mode we term \emph{Persona Collapse}: agents each assigned a distinct profile nonetheless converge into a narrow behavioral mode, producing a homogeneous simulated population. To quantify persona collapse, we propose a framework that measures how much of the persona space a population occupies (Coverage), how evenly agents spread across it (Uniformity), and how rich the resulting behavioral patterns are (Complexity). Evaluating ten LLMs on personality simulation (BFI-44), moral reasoning, and self-introduction, we observe persona collapse along two axes: (1) Dimensions: a model can appear diverse on one axis yet structurally degenerate on another, and (2) Domains: the same model may collapse the most in personality yet be the most diverse in moral reasoning. Furthermore, item-level diagnostics reveal that behavioral variation tracks coarse demographic stereotypes rather than the fine-grained individual differences specified in each persona. Counter-intuitively, \textbf{the models achieving the highest per-persona fidelity consistently produce the most stereotyped populations}. We release our toolkit and data to support population-level evaluation of LLMs.
246+
</div>
247+
</div>
248+
</li>
249+
222250

223251

224252
</div>

0 commit comments

Comments
 (0)