As large language model agents increasingly populate networked environments, a fundamental question arises: do AI agent societies undergo convergence dynamics similar to human social systems? We present the first large-scale systemic diagnosis of the Moltbook AI agent society, introducing a quantitative diagnostic framework measuring semantic stabilization, lexical turnover, individual inertia, influence persistence, and collective consensus. Our findings demonstrate that scale and interaction density alone are insufficient to induce socialization, providing actionable design and analysis principles for next-generation AI agent societies.
📄 Paper: https://arxiv.org/abs/2602.14299v1
prep_hf_data/: 📥 download HF data + convert to the unified JSON formatprep_embeddings_and_ngrams/: 🧮 embedding and n-gram computationsemantic_convergence/: 🔀 Semantic Convergence analysis (Section 4 in the paper)agent_socialization/: 🤖 Agent-Level Socialization analysis (Section 5 in the paper)influence_structure/: ⚓ Influence Anchors analysis (Section 6 in the paper)
All scripts are kept consistent with the original behavior; the shell entrypoints auto-pick python3.11/python3/python (in that order).
You need Python 3.11+ and the packages in requirements.txt.
python -m venv .venv
source .venv/bin/activate
python -m pip install -r requirements.txtThe dataset is available on Hugging Face: 🤗 https://huggingface.co/datasets/AIcell/moltbook-data
Private dataset access requires a token in an env var (default HF_TOKEN).
export HF_TOKEN=... # do not commit this
bash prep_hf_data/download_and_convert.shPost embeddings:
bash prep_embeddings_and_ngrams/run_embedding.shComment embeddings:
bash prep_embeddings_and_ngrams/compute_comment_embeddings.shN-grams (JSONL):
bash prep_embeddings_and_ngrams/run_ngrams.shFrom the repo root:
# 🔀 Semantic Convergence (Section 4)
bash semantic_convergence/macro_activity/rq1_macro_activity_dynamics.sh
bash semantic_convergence/semantic_distribution/rq1_semantic_distribution_over_time.sh
bash semantic_convergence/cluster_tightening/rq1_cluster_tightening_effects.sh
bash semantic_convergence/lexical_innovation/rq1_lexical_innovation_dynamics.sh
# 🤖 Agent-Level Socialization (Section 5)
bash agent_socialization/individual_semantic_drift/rq2_individual_semantic_drift.sh
bash agent_socialization/interacted_posts/rq2_effects_of_interacted_posts.sh
bash agent_socialization/post_feedback/rq2_effects_of_post_feedback.sh
# ⚓ Influence Anchors (Section 6)
bash influence_structure/structure_influence/run_graph.shBy default these scripts use all_posts_before_2_8_aoe_without_mbc_with_comments.json as input data.
@misc{li2026doessocializationemergeai,
title={Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook},
author={Ming Li and Xirui Li and Tianyi Zhou},
year={2026},
eprint={2602.14299},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2602.14299},
}