LLM Agent-Based Simulation of Student Activities and Mental Health Using Smartphone Sensing Data

A modular pipeline for analyzing student life data using Large Language Models (LLMs). This project aggregates student behavioral data and generates insightful weekly summaries using OpenAI and Anthropic APIs.

✨ Features

Multi-source data analysis: Processes emotions, academics, activities, and behavioral patterns
LLM integration: Supports OpenAI and Anthropic for advanced analysis and summarization
Flexible configuration: Environment variables and JSON/CSV configuration files
Weekly insights: Generates comprehensive weekly summaries for individual students
Modular architecture: Clean, extensible pipeline design

📁 Project Structure

.
├── all_pipeline.py          # Main pipeline script
├── agents/                  # Analysis agents and modules
├── dataset/
│   ├── education/
│   │   ├── class.csv       # Class enrollment data
│   │   ├── class_info.json # Course information
│   │   └── deadlines.csv   # Assignment deadlines
│   └── *.csv               # Additional student data (gitignored)
├── BigFive.csv             # Personality assessment data
├── lab_assignment.csv      # Laboratory assignment records
├── .env.example            # Environment variables template
├── requirements.txt        # Python dependencies
└── README.md

🛠️ Requirements

Python 3.8+
pip package manager
API keys for OpenAI and/or Anthropic

💻 Installation

Clone the repository:

git clone https://github.com/DarthMouthStudentSimulator/DarthMouthStudentSimulator.git
cd DarthMouthStudentSimulator

Install dependencies:
```
pip install -r requirements.txt
```

Configure environment variables:

cp .env.example .env

Edit .env with your API keys:

OPENAI_API_KEY=your_openai_api_key_here
ANTHROPIC_API_KEY=your_anthropic_api_key_here

Verify data files: Ensure the following files are present:
- BigFive.csv
- lab_assignment.csv
- dataset/education/class.csv
- dataset/education/class_info.json
- dataset/education/deadlines.csv

🚀 Usage

Basic Usage

Run the main pipeline with default settings:

python all_pipeline.py

Default configuration:

User ID: u58
Analysis period: Weeks 1-10
LLM provider: OpenAI (configurable)

Custom Configuration

Modify the config dictionary in all_pipeline.py to customize:

config = {
    'uid': 'u58',              # Target user ID
    'weeks': range(1, 11),     # Analysis period
    'client_type': 'openai',   # 'openai' or 'anthropic'
    # ... other settings
}

⚙️ Configuration Options

Parameter	Description	Options
`uid`	Student user ID to analyze	Any valid user ID in dataset
`client_type`	LLM provider	`"openai"` or `"anthropic"`
`weeks`	Analysis time period	Range object (e.g., `range(1, 11)`)

📊 Output

The pipeline generates comprehensive weekly analysis including:

Emotional patterns: Mood trends and emotional state analysis
Academic performance: Grades, assignment completion, study habits
Behavioral insights: Activity patterns and engagement metrics
Personalized recommendations: Based on Big Five personality assessment

Sample Output

Week 1 Analysis for User u58:
- Emotion Score: 7.2/10 (Positive trend)
- Academic Score: 8.5/10 (High engagement)
- Key Insights: Strong start to semester, consistent study patterns

🤝 Contributing

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

📚 Citation

If you use this code in your research, please cite our repo:

@misc{2508.02679,
Author = {Wayupuk Sommuang and Kun Kerdthaisong and Pasin Buakhaw and Aslan B. Wong and Nutchanon Yongsatianchot},
Title = {LLM Agent-Based Simulation of Student Activities and Mental Health Using Smartphone Sensing Data},
Year = {2025},
Eprint = {arXiv:2508.02679},
}

🙏 Acknowledgments

PreceptorAI Powered by Cariva for compute resource
Contributors to the student life dataset

Need help? Open an issue or contact the maintainers.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM Agent-Based Simulation of Student Activities and Mental Health Using Smartphone Sensing Data

✨ Features

📁 Project Structure

🛠️ Requirements

💻 Installation

🚀 Usage

Basic Usage

Custom Configuration

⚙️ Configuration Options

📊 Output

Sample Output

🤝 Contributing

📄 License

📚 Citation

🙏 Acknowledgments

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
agents		agents
dataset		dataset
models		models
scripts		scripts
student_info		student_info
LICENSE		LICENSE
README.md		README.md
all_pipeline.py		all_pipeline.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

LLM Agent-Based Simulation of Student Activities and Mental Health Using Smartphone Sensing Data

✨ Features

📁 Project Structure

🛠️ Requirements

💻 Installation

🚀 Usage

Basic Usage

Custom Configuration

⚙️ Configuration Options

📊 Output

Sample Output

🤝 Contributing

📄 License

📚 Citation

🙏 Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages