Project Documentation

Overview

Lily is a single application that combines an LLM-powered chatbot, a Discord bot, and VTube Studio integration for virtual avatar control. It also supports translation and speech synthesis. Long-term memory is stored in MongoDB and retrieved through semantic search over vector embeddings.

Setup

Install Dependencies:
```
pip install -r requirements.txt
```
Configure Environment:
- Copy .env.template to .env.
- Fill in the required values in .env (Discord tokens, MongoDB URI, personality settings, etc.).
Configure LLM API Keys:
- Copy llm_api_keys.json.template to llm_api_keys.json.
- Open llm_api_keys.json and add your API keys for the desired providers (e.g., "gemini", "openai"). You can add multiple keys per provider as a list of strings. The application will cycle through these keys using a round-robin strategy.
```
{
  "gemini": [
    "YOUR_GEMINI_API_KEY_1",
    "YOUR_GEMINI_API_KEY_2"
  ],
  "openai": [
    "YOUR_OPENAI_API_KEY_1"
  ]
}
```
- Important: Ensure llm_api_keys.json is added to your .gitignore file to prevent accidentally committing your keys.
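
The round-robin key cycling described above can be pictured with a small sketch. This is not the application's actual implementation; `KeyRotator` and `next_key` are hypothetical names chosen for illustration, and only the file layout (provider name mapped to a list of key strings) comes from the template above.

```python
import itertools
import json
from pathlib import Path


class KeyRotator:
    """Hands out API keys for each provider in round-robin order (hypothetical helper)."""

    def __init__(self, keys_by_provider):
        # One endless iterator per provider; providers with an empty list are skipped.
        self._cycles = {
            provider: itertools.cycle(keys)
            for provider, keys in keys_by_provider.items()
            if keys
        }

    @classmethod
    def from_file(cls, path="llm_api_keys.json"):
        # The file maps provider names to lists of key strings.
        return cls(json.loads(Path(path).read_text(encoding="utf-8")))

    def next_key(self, provider):
        if provider not in self._cycles:
            raise KeyError(f"No API keys configured for provider {provider!r}")
        return next(self._cycles[provider])


# Usage with inline data instead of the real file.
rotator = KeyRotator({"gemini": ["KEY_1", "KEY_2"], "openai": ["KEY_A"]})
first_three = [rotator.next_key("gemini") for _ in range(3)]
```

With two Gemini keys, the third request wraps around to the first key again. A provider with a single key simply returns that key every time.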
Run TTS Provider (Optional): If using Text-to-Speech, ensure the TTS-Provider server is running.
Run MongoDB: If using long-term memory, ensure your MongoDB instance is running and accessible via the MONGO_URI in your .env file.
Run Application:
```
python main.py
```

Technologies Used

Python
Kivy
Discord.py
SpeechRecognition
External TTS-Provider (for Text-to-Speech via WebSocket; requires a separate server)
Deep Translator (for translation)
OpenAI and/or Gemini API (via llm_api_keys.json)
pymongo (for MongoDB interaction)
sentence-transformers (for memory embeddings and semantic search)
websockets (for VTube Studio integration)

Note: Text-to-Speech functionality requires the TTS-Provider server to be running locally (default: ws://localhost:9000).
Note: MongoDB memory requires a running MongoDB instance and the MONGO_URI set in the .env file.
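
Before starting the application, it can help to confirm that something is listening on the TTS-Provider address. The sketch below is a hypothetical preflight check, not part of the project: `is_reachable` only tests whether a TCP connection can be opened and does not perform a WebSocket handshake.

```python
import socket
from urllib.parse import urlparse


def is_reachable(ws_url, timeout=1.0):
    """Return True if a TCP connection to the URL's host and port succeeds."""
    parsed = urlparse(ws_url)
    # Fall back to the scheme's default port when the URL does not name one.
    port = parsed.port or (443 if parsed.scheme == "wss" else 80)
    try:
        with socket.create_connection((parsed.hostname, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    # Default TTS-Provider address from the note above.
    print("TTS-Provider reachable:", is_reachable("ws://localhost:9000"))
```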

Features

Chat Tab:
- Interact with a chatbot powered by the configured LLM provider (OpenAI or Gemini, keys managed in llm_api_keys.json).
- Translate chatbot responses into Japanese using Deep Translator.
- Convert Japanese responses into speech via an external TTS-Provider server.
- Short-term memory: Maintains a prompt history (in-memory).
- Long-term memory: Stores and retrieves information ("facts") in a MongoDB database using vector embeddings (sentence-transformers) for semantic search. This allows recalling relevant information based on meaning, not just keywords. Requires MONGO_URI in .env. The LLM can interact with this memory using the save_memory and fetch_memory tools. Duplicate facts are automatically detected and handled based on semantic similarity.
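
To illustrate how semantic retrieval and duplicate detection can work, here is a minimal sketch using cosine similarity over embedding vectors. It makes several assumptions: the `FactStore` class, its in-memory list, and the 0.95 threshold are hypothetical stand-ins for the MongoDB collection, and the three-dimensional vectors are toy values in place of real sentence-transformers embeddings. How the application actually handles a detected duplicate is not specified here; this sketch simply skips the insert.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class FactStore:
    """In-memory stand-in for the MongoDB collection of facts (hypothetical)."""

    def __init__(self, duplicate_threshold=0.95):
        self.duplicate_threshold = duplicate_threshold  # assumed value
        self.facts = []  # list of (text, embedding) pairs

    def save(self, text, embedding):
        # Skip the insert when an existing fact is nearly identical in meaning.
        for _, existing in self.facts:
            if cosine_similarity(embedding, existing) >= self.duplicate_threshold:
                return False
        self.facts.append((text, embedding))
        return True

    def fetch(self, query_embedding, top_k=3):
        # Rank stored facts by similarity to the query, most similar first.
        ranked = sorted(
            self.facts,
            key=lambda fact: cosine_similarity(query_embedding, fact[1]),
            reverse=True,
        )
        return [text for text, _ in ranked[:top_k]]


store = FactStore()
store.save("User likes green tea", [0.9, 0.1, 0.0])
store.save("User lives in Osaka", [0.0, 0.2, 0.9])
is_new = store.save("User enjoys green tea", [0.88, 0.12, 0.01])  # near-duplicate
best_match = store.fetch([0.8, 0.2, 0.1], top_k=1)
```

The third `save` call is rejected because its vector points in almost the same direction as the first fact, even though the wording differs. That is the sense in which retrieval is based on meaning and not on keywords.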
Discord Tab:
- Start and stop a Discord bot that responds to messages in a designated channel.
- The bot uses an LLM (provider and model configurable via .env, keys managed in llm_api_keys.json) to understand messages and generate responses.
- The LLM can use specific tools (defined in llm/discord_llm.py) to enhance its replies during the main interaction loop, including:
  - fetch_memory: Retrieves relevant information from memory using semantic search, returning facts with their unique IDs.
  - search_web: Performs web searches.
  - get_current_time: Gets the current time.
- Note: The Discord LLM explicitly skips the final memory save/update step for security reasons.
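
The tool mechanism listed above can be sketched as a name-to-function registry. This is an illustration only and does not reproduce llm/discord_llm.py: the `TOOLS` dictionary, the `dispatch` function, and the `{"name": ..., "arguments": ...}` call shape are assumptions, and `search_web` is a placeholder with no real search backend.

```python
from datetime import datetime, timezone


def get_current_time():
    """Return the current UTC time as an ISO 8601 string."""
    return datetime.now(timezone.utc).isoformat()


def search_web(query):
    # Placeholder: a real implementation would call a search backend here.
    return f"(no search backend configured for {query!r})"


# Registry of the tools the model is allowed to call (hypothetical layout).
TOOLS = {
    "get_current_time": get_current_time,
    "search_web": search_web,
}


def dispatch(tool_call):
    """Run a tool call of the assumed form {"name": str, "arguments": dict}."""
    name = tool_call.get("name")
    if name not in TOOLS:
        return {"error": f"unknown tool: {name}"}
    return {"result": TOOLS[name](**tool_call.get("arguments", {}))}


time_reply = dispatch({"name": "get_current_time"})
search_reply = dispatch({"name": "search_web", "arguments": {"query": "kivy"}})
unknown_reply = dispatch({"name": "not_a_tool"})
```

Keeping the registry explicit means the model can only invoke functions that were deliberately exposed; anything else returns an error instead of running.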
VTube Studio Integration (VTube Tab):
- Connect to VTube Studio via WebSocket (pyvts library).
- Fetch available VTS parameters (input parameters).
- Load, display, and trigger custom "animations" (parameter presets) saved as JSON files in %APPDATA%/NsTut/LilyTheThird/vtube/animations.
- Create, edit, and delete these animation JSON files using an inline editor panel.
- Filter displayed animations and parameters using search bars.
- Copy parameter names or set animation data from clipboard JSON.
- See docs/vtube.md for detailed documentation.
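
As a rough picture of how a JSON parameter preset could be turned into a VTube Studio request, consider the sketch below. The preset schema (`{"parameters": {name: value}}`) and the helper names are hypothetical; see docs/vtube.md for the real file format. The message layout follows the `InjectParameterDataRequest` message of the VTube Studio public API, and whether the application sends exactly this message is an assumption.

```python
import json
from pathlib import Path


def load_presets(directory):
    """Read every *.json file in the directory into a dict keyed by file stem."""
    presets = {}
    for path in sorted(Path(directory).glob("*.json")):
        presets[path.stem] = json.loads(path.read_text(encoding="utf-8"))
    return presets


def build_inject_request(preset, request_id="lily-animation"):
    """Build a parameter injection message from a preset (assumed schema)."""
    return {
        "apiName": "VTubeStudioPublicAPI",
        "apiVersion": "1.0",
        "requestID": request_id,
        "messageType": "InjectParameterDataRequest",
        "data": {
            "parameterValues": [
                {"id": name, "value": value}
                for name, value in preset["parameters"].items()
            ]
        },
    }


# Example preset in the assumed schema; the parameter names are illustrative.
example_preset = {"parameters": {"FaceAngleX": 15.0, "MouthOpen": 0.6}}
message = json.dumps(build_inject_request(example_preset))
```

The resulting JSON string would then be sent over the WebSocket connection after authenticating with VTube Studio.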

