Skip to content

Break the glass of the museum exhibit. ⏳ TimeDial enables real-time voice conversations with Einstein, Cleopatra, and more using Gemini 3.0 Pro and ElevenLabs.

License

Notifications You must be signed in to change notification settings

engrtitooo/TimeDial

Repository files navigation

⏳ TimeDial: Immersive History Teacher

Talk to history, face-to-face.

Google Cloud Gemini 3.0 ElevenLabs Status

🚀 Links & Demo


🏆 Hackathon Track Alignment

We are submitting TimeDial to the following tracks:

1. Google Cloud Track (Vertex AI)

  • Integration: We utilize Google Vertex AI to access the exclusive Gemini 3.0 Pro model.
  • Why it matters: Unlike standard chat apps, we leverage Gemini 3.0's superior reasoning and roleplay capabilities to maintain deep, historically accurate personas (Einstein, Cleopatra) that never break character, even when asked complex modern questions.
  • Infrastructure: The entire application is containerized with Docker and deployed on Google Cloud Run for serverless scalability.

2. ElevenLabs Track (Conversational AI)

  • Integration: We use the latest ElevenLabs v3 model (via Python SDK) for ultra-low latency voice generation.
  • Voice-First Experience: The app is designed as a "Voice-First" interface. We implemented a streaming architecture where Gemini's text response is immediately converted to audio, allowing users to have a fluid, natural conversation with history.
  • Emotional Range: We utilized specific "Style Prompts" to give Einstein a thoughtful, slow-paced German accent and Cleopatra a commanding, regal tone.

💡 What it does

TimeDial is an educational time machine. Instead of reading boring textbooks, students can "dial in" a historical figure and have a real-time voice conversation.

  • Real-time Voice: Speak to the app, and it speaks back instantly.
  • Visual Immersion: The UI pulses and reacts to the character's voice frequency.
  • Fact-Checking: Powered by Gemini's grounding to ensure historical accuracy.

🛠️ Tech Stack

  • AI Brain: Google Vertex AI (Gemini 3.0 Pro)
  • Voice: ElevenLabs API (Turbo v2.5 / Multilingual v3)
  • Backend: Python FastAPI (Async/Await)
  • Frontend: HTML5, TailwindCSS, Vanilla JS
  • DevOps: Docker, Google Cloud Run

⚙️ How to Run Locally

  1. Clone the repo:

    git clone https://github.com/engrtitooo/TimeDial.git
    cd TimeDial
  2. Set up Environment Variables: Create a .env file with your keys:

GOOGLE_PROJECT_ID=your_project_id
ELEVENLABS_API_KEY=your_api_key
PORT=8080
  1. Run with Docker:
docker build -t timedial .
docker run -p 8080:8080 --env-file .env timedial

📜 License

MIT License - Open Source for the AI Partner Catalyst Hackathon.

About

Break the glass of the museum exhibit. ⏳ TimeDial enables real-time voice conversations with Einstein, Cleopatra, and more using Gemini 3.0 Pro and ElevenLabs.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages