✨ Magic Cursor

Give your browser a sixth sense. Interact with anything on your screen using the power of multimodal AI.

Magic Cursor is a Chrome extension that bridges the gap between your screen and AI. Select any region of a webpage, capture it instantly, and start a conversation with Gemini (or any OpenAI-compatible model) about what you see.

🚀 Quick Start

1. Build the Extension

First, clone this repository and build the production bundle:

# Clone the repository
git clone https://github.com/your-username/magic-cursor.git
cd magic-cursor

# Install dependencies
npm install

# Build the sidepanel bundle
npm run build

2. Import to Chrome

Open Chrome and navigate to chrome://extensions/.
Enable "Developer mode" in the top right corner.
Click "Load unpacked".
Select the magic-cursor folder (the root directory of this project).

3. Set Up Your API Key

Click the Magic Cursor icon in your extensions toolbar to open the Side Panel.
Click the Settings (⚙️) icon in the top right.
Select your provider (e.g., Gemini) and paste your API key.
- Get a free Gemini API key at Google AI Studio.
The extension will automatically save your settings and load available models.

💡 Example Prompts to Try

Magic Cursor isn't just for text—it's for understanding pixels. Try these:

Shopping Helper: Select a piece of furniture or clothing.

"Where can I buy this or something similar? What is this style called?"
Code Architect: Highlight a complex UI component or a snippet of code.

"Explain how this layout is achieved with CSS Grid." or "Refactor this logic into a React hook."
Study Buddy: Capture a diagram or a math problem from a PDF.

"Walk me through the steps to solve this." or "Explain this diagram like I'm five."
Data Miner: Select a chart or table from a news article.

"Summarize the key trends shown in this graph."

🛠 Features

🎯 Precision Selection: Click and drag to capture exactly what matters.
💬 Multimodal Chat: Send text and images simultaneously to the AI.
📜 Smart History: Multi-turn conversations with automatic chat summarization.
🔑 Bring Your Own Key: Supports Gemini, OpenAI, Anthropic, and OpenRouter.
🌓 Adaptive UI: Beautiful light and dark modes that follow your system theme.

🧰 Tech Stack

AI SDK: OpenAI Node Library (configured for Gemini)
Bundler: Rollup
UI: Vanilla JS + CSS, Marked for Markdown, DOMPurify for safety.

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
.gemini		.gemini
.github		.github
images		images
sidepanel		sidepanel
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.md		LICENSE.md
README.md		README.md
TODO.md		TODO.md
background.js		background.js
content.css		content.css
content.js		content.js
demo.gif		demo.gif
manifest.json		manifest.json
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

✨ Magic Cursor

🚀 Quick Start

1. Build the Extension

2. Import to Chrome

3. Set Up Your API Key

💡 Example Prompts to Try

🛠 Features

🧰 Tech Stack

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

✨ Magic Cursor

🚀 Quick Start

1. Build the Extension

2. Import to Chrome

3. Set Up Your API Key

💡 Example Prompts to Try

🛠 Features

🧰 Tech Stack

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages