A FastAPI application that creates a vector database from your Obsidian vault and provides semantic search capabilities using Ollama embeddings and Chroma vector database.
- 🔍 Semantic Search: Search your Obsidian notes using natural language
- 🤖 Ollama Integration: Uses mxbai-embed-large embedding model via Ollama
- 📊 Chroma Vector DB: Lightweight, persistent vector database
- 🔄 Auto-Indexing: Periodic updates to keep your search index current
- 🚀 Fast API: RESTful API with automatic documentation
- 📱 Easy Setup: Simple configuration and startup
Install the dependencies:

```
pip install -r requirements.txt
```

Copy the example environment file and update it:

```
cp .env.example .env
```

Edit `.env` with your settings:

```
OLLAMA_URL=http://your-ollama-server:11434
EMBEDDING_MODEL=mxbai-embed-large:latest
VAULT_PATH=/path/to/your/obsidian/vault
```

Make sure Ollama is running on your server and that the embedding model is available:

```
ollama pull mxbai-embed-large:latest
```

Then start the API server:

```
python start.py
```

The API will be available at http://localhost:8000

For a user-friendly web interface, you can also start the Gradio UI:

```
python start_ui.py
```

The web UI will be available at http://localhost:7860
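For reference, the configuration module presumably reads these variables at startup. Here is a minimal sketch of what such a `config.py` could look like, assuming `python-dotenv` is used to load the `.env` file; the actual module may be organized differently:

```python
# config.py -- illustrative sketch, not the project's actual implementation
import os
from dotenv import load_dotenv  # assumes python-dotenv is installed

load_dotenv()  # read variables from .env into the process environment

# Defaults mirror the configuration table below
OLLAMA_URL = os.getenv("OLLAMA_URL", "http://localhost:11434")
EMBEDDING_MODEL = os.getenv("EMBEDDING_MODEL", "mxbai-embed-large:latest")
VAULT_PATH = os.getenv("VAULT_PATH")  # required; no default
CHROMA_PERSIST_DIRECTORY = os.getenv("CHROMA_PERSIST_DIRECTORY", "./chroma_db")
API_HOST = os.getenv("API_HOST", "0.0.0.0")
API_PORT = int(os.getenv("API_PORT", "8000"))
INDEX_INTERVAL_MINUTES = int(os.getenv("INDEX_INTERVAL_MINUTES", "30"))

if not VAULT_PATH:
    raise RuntimeError("VAULT_PATH must be set in .env")
```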
The main search endpoint accepts a JSON body:

```
POST /search
Content-Type: application/json

{
  "query": "machine learning concepts",
  "limit": 10
}
```

Other endpoints:

- `POST /reindex`
- `GET /health`
- `GET /stats`
- `DELETE /documents/{file_path}`

| Environment Variable | Default | Description |
|---|---|---|
| `OLLAMA_URL` | `http://localhost:11434` | Ollama server URL |
| `EMBEDDING_MODEL` | `mxbai-embed-large:latest` | Embedding model name |
| `VAULT_PATH` | required | Path to Obsidian vault |
| `CHROMA_PERSIST_DIRECTORY` | `./chroma_db` | Vector database storage path |
| `API_HOST` | `0.0.0.0` | API server host |
| `API_PORT` | `8000` | API server port |
| `INDEX_INTERVAL_MINUTES` | `30` | Auto-indexing interval (minutes) |
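You can call the search endpoint from any HTTP client. A minimal Python sketch using `requests` is shown below; the `"results"` key in the response is an assumption for illustration, not a documented schema:

```python
import requests

# Query the running API for notes related to a topic
resp = requests.post(
    "http://localhost:8000/search",
    json={"query": "machine learning concepts", "limit": 10},
    timeout=30,
)
resp.raise_for_status()

# The exact response shape depends on the API; "results" is assumed here
for hit in resp.json().get("results", []):
    print(hit)
```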
- File Processing: Scans your Obsidian vault for `.md` files
- Text Chunking: Splits large documents into overlapping chunks for better search
- Embedding Generation: Uses Ollama to generate embeddings for each chunk (a sketch of this pipeline follows the list)
- Vector Storage: Stores embeddings in the Chroma database with metadata
- Semantic Search: Converts search queries to embeddings and finds similar documents
- Auto-Updates: Periodically checks for new or modified files
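To make the chunk-embed-store pipeline concrete, here is a hedged sketch using Ollama's `/api/embeddings` endpoint and Chroma's Python client. The chunk size, overlap, and collection name are illustrative choices, not values taken from the actual `indexer.py`:

```python
import requests
import chromadb

OLLAMA_URL = "http://localhost:11434"
MODEL = "mxbai-embed-large:latest"

def chunk_text(text: str, size: int = 1000, overlap: int = 200) -> list[str]:
    """Split text into overlapping chunks so context spans chunk boundaries."""
    step = size - overlap
    return [text[i:i + size] for i in range(0, max(len(text), 1), step)]

def embed(text: str) -> list[float]:
    """Generate an embedding for one chunk via Ollama's embeddings API."""
    r = requests.post(f"{OLLAMA_URL}/api/embeddings",
                      json={"model": MODEL, "prompt": text}, timeout=60)
    r.raise_for_status()
    return r.json()["embedding"]

client = chromadb.PersistentClient(path="./chroma_db")
collection = client.get_or_create_collection("obsidian_notes")  # name is illustrative

def index_file(path: str, text: str) -> None:
    """Embed each chunk of a note and store it with file metadata."""
    for i, chunk in enumerate(chunk_text(text)):
        collection.add(
            ids=[f"{path}:{i}"],
            embeddings=[embed(chunk)],
            documents=[chunk],
            metadatas=[{"file_path": path, "chunk": i}],
        )
```

Storing the file path and chunk index as metadata is what allows the `DELETE /documents/{file_path}` endpoint to remove all chunks belonging to a single note.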
```
obsidian_vector_search/
├── main.py              # FastAPI application
├── config.py            # Configuration management
├── ollama_client.py     # Ollama API client
├── vector_db.py         # Chroma database wrapper
├── indexer.py           # File processing and indexing
├── start.py             # Startup script
├── requirements.txt     # Python dependencies
├── .env.example         # Environment template
└── README.md            # This file
```
For development with auto-reload, you can also run the server directly:

```
uvicorn main:app --reload --host 0.0.0.0 --port 8000
```

The Gradio web UI provides an easy-to-use interface with (see the sketch after this list):
- 🔍 Search Tab: Enter queries and view results with similarity scores
- ⚙️ System Tab: Check API connection, view statistics, and manually reindex
- 📊 Real-time Stats: View vault and database statistics
- 🔄 Manual Controls: Force reindexing and connection testing
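As a minimal sketch of how such a Gradio front end can call the API: the real `start_ui.py` may be organized differently, and the endpoint and field names here follow the `/search` example above:

```python
import gradio as gr
import requests

API_URL = "http://localhost:8000"

def search(query: str, limit: int):
    """Forward the query to the FastAPI /search endpoint and show raw results."""
    resp = requests.post(f"{API_URL}/search",
                         json={"query": query, "limit": int(limit)}, timeout=30)
    resp.raise_for_status()
    return resp.json()

with gr.Blocks(title="Obsidian Vector Search") as demo:
    query = gr.Textbox(label="Search query")
    limit = gr.Slider(1, 50, value=10, step=1, label="Max results")
    results = gr.JSON(label="Results")
    gr.Button("Search").click(search, inputs=[query, limit], outputs=results)

demo.launch(server_name="0.0.0.0", server_port=7860)
```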
Once the server is running, visit:
- Web UI: http://localhost:7860 (if started with `start_ui.py`)
- Interactive API docs: http://localhost:8000/docs
- ReDoc documentation: http://localhost:8000/redoc
- "Cannot connect to Ollama server"
  - Verify Ollama is running: `ollama list`
  - Check the `OLLAMA_URL` in your `.env` file
  - Ensure network connectivity to the Ollama server
- "Embedding model not found"
  - Pull the model: `ollama pull mxbai-embed-large:latest`
  - Verify with: `ollama list`
- "Vault path does not exist"
  - Check the `VAULT_PATH` in your `.env` file
  - Ensure the path is accessible and contains `.md` files
- Slow indexing
  - Reduce the batch size in the indexer
  - Check Ollama server performance
  - Consider using a smaller embedding model for testing
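If you are unsure whether the problem is the API or Ollama itself, a quick connectivity check from Python can narrow it down (Ollama's `/api/tags` endpoint lists the installed models):

```python
import requests

OLLAMA_URL = "http://localhost:11434"  # match OLLAMA_URL in your .env

try:
    r = requests.get(f"{OLLAMA_URL}/api/tags", timeout=5)
    r.raise_for_status()
    models = [m["name"] for m in r.json().get("models", [])]
    print("Ollama reachable; installed models:", models)
except requests.RequestException as exc:
    print("Cannot reach Ollama:", exc)
```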
The application logs important information to help with debugging:
- Indexing progress and errors
- API request handling
- Database operations
- Ollama connectivity issues
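To see more detail while debugging, you can raise the log level when running locally. This is a standard `logging` setup sketch; the application's own logger names are not documented here:

```python
import logging

# Show debug-level messages from the application and its HTTP client
logging.basicConfig(
    level=logging.DEBUG,
    format="%(asctime)s %(levelname)s %(name)s: %(message)s",
)
```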
This project is open source and available under the MIT License.