Grafeo

Grafeo is a pure-Rust, high-performance graph database that can be embedded as a library or run as a standalone database, with optional in-memory or persistent storage. Grafeo supports both Labeled Property Graph (LPG) and Resource Description Framework (RDF) graph data models and all major query languages.

Features

Core Capabilities

Dual data model support: LPG and RDF with optimized storage for each
Multi-language queries: GQL, Cypher, Gremlin, GraphQL, and SPARQL
Embeddable with zero external dependencies
Python bindings via PyO3, Node.js/TypeScript bindings via napi-rs
In-memory and persistent storage modes
MVCC transactions with snapshot isolation

Query Languages

GQL (ISO/IEC 39075)
Cypher (openCypher 9.0)
Gremlin (Apache TinkerPop)
GraphQL (September 2025)
SPARQL (W3C 1.1)

Vector Search & AI

Vector as a first-class type: Value::Vector(Arc<[f32]>) stored alongside graph data
HNSW index: O(log n) approximate nearest neighbor search with tunable recall
Distance functions: Cosine, Euclidean, Dot Product, Manhattan (SIMD-accelerated: AVX2, SSE, NEON)
Vector quantization: Scalar (f32 → u8), Binary (1-bit), and Product Quantization (8-32x compression)
Hybrid graph+vector queries: Combine graph traversals with vector similarity in GQL and SPARQL
Memory-mapped storage: Disk-backed vectors with LRU cache for large datasets
Batch operations: Parallel multi-query search via rayon

Performance Features

Push-based vectorized execution with adaptive chunk sizing
Morsel-driven parallelism with auto-detected thread count
Columnar storage with dictionary, delta, and RLE compression
Cost-based optimizer with DPccp join ordering and histograms
Zone maps for intelligent data skipping (including vector zone maps)
Adaptive query execution with runtime re-optimization
Transparent spilling for out-of-core processing
Bloom filters for efficient membership tests

Query Language & Data Model Support

Query Language	LPG	RDF
GQL	✅	—
Cypher	✅	—
Gremlin	✅	—
GraphQL	✅	✅
SPARQL	—	✅

Grafeo uses a modular translator architecture where query languages are parsed into ASTs, then translated to a unified logical plan that executes against the appropriate storage backend (LPG or RDF).

Data Models

LPG (Labeled Property Graph): Nodes with labels and properties, edges with types and properties. Ideal for social networks, knowledge graphs, and application data.
RDF (Resource Description Framework): Triple-based storage (subject-predicate-object) with SPO/POS/OSP indexes. Ideal for semantic web, linked data, and ontology-based applications.

Installation

Rust

cargo add grafeo

All query languages (GQL, Cypher, Gremlin, GraphQL, SPARQL) are enabled by default. To disable specific languages:

cargo add grafeo --no-default-features --features gql,cypher

Node.js / TypeScript

npm install @grafeo-db/js

Python

uv add grafeo

With CLI support:

uv add grafeo[cli]

Quick Start

Node.js / TypeScript

const { GrafeoDB } = require('@grafeo-db/js');

// Create an in-memory database
const db = await GrafeoDB.create();

// Or open a persistent database
// const db = await GrafeoDB.create({ path: './my-graph.db' });

// Create nodes and relationships
await db.execute("INSERT (:Person {name: 'Alice', age: 30})");
await db.execute("INSERT (:Person {name: 'Bob', age: 25})");
await db.execute(`
    MATCH (a:Person {name: 'Alice'}), (b:Person {name: 'Bob'})
    INSERT (a)-[:KNOWS {since: 2020}]->(b)
`);

// Query the graph
const result = await db.execute(`
    MATCH (p:Person)-[:KNOWS]->(friend)
    RETURN p.name, friend.name
`);
console.log(result.rows);

await db.close();

Python

import grafeo

# Create an in-memory database
db = grafeo.GrafeoDB()

# Or open/create a persistent database
# db = grafeo.GrafeoDB("/path/to/database")

# Create nodes using GQL
db.execute("INSERT (:Person {name: 'Alice', age: 30})")
db.execute("INSERT (:Person {name: 'Bob', age: 25})")

# Create a relationship
db.execute("""
    MATCH (a:Person {name: 'Alice'}), (b:Person {name: 'Bob'})
    INSERT (a)-[:KNOWS {since: 2020}]->(b)
""")

# Query the graph
result = db.execute("""
    MATCH (p:Person)-[:KNOWS]->(friend)
    RETURN p.name, friend.name
""")

for row in result:
    print(row)

# Or use the direct API
node = db.create_node(["Person"], {"name": "Carol"})
print(f"Created node with ID: {node.id}")

# Manage labels
db.add_node_label(node.id, "Employee")     # Add a label
db.remove_node_label(node.id, "Contractor") # Remove a label
labels = db.get_node_labels(node.id)        # Get all labels

Admin APIs (Python)

# Database inspection
db.info()           # Overview: mode, counts, persistence
db.detailed_stats() # Memory usage, index counts
db.schema()         # Labels, edge types, property keys
db.validate()       # Integrity check

# Persistence control
db.save("/path/to/backup")    # Save to disk
db.to_memory()                # Create in-memory copy
GrafeoDB.open_in_memory(path) # Load as in-memory

# WAL management
db.wal_status()      # WAL info
db.wal_checkpoint()  # Force checkpoint

Rust

use grafeo::GrafeoDB;

fn main() {
    // Create an in-memory database
    let db = GrafeoDB::new_in_memory();

    // Or open a persistent database
    // let db = GrafeoDB::open("./my_database").unwrap();

    // Execute GQL queries
    db.execute("INSERT (:Person {name: 'Alice'})").unwrap();

    let result = db.execute("MATCH (p:Person) RETURN p.name").unwrap();
    for row in result.rows {
        println!("{:?}", row);
    }
}

Vector Search

import grafeo

db = grafeo.GrafeoDB()

# Store documents with embeddings
db.execute("""INSERT (:Document {
    title: 'Graph Databases',
    embedding: vector([0.1, 0.8, 0.3, 0.5])
})""")
db.execute("""INSERT (:Document {
    title: 'Vector Search',
    embedding: vector([0.2, 0.7, 0.4, 0.6])
})""")
db.execute("""INSERT (:Document {
    title: 'Cooking Recipes',
    embedding: vector([0.9, 0.1, 0.2, 0.1])
})""")

# Create an HNSW index for fast approximate search
db.execute("""
    CREATE VECTOR INDEX doc_idx ON :Document(embedding)
    WITH (dimensions: 4, metric: 'cosine')
""")

# Find similar documents using cosine similarity
query = [0.15, 0.75, 0.35, 0.55]
result = db.execute(f"""
    MATCH (d:Document)
    WHERE cosine_similarity(d.embedding, vector({query})) > 0.9
    RETURN d.title, cosine_similarity(d.embedding, vector({query})) AS score
    ORDER BY score DESC
""")
for row in result:
    print(row)  # Graph Databases, Vector Search (Cooking Recipes filtered out)

Command-Line Interface

Optional admin CLI for operators and DevOps:

# Install with CLI support
uv add grafeo[cli]

# Inspection
grafeo info ./mydb              # Overview: counts, size, mode
grafeo stats ./mydb             # Detailed statistics
grafeo schema ./mydb            # Labels, edge types, property keys
grafeo validate ./mydb          # Integrity check

# Backup & restore
grafeo backup create ./mydb -o backup
grafeo backup restore backup ./copy --force

# WAL management
grafeo wal status ./mydb
grafeo wal checkpoint ./mydb

# Output formats
grafeo info ./mydb --format json  # Machine-readable JSON
grafeo info ./mydb --format table # Human-readable table (default)

Ecosystem

Project	Description
grafeo-server	HTTP server & web UI: REST API, transactions, single binary (~20MB Docker image)
grafeo-web	Browser-based Grafeo via WebAssembly with IndexedDB persistence
anywidget-graph	Interactive graph visualization for Python notebooks (Marimo, Jupyter, VS Code, Colab)
anywidget-vector	3D vector/embedding visualization for Python notebooks
graph-bench	Benchmark suite comparing graph databases across 25+ benchmarks

Documentation

Full documentation is available at grafeo.dev.

Contributing

See CONTRIBUTING.md for development setup and guidelines.

Acknowledgments

Grafeo's execution engine draws inspiration from:

DuckDB, vectorized push-based execution, morsel-driven parallelism
LadybugDB, CSR-based adjacency indexing, factorized query processing

License

Apache-2.0

Name		Name	Last commit message	Last commit date
Latest commit History 193 Commits
.github		.github
crates		crates
docs		docs
examples		examples
scripts		scripts
tests/python		tests/python
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
CONTRIBUTORS.md		CONTRIBUTORS.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md
_typos.toml		_typos.toml
codecov.yml		codecov.yml
deny.toml		deny.toml
mkdocs.yml		mkdocs.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Grafeo

Features

Core Capabilities

Query Languages

Vector Search & AI

Performance Features

Query Language & Data Model Support

Data Models

Installation

Rust

Node.js / TypeScript

Python

Quick Start

Node.js / TypeScript

Python

Admin APIs (Python)

Rust

Vector Search

Command-Line Interface

Ecosystem

Documentation

Contributing

Acknowledgments

License

About

Uh oh!

Releases 10

Sponsor this project

Uh oh!

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

Uh oh!

License

GrafeoDB/grafeo

Folders and files

Latest commit

History

Repository files navigation

Grafeo

Features

Core Capabilities

Query Languages

Vector Search & AI

Performance Features

Query Language & Data Model Support

Data Models

Installation

Rust

Node.js / TypeScript

Python

Quick Start

Node.js / TypeScript

Python

Admin APIs (Python)

Rust

Vector Search

Command-Line Interface

Ecosystem

Documentation

Contributing

Acknowledgments

License

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 10

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages