Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
36 commits
Select commit Hold shift + click to select a range
219c967
add scatter gather?
michaelfeil Nov 9, 2025
00a7b76
compiles
michaelfeil Nov 9, 2025
976f563
compiles
michaelfeil Nov 9, 2025
7b4b2e9
compiles and tests are passing
michaelfeil Nov 9, 2025
93af502
add another passing tests
michaelfeil Nov 9, 2025
632e54a
working and fast
michaelfeil Nov 9, 2025
ceb1aea
sleepup from mapfree operation
michaelfeil Nov 9, 2025
49dc90d
single pass indexing
michaelfeil Nov 9, 2025
618bc8d
just fmt
michaelfeil Nov 9, 2025
3105ac0
small removal
michaelfeil Nov 9, 2025
3a63882
fmt only
michaelfeil Nov 9, 2025
2ffbb9b
scatter gather help
michaelfeil Nov 9, 2025
4a49939
clippy fixes
michaelfeil Nov 9, 2025
04d7a07
project is compiling?!
michaelfeil Nov 9, 2025
40c3792
working e2e, prompetheus ingration
michaelfeil Nov 9, 2025
91cb5ad
add percentage in command line
michaelfeil Nov 21, 2025
26a1441
update defualt factor
michaelfeil Nov 21, 2025
1ebdf4e
move radix mlp to separate layer
michaelfeil Nov 21, 2025
07f9b95
radix mlp implementation
michaelfeil Nov 21, 2025
9ccd7e6
add comment
michaelfeil Nov 21, 2025
bf0c9cb
add comment
michaelfeil Nov 21, 2025
09fdf3d
flash qwen2
michaelfeil Nov 21, 2025
f35638d
add radix mlp folder
michaelfeil Nov 21, 2025
dba7ddc
compression ratio
michaelfeil Nov 21, 2025
bc47026
cargo releases
michaelfeil Nov 21, 2025
b87aeb4
add queue.rs
michaelfeil Nov 22, 2025
8bb103f
set padding
michaelfeil Nov 25, 2025
0b25115
clippy fix
michaelfeil Nov 25, 2025
a3e761b
fix >= 1.0 condition
michaelfeil Nov 26, 2025
cc88374
add bench crate
michaelfeil Dec 7, 2025
aba0825
improve benchmark
michaelfeil Dec 7, 2025
cfedb27
better bench
michaelfeil Dec 7, 2025
dc5d666
better benchmark
michaelfeil Dec 7, 2025
57c5566
b 32
michaelfeil Dec 7, 2025
088ba6b
normalized benchmark
michaelfeil Dec 7, 2025
89224ce
better bench
michaelfeil Dec 8, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
154 changes: 153 additions & 1 deletion Cargo.lock

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

1 change: 1 addition & 0 deletions Cargo.toml
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,7 @@
members = [
"backends",
"backends/candle",
"backends/candle-bench",
"backends/ort",
"backends/core",
"backends/python",
Expand Down
28 changes: 28 additions & 0 deletions backends/candle-bench/Cargo.toml
Original file line number Diff line number Diff line change
@@ -0,0 +1,28 @@
[package]
name = "candle-bench"
version = "0.1.0"
edition = "2021"
publish = false

[dependencies]
text-embeddings-backend-candle = { path = "../candle" }
text-embeddings-backend = { path = ".." }
text-embeddings-backend-core = { path = "../core" }
text-embeddings-core = { path = "../../core" }
anyhow = { workspace = true }
cudarc = { workspace = true, optional = true }
hf-hub = { workspace = true , features = ["ureq"] }
serde_json = "*"
tracing = "*"

[dev-dependencies]
criterion = "0.5"

[[bench]]
name = "radix_mlp_benchmark"
harness = false

[features]
metal = ["text-embeddings-backend-candle/metal"]
cuda = ["text-embeddings-backend-candle/cuda", "text-embeddings-backend-candle/flash-attn", "dep:cudarc","cudarc?/dynamic-linking"]
candle = ["text-embeddings-backend/candle"]
3 changes: 3 additions & 0 deletions backends/candle-bench/Readme.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
```
cargo bench --manifest-path backends/candle-bench/Cargo.toml --bench radix_mlp_benchmark --features "candle,cuda"
```
Loading