A/B Testing Results

Experiment: FAISS nprobe Optimization

Hypothesis: Lowering nprobe reduces latency and raises throughput (QPS) without a significant loss in Recall@10.

Config	P50 Latency	P95 Latency	P99 Latency	QPS	Recall@10
nprobe=5	42ms	68ms	85ms	238	90%
nprobe=10	58ms	92ms	115ms	172	95%
nprobe=20	89ms	142ms	178ms	112	98%
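
For reference, the columns above can be derived from raw per-query measurements roughly as follows. This is a minimal sketch, not the code in nprobe_experiment.py: the function names are hypothetical, the percentile uses the nearest-rank method, and Recall@10 is assumed to mean the overlap between the returned top 10 and an exact top 10.

```python
import math
from typing import Dict, Sequence


def percentile(sorted_ms: Sequence[float], p: float) -> float:
    """Nearest-rank percentile of an already sorted latency sample."""
    if not sorted_ms:
        raise ValueError("no latency samples")
    # Smallest rank such that at least p percent of samples are <= the value.
    rank = max(1, math.ceil(p * len(sorted_ms) / 100))
    return sorted_ms[rank - 1]


def summarize(latencies_ms: Sequence[float], wall_seconds: float) -> Dict[str, float]:
    """Latency percentiles plus throughput for one configuration."""
    ordered = sorted(latencies_ms)
    return {
        "p50_ms": percentile(ordered, 50),
        "p95_ms": percentile(ordered, 95),
        "p99_ms": percentile(ordered, 99),
        # Queries completed per second of wall-clock time.
        "qps": len(ordered) / wall_seconds,
    }


def recall_at_k(retrieved: Sequence[int], ground_truth: Sequence[int], k: int = 10) -> float:
    """Fraction of the exact top-k ids that also appear in the returned top-k."""
    return len(set(retrieved[:k]) & set(ground_truth[:k])) / k
```

Averaging recall_at_k over all test queries would give one Recall@10 figure per configuration.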

Winner: nprobe=10 (default)

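Trade-offs:

nprobe=5: lowest latency (P50 42ms) and highest throughput (238 QPS), but Recall@10 drops to 90%.
nprobe=10: middle ground at P50 58ms and 172 QPS with 95% Recall@10.
nprobe=20: highest Recall@10 (98%), but P50 rises to 89ms and throughput falls to 112 QPS.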

Decision: Keep nprobe=10 as default, expose as API parameter for user control.
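
A per-request override with a default of 10 could be resolved along these lines. This is a sketch under assumptions: the Python SearchRequest class only mirrors the SearchRequest.Nprobe parameter for illustration, and the bounds MIN_NPROBE and MAX_NPROBE are hypothetical values that do not come from the experiment.

```python
from dataclasses import dataclass
from typing import List, Optional

DEFAULT_NPROBE = 10   # default chosen by the experiment
MIN_NPROBE = 1        # hypothetical lower bound
MAX_NPROBE = 64       # hypothetical upper bound to cap worst-case latency


@dataclass
class SearchRequest:
    """Illustrative stand-in for the API request type."""
    query: List[float]
    k: int = 10
    nprobe: Optional[int] = None  # None means "use the default"


def resolve_nprobe(req: SearchRequest) -> int:
    """Return the nprobe to use for this request, clamped to safe bounds."""
    if req.nprobe is None:
        return DEFAULT_NPROBE
    return max(MIN_NPROBE, min(MAX_NPROBE, req.nprobe))


# With a FAISS IVF index the resolved value would then be applied before
# searching, for example: index.nprobe = resolve_nprobe(req)
```

Clamping keeps a user-supplied value from driving latency far beyond the range that was measured.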

To run the experiment, start the sidecar and then the test script:

docker compose up -d sidecar
python tests/ab-testing/nprobe_experiment.py

Statistical significance: p < 0.01 (t-test)
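
The report does not say which t-test variant was used or which pair of configurations was compared. As one possibility, a Welch (unequal variance) t statistic over two sets of per-query latencies can be computed with the standard library alone. The function name welch_t is hypothetical.

```python
import math
from statistics import mean, variance
from typing import Sequence, Tuple


def welch_t(a: Sequence[float], b: Sequence[float]) -> Tuple[float, float]:
    """Return (t statistic, degrees of freedom) for Welch's two-sample t-test."""
    if len(a) < 2 or len(b) < 2:
        raise ValueError("each sample needs at least two observations")
    # Squared standard error contributed by each sample.
    va = variance(a) / len(a)
    vb = variance(b) / len(b)
    if va + vb == 0:
        raise ValueError("both samples have zero variance")
    t = (mean(a) - mean(b)) / math.sqrt(va + vb)
    # Welch-Satterthwaite approximation of the degrees of freedom.
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df
```

Turning t and df into a p-value requires the t distribution, which the standard library does not provide. If SciPy is available, scipy.stats.ttest_ind(a, b, equal_var=False) performs the same test and returns the p-value directly.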

Implemented adaptive nprobe: see the SearchRequest.Nprobe parameter in the API.
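
The report does not show how an adaptive value is chosen. One possible policy, sketched here as an assumption rather than the actual implementation, is to pick the smallest measured nprobe whose Recall@10 meets a caller-supplied target. The function pick_nprobe and the target_recall argument are hypothetical; the recall figures are the ones from the table above.

```python
# Recall@10 measured in the experiment, keyed by nprobe.
MEASURED_RECALL = {5: 0.90, 10: 0.95, 20: 0.98}


def pick_nprobe(target_recall: float) -> int:
    """Smallest measured nprobe that reaches target_recall.

    Falls back to the largest measured nprobe when no configuration
    reaches the target.
    """
    for nprobe, recall in sorted(MEASURED_RECALL.items()):
        if recall >= target_recall:
            return nprobe
    return max(MEASURED_RECALL)
```

Because smaller nprobe values were faster in every measured column, choosing the smallest value that satisfies the recall target also minimizes latency among the tested configurations.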