Benchmark Trust Model

PomaiDB benchmarks exist to earn trust, not to market speed. We adhere to a "Single Source of Truth" policy.

P99/P999

Tail Latency Transparency

Recall@k

Validated vs Oracle

Crash

Safety Verified

What We Guarantee

Correctness: Recall matches brute-force oracle with high fidelity.
Tail Latency: We always report P99 and P999. No hidden outliers.
Low-end Viability: Benchmarks must run on 2-core/8GB laptops.

Running Benchmarks

We provide a comprehensive suite via the pomai-bench script.

Quick Commands

# 1. Trust benchmarks (Recall, Tail Latency, Crash Recovery)
./scripts/pomai-bench recall
./scripts/pomai-bench mixed-load
./scripts/pomai-bench crash-recovery

# 2. End-to-end Python Benchmark (CIFAR-10)
python3 benchmarks/python_cifar10_feature_bench.py --images 6000 --queries 200

Metrics to Watch

Search Latency (µs): Target P99 < 10ms.
Throughput (QPS): Should scale linearly with threads.
Recall@10: Should be > 0.90 for production.
Build Time: Time to index vectors.

Low-End Device Warning

On 2-core CPUs with 8GB RAM, avoid medium/large datasets. Multi-threading on 2 cores causes thrashing. Stick to --dataset small or use --threads 1.