diff --git a/README.md b/README.md index 3b8e2cd..600efa4 100755 --- a/README.md +++ b/README.md @@ -516,7 +516,7 @@ Options: - **Dynamic batching:** Efficiently batch embedding computations for GPU utilization - **Two-level search:** Smart graph traversal that prioritizes promising nodes -**Backends:** DiskANN or HNSW - pick what works for your data size. +**Backends:** HNSW (default) for most use cases, with optional DiskANN support for billion-scale datasets. ## Benchmarks @@ -536,8 +536,7 @@ Options: ```bash uv pip install -e ".[dev]" # Install dev dependencies -python benchmarks/run_evaluation.py data/indices/dpr/dpr_diskann # DPR dataset -python benchmarks/run_evaluation.py data/indices/rpj_wiki/rpj_wiki.index # Wikipedia +python benchmarks/run_evaluation.py # Will auto-download evaluation data and run benchmarks ``` The evaluation script downloads data automatically on first run. The last three results were tested with partial personal data, and you can reproduce them with your own data!