23 lines
1.5 KiB
Markdown
23 lines
1.5 KiB
Markdown
# ✨ Detailed Features
|
|
|
|
## 🔥 Core Features
|
|
|
|
- **🔄 Real-time Embeddings** - Eliminate heavy embedding storage with dynamic computation using optimized ZMQ servers and highly optimized search paradigm (overlapping and batching) with highly optimized embedding engine
|
|
- **📈 Scalable Architecture** - Handles millions of documents on consumer hardware; the larger your dataset, the more LEANN can save
|
|
- **🎯 Graph Pruning** - Advanced techniques to minimize the storage overhead of vector search to a limited footprint
|
|
- **🏗️ Pluggable Backends** - DiskANN, HNSW/FAISS with unified API
|
|
|
|
## 🛠️ Technical Highlights
|
|
- **🔄 Recompute Mode** - Highest accuracy scenarios while eliminating vector storage overhead
|
|
- **⚡ Zero-copy Operations** - Minimize IPC overhead by transferring distances instead of embeddings
|
|
- **🚀 High-throughput Embedding Pipeline** - Optimized batched processing for maximum efficiency
|
|
- **🎯 Two-level Search** - Novel coarse-to-fine search overlap for accelerated query processing (optional)
|
|
- **💾 Memory-mapped Indices** - Fast startup with raw text mapping to reduce memory overhead
|
|
- **🚀 MLX Support** - Ultra-fast recompute/build with quantized embedding models, accelerating building and search ([minimal example](test/build_mlx_index.py))
|
|
|
|
## 🎨 Developer Experience
|
|
|
|
- **Simple Python API** - Get started in minutes
|
|
- **Extensible backend system** - Easy to add new algorithms
|
|
- **Comprehensive examples** - From basic usage to production deployment
|