Files
LEANN/docs/features.md
yichuan520030910320 170f7644e9 simplify readme
2025-07-25 02:11:02 -07:00

1.5 KiB

Detailed Features

🔥 Core Features

  • 🔄 Real-time Embeddings - Eliminate heavy embedding storage with dynamic computation using optimized ZMQ servers and highly optimized search paradigm (overlapping and batching) with highly optimized embedding engine
  • 📈 Scalable Architecture - Handles millions of documents on consumer hardware; the larger your dataset, the more LEANN can save
  • 🎯 Graph Pruning - Advanced techniques to minimize the storage overhead of vector search to a limited footprint
  • 🏗️ Pluggable Backends - DiskANN, HNSW/FAISS with unified API

🛠️ Technical Highlights

  • 🔄 Recompute Mode - Highest accuracy scenarios while eliminating vector storage overhead
  • Zero-copy Operations - Minimize IPC overhead by transferring distances instead of embeddings
  • 🚀 High-throughput Embedding Pipeline - Optimized batched processing for maximum efficiency
  • 🎯 Two-level Search - Novel coarse-to-fine search overlap for accelerated query processing (optional)
  • 💾 Memory-mapped Indices - Fast startup with raw text mapping to reduce memory overhead
  • 🚀 MLX Support - Ultra-fast recompute/build with quantized embedding models, accelerating building and search (minimal example)

🎨 Developer Experience

  • Simple Python API - Get started in minutes
  • Extensible backend system - Easy to add new algorithms
  • Comprehensive examples - From basic usage to production deployment