LEANN

Author	SHA1	Message	Date
Andy Lee	6b399ad8d2	fix: launch another port when updating	2025-09-30 13:00:00 -07:00
Andy Lee	16f35aa067	Update faiss for batch distances calc & caching when updating	2025-09-30 12:42:40 -07:00
Andy Lee	ab9c6bd69e	Fix update. Should launch embedding server first (#130 ) * fix: set ntotal for storage as well * fix: launch embedding server before adding	2025-09-30 00:58:17 -07:00
yichuan520030910320	e2b37914ce	add dynamic add test	2025-09-30 00:48:46 -07:00
Andy Lee	e588100674	fix: set ntotal for storage as well (#129 )	2025-09-29 20:43:16 -07:00
Andy Lee	fecee94af1	Experiments (#68 ) * feat: finance bench * docs: results * chore: ignroe data README * feat: fix financebench * feat: laion, also required idmaps support * style: format * style: format * fix: resolve ruff linting errors - Remove unused variables in benchmark scripts - Rename unused loop variables to follow convention * feat: enron email bench * experiments for running DiskANN & BM25 on Arch 4090 * style: format * chore(ci): remove paru-bin submodule and config to fix checkout --recurse-submodules * docs: data * docs: data updated * fix: as package * fix(ci): only run pre-commit * chore: use http url of astchunk; use group for some dev deps * fix(ci): should checkout modules as well since `uv sync` checks * fix(ci): run with lint only * fix: find links to install wheels available * CI: force local wheels in uv install step * CI: install local wheels via file paths * CI: pick wheels matching current Python tag * CI: handle python tag mismatches for local wheels * CI: use matrix python venv and set macOS deployment target * CI: revert install step to match main * CI: use uv group install with local wheel selection * CI: rely on setup-uv for Python and tighten group install * CI: install build deps with uv python interpreter * CI: use temporary uv venv for build deps * CI: add build venv scripts path for wheel repair	2025-09-24 11:19:04 -07:00
yichuan520030910320	c8aa063f48	merge main	2025-09-23 23:21:53 -07:00
yichuan520030910320	576beb13db	add doc about multimodal	2025-09-23 23:21:03 -07:00
Andy Lee	63c7b0c8a3	Fix restart embedding server when passages change (#117 ) * fix: restart embedding server when passages change * fix: restore python 3.9 typing compatibility	2025-09-23 22:28:36 -07:00
Andy Lee	ec889f7ef4	Allow 'leann ask' to accept a positional question (#116 )	2025-09-23 21:18:57 -07:00
Andy Lee	db7ba27ff6	feat: Add support for configurable local LLM endpoints (#115 ) * feat: support configurable local llm endpoints * docs	2025-09-23 15:12:13 -07:00
Andy Lee	5f7806e16f	Introducing dynamic index update (#108 ) * feat: Add GitHub PR and issue templates for better contributor experience * simplify: Make templates more concise and user-friendly * fix: enable is_compact=False, is_recompute=True * feat: update when recompute * test * fix: real recompute * refactor * fix: compare with no-recompute * fix: test	2025-09-21 22:56:27 -07:00
yichuan-w	d034e2195b	fix build from source in diskann	2025-09-20 19:52:29 +00:00
yichuan520030910320	43894ff605	update submodule	2025-09-19 17:03:55 -07:00
Andy Lee	e93c0dec6f	[Fix] Enable AST chunking when installed (package chunking utils) (#101 ) * fix(core): package chunking utils for AST chunking; re-export in apps; CLI imports packaged utils * style * chore: fix ruff warnings (RUF059, F401) * style	2025-09-17 18:44:00 -07:00
GitHub Actions	c5a29f849a	chore: release v0.3.4	2025-09-16 20:45:22 +00:00
Yichuan Wang	3b8dc6368e	Ast fork (#92 )	2025-09-08 18:43:31 -07:00
Aiden Huang	e309f292de	docs(mcp): add root llms.txt for MCP discovery; update MCP README to reference it; refs #76 (#91 )	2025-09-07 14:39:58 -07:00
AWS Mcleod	0d9f92ea0f	Add grep search functionality - Issue #86 (#87 ) * Add grep search functionality to LeannSearcher - Add use_grep parameter to search method - Implement grep-based search on .jsonl files - Add fallback Python regex search - Support same SearchResult format as semantic search Addresses issue #86 * fix: resolve linting errors * docs: add grep search example * docs: add grep search to README examples * refactor: remove regex fallback, move grep example to features section * docs: add grep search to Advanced Features with comprehensive guide	2025-09-05 13:48:07 -07:00
GitHub Actions	b0b353d279	chore: release v0.3.3	2025-09-02 21:29:56 +00:00
Andy Lee	4dffdfedbe	feat: Add ARM64 Linux wheel support for leann-backend-hnsw (#83 ) * feat: Add ARM64 Linux wheel support for leann-backend-hnsw * fix: Use OpenBLAS for ARM64 Linux builds instead of Intel MKL * fix: Configure Faiss with SVE optimization for ARM64 builds - Set FAISS_OPT_LEVEL to "sve" for ARM64 architecture - Disable x86-specific SIMD instructions (AVX2, AVX512, SSE4.1) - Use ARM64-native SVE optimization as per Faiss conda build scripts - Add architecture detection and proper configuration messages Fixes compilation error: "xmmintrin.h: No such file or directory" on ubuntu-24.04-arm runners. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: Apply ARM64 compatibility fix directly to Faiss submodule - Modify faiss/impl/pq.cpp to use x86-specific preprocessor conditions - Remove patch file approach in favor of direct submodule modification - Update CMakeLists.txt to reflect the submodule changes - Fixes ARM64 Linux compilation by preventing x86 SIMD header inclusion This resolves the "xmmintrin.h: No such file or directory" error when building ARM64 Linux wheels for Docker compatibility. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * chore: Update Faiss submodule to include ARM64 compatibility fix - Points to commit ed96ff7d with x86-specific preprocessor conditions - Enables successful ARM64 Linux wheel builds 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * retrigger ci * fix: Use different optimization levels for ARM64 based on platform - Use SVE optimization only for ARM64 Linux - Use generic optimization for ARM64 macOS to avoid clang SVE issues - Fixes macOS ARM64 compilation errors with SVE instructions 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * feat: Update DiskANN submodule with OpenBLAS fallback support - Points to commit 5c396c4 with ARM64 Linux OpenBLAS support - Enables DiskANN to build on ARM64 Linux using standard BLAS libraries - Resolves Intel MKL dependency issues for Docker ARM64 deployments 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: Update DiskANN submodule with ZeroMQ polling configuration - Points to commit 3a1016e with explicit polling method setup - Resolves ZeroMQ autodetection issues on ARM64 Linux - Ensures stable cross-platform ZeroMQ builds 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * retrigger ci * fix: Update DiskANN submodule with ARM64 compiler flags fix - Points to commit a0dc600 with architecture-specific compiler flags - Removes x86 SIMD flags on ARM64 Linux to fix compilation errors - Enables successful ARM64 Linux wheel builds 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: Update DiskANN submodule with ARM64 compiler flags fix - Points to commit 0921664 with architecture-specific compiler flags - Removes x86 SIMD flags on ARM64 Linux to fix compilation errors - Enables successful ARM64 Linux wheel builds 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * retrigger ci * fix: Update DiskANN submodule with cross-platform prefetch support - Points to commit 39192d6 with unified prefetch macros - Replaces all Intel-specific _mm_prefetch calls with cross-platform macros - Enables ARM64 Linux compatibility while maintaining x86 performance - Resolves all remaining compilation errors for ARM64 builds 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: Update DiskANN submodule with corrected ARM64 compatibility fixes - Points to commit 3cb87a8 with proper x86 platform detection - Includes ARM64 fallback for AVXDistanceInnerProductFloat function - Resolves all remaining '__m256 was not declared' compilation errors - Enables successful ARM64 Linux wheel builds for Docker compatibility 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: Update DiskANN submodule with template type handling fix - Points to commit d396bc3 with corrected template type handling - Fixes DistanceInnerProduct template instantiation for int8_t/uint8_t types - Resolves 'cannot convert const signed char* to const float' error - Completes ARM64 Linux compilation compatibility 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> fix: Update DiskANN submodule with DistanceFastL2::norm template fix - Points to commit 69d9a99 with corrected template type handling - Fixes DistanceFastL2::norm template instantiation for int8_t/uint8_t types - Resolves another 'cannot convert const signed char* to const float' error - Continues ARM64 Linux compilation compatibility improvements 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> fix: Update DiskANN submodule with LAPACKE header detection - Points to commit 64a9e01 with LAPACKE header path configuration - Adds pkg-config based detection for LAPACKE include directories - Resolves 'lapacke.h: No such file or directory' compilation error - Completes OpenBLAS integration for ARM64 Linux builds 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: Update DiskANN submodule with enhanced LAPACKE header detection - Points to commit 18d0721 with fallback LAPACKE header search paths - Checks multiple standard locations for lapacke.h on various systems - Improves ARM64 Linux compatibility for OpenBLAS builds - Should resolve 'lapacke.h: No such file or directory' errors 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: Add liblapacke-dev package for ARM64 Linux builds - Add liblapacke-dev to ARM64 dependencies alongside libopenblas-dev - Provides lapacke.h header file needed for LAPACK C interface - Fixes 'lapacke.h: No such file or directory' compilation error - Enables complete OpenBLAS + LAPACKE support for ARM64 wheel builds 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: Update DiskANN submodule with cosine_similarity.h x86 intrinsics fix - Points to commit dbb17eb with corrected conditional compilation - Fixes immintrin.h inclusion for ARM64 compatibility in cosine_similarity.h - Resolves 'immintrin.h: No such file or directory' error - Continues systematic ARM64 Linux compilation fixes 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: Update DiskANN submodule with LAPACKE library linking fix - Points to commit 19f9603 with explicit LAPACKE library discovery and linking - Resolves 'undefined symbol: LAPACKE_sgesdd' runtime error on ARM64 Linux - Completes ARM64 Linux wheel build compatibility for Docker deployments 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-09-02 14:27:06 -07:00
Yichuan Wang	d41e467df9	[CLI] More robust leann list and leann build (#84 ) * chore(submodule): bump faiss to latest storage-efficient build * [chore] add slack to share use case * [cli] better gitignore / better leann list * [cli] fix # 81	2025-09-01 18:36:27 -07:00
yichuan520030910320	e83a671918	chore(submodule): bump faiss to latest storage-efficient build	2025-09-01 13:31:12 -07:00
Gabriel Dehan	31b4973141	Metadata filtering feature (#75 ) * Metadata filtering initial version * Metadata filtering initial version * Fixes linter issues * Cleanup code * Clean up and readme * Fix after review * Use UV in example * Merge main into feature/metadata-filtering	2025-08-20 19:57:56 -07:00
Yichuan Wang	dde2221513	[EXP] Update the benchmark code (#71 ) * chore(hnsw): reorder imports to satisfy ruff I001 * chore: sync changes; fix Ruff import order; update examples, benchmarks, and dependencies - Fix import order in packages/leann-backend-hnsw/leann_backend_hnsw/hnsw_backend.py (Ruff I001) - Update benchmarks/run_evaluation.py - Update apps/base_rag_example.py and leann-core API usage - Add benchmarks/data/README.md - Update uv.lock - Misc cleanup - Note: added paru-bin as an embedded git repo; consider making it a submodule (git rm --cached paru-bin) if unintended * chore: remove unintended embedded repo paru-bin and ignore it Fix CI: avoid missing .gitmodules entry by removing gitlink and adding to .gitignore. * ci: retrigger after removing unintended gitlink (paru-bin) * feat(benchmarks): add --batch-size option and plumb through to HNSW search (default 0) * feat(hnsw): add batch_size to LeannSearcher.search and LeannChat.ask; forward only for HNSW backend * chore(logging): surface recompute and batching params; enable INFO logging in benchmark * feat(embeddings): add optional manual tokenization path (HF tokenizer+model) with mean pooling; default remains SentenceTransformer.encode * fix micro bench and fix pre commit * update readme --------- Co-authored-by: yichuan-w <yichuan-w@users.noreply.github.com>	2025-08-20 17:31:46 -07:00
Andy Lee	6d11e86e71	Run Evaluation RPJ Wiki on Arch Linux (#74 ) * chore: ignore benchmark data * perf: avoid merging offset dicts for lower mem usage * style: format * docs: rpj_wiki	2025-08-20 12:25:54 -07:00
Gabriel Dehan	13bb561aad	Add AST-aware code chunking for better code understanding (#58 ) * feat(core): Add AST-aware code chunking with astchunk integration This PR introduces intelligent code chunking that preserves semantic boundaries (functions, classes, methods) for better code understanding in RAG applications. Key Features: - AST-aware chunking for Python, Java, C#, TypeScript files - Graceful fallback to traditional chunking for unsupported languages - New specialized code RAG application for repositories - Enhanced CLI with --use-ast-chunking flag - Comprehensive test suite with integration tests Technical Implementation: - New chunking_utils.py module with enhanced chunking logic - Extended base RAG framework with AST chunking arguments - Updated document RAG with --enable-code-chunking flag - CLI integration with proper error handling and fallback Benefits: - Better semantic understanding of code structure - Improved search quality for code-related queries - Maintains backward compatibility with existing workflows - Supports mixed content (code + documentation) seamlessly Dependencies: - Added astchunk and tree-sitter parsers to pyproject.toml - All dependencies are optional - fallback works without them Testing: - Comprehensive test suite in test_astchunk_integration.py - Integration tests with document RAG - Error handling and edge case coverage Documentation: - Updated README.md with AST chunking highlights - Added ASTCHUNK_INTEGRATION.md with complete guide - Updated features.md with new capabilities * Refactored chunk utils * Remove useless import * Update README.md * Update apps/chunking/utils.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update apps/code_rag.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Fix issue * apply suggestion from @Copilot Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Fixes after pr review * Fix tests not passing * Fix linter error for documentation files * Update .gitignore with unwanted files --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: Andy Lee <andylizf@outlook.com>	2025-08-19 23:35:31 -07:00
GitHub Actions	0174ba5571	chore: release v0.3.2	2025-08-19 09:41:40 +00:00
Andy Lee	03af82d695	fix: leann mcp search cwd & interactive issues (#72 )	2025-08-19 02:27:06 -07:00
GitHub Actions	738f1dbab8	chore: release v0.3.1	2025-08-19 05:56:45 +00:00
yichuan520030910320	37d990d51c	[feature] fix cli	2025-08-18 22:55:43 -07:00
Andy Lee	46905e0687	feat: Improve DiskANN cross-platform compatibility and add Arch Linux support (#66 ) * feat: Enhance CLI with improved list and smart remove commands ## ✨ New Features ### 🏠 Enhanced `leann list` command - Better UX: Current project shown first with clear separation - Visual improvements: Icons (🏠/📂), better formatting, size info - Smart guidance: Context-aware usage examples and getting started tips ### 🛡️ Smart `leann remove` command - Safety first: Always shows ALL matching indexes across projects - Intelligent handling: - Single match: Clear location display with cross-project warnings - Multiple matches: Interactive selection with final confirmation - Prevents accidents: No more deleting wrong indexes due to name conflicts - User-friendly: 'c' to cancel, clear visual hierarchy, detailed info ### 🔧 Technical improvements - Clean logging: Hide debug messages for better CLI experience - Comprehensive search: Always scan all projects for transparency - Error handling: Graceful handling of edge cases and user input ## 🎯 Impact - Safer: Eliminates risk of accidental index deletion - Clearer: Users always know what they're operating on - Smarter: Automatic detection and handling of common scenarios 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * chore: vscode ruff, and format * fix: Update DiskANN submodule with MKL linking improvements Updates DiskANN submodule to include fix for MKL linking issues: - Replaces global link_libraries() with target-specific linking - Uses dynamic MKL linking (mkl_rt) for better cross-platform compatibility - Prevents MKL contamination of unrelated targets (like zlib tests) - Resolves build failures on strict linkers (Arch Linux) while maintaining Ubuntu compatibility DiskANN commit: c593831 - fix: Replace global MKL linking with target-specific approach 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * chore: all linux deps * fix: Update Intel MKL download link to avoid 403 error - Replace problematic Intel download URL that returns 403 Forbidden - Use general Intel oneAPI MKL page instead of specific download parameters - This fixes the lychee link checker CI failure 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: Configure lychee to use browser User-Agent for Intel links - Replace domain exclusion with browser User-Agent to properly check Intel links - Intel website blocks automated tools but allows browser-like requests - This enables proper link validation while avoiding 403 Forbidden errors 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: Use curl User-Agent for lychee link checking Intel website has specific anti-bot logic: - Blocks browser User-Agents (returns 403) - Blocks lychee default User-Agent (returns 403) - Allows curl User-Agent (returns 200) This enables proper link validation for Intel documentation. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-08-16 14:42:20 -07:00
Andy Lee	838ade231e	🔗 Auto-register apps: Universal index discovery (#64 ) * feat: Enhance CLI with improved list and smart remove commands ## ✨ New Features ### 🏠 Enhanced `leann list` command - Better UX: Current project shown first with clear separation - Visual improvements: Icons (🏠/📂), better formatting, size info - Smart guidance: Context-aware usage examples and getting started tips ### 🛡️ Smart `leann remove` command - Safety first: Always shows ALL matching indexes across projects - Intelligent handling: - Single match: Clear location display with cross-project warnings - Multiple matches: Interactive selection with final confirmation - Prevents accidents: No more deleting wrong indexes due to name conflicts - User-friendly: 'c' to cancel, clear visual hierarchy, detailed info ### 🔧 Technical improvements - Clean logging: Hide debug messages for better CLI experience - Comprehensive search: Always scan all projects for transparency - Error handling: Graceful handling of edge cases and user input ## 🎯 Impact - Safer: Eliminates risk of accidental index deletion - Clearer: Users always know what they're operating on - Smarter: Automatic detection and handling of common scenarios 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * chore: vscode ruff, and format --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-08-16 11:50:25 -07:00
Andy Lee	da6540decd	feat: Enhance CLI with improved list and smart remove commands (#63 ) - Better UX: Current project shown first with clear separation - Visual improvements: Icons (🏠/📂), better formatting, size info - Smart guidance: Context-aware usage examples and getting started tips - Safety first: Always shows ALL matching indexes across projects - Intelligent handling: - Single match: Clear location display with cross-project warnings - Multiple matches: Interactive selection with final confirmation - Prevents accidents: No more deleting wrong indexes due to name conflicts - User-friendly: 'c' to cancel, clear visual hierarchy, detailed info - Clean logging: Hide debug messages for better CLI experience - Comprehensive search: Always scan all projects for transparency - Error handling: Graceful handling of edge cases and user input - Safer: Eliminates risk of accidental index deletion - Clearer: Users always know what they're operating on - Smarter: Automatic detection and handling of common scenarios	2025-08-15 23:49:47 -07:00
Andy Lee	6bde28584b	feat: Add Google Gemini API support for chat and embeddings (#57 ) - Add GeminiChat class with gemini-2.5-flash model support - Add compute_embeddings_gemini function with text-embedding-004 model - Update get_llm factory to support "gemini" type - Update API documentation to include gemini embedding mode - Support temperature, max_tokens, top_p parameters for Gemini chat - Support batch embedding processing with progress bars - Add proper error handling and API key validation	2025-08-15 21:54:11 -07:00
GitHub Actions	9a1e4652ca	chore: release v0.3.0	2025-08-16 00:54:47 +00:00
Andy Lee	14e84d9e2d	fix(core): skip empty/invalid chunks before embedding; guard OpenAI embeddings (#55 ) Avoid 400 errors from OpenAI when chunker yields empty strings by filtering invalid texts in LeannBuilder.build_index. Add validation fail-fast in OpenAI embedding path to surface upstream issues earlier. Keeps passages and embeddings aligned during build. Refs #54	2025-08-15 17:53:53 -07:00
Yichuan Wang	2dcfca19ff	style: apply ruff format (#56 )	2025-08-15 17:48:33 -07:00
Yichuan Wang	bee2167ee3	docs: update READMEs (MCP docs + conclusion polish) - Polish conclusion in packages/leann-mcp/README.md - Sync root README wording and links	2025-08-15 17:21:23 -07:00
yichuan520030910320	ef980d70b3	[MCP]update MCP of claude code	2025-08-15 14:29:59 -07:00
Andy Lee	db3c63c441	Docs/Core: Low-Resource Setups, SkyPilot Option, and No-Recompute (#45 ) * docs: add SkyPilot template and instructions for running embeddings/index build on cloud GPU * docs: add low-resource note in README; point to config guide; suggest OpenAI embeddings, SkyPilot remote build, and --no-recompute * docs: consolidate low-resource guidance into config guide; README points to it * cli: add --no-recompute and --no-recompute-embeddings flags; docs: clarify HNSW requires --no-compact when disabling recompute * docs: dedupe recomputation guidance; keep single Low-resource setups section * sky: expand leann-build.yaml with configurable params and flags (backend, recompute, compact, embedding options) * hnsw: auto-disable compact when --no-recompute is used; docs: expand SkyPilot with -e overrides and copy-back example * docs+sky: simplify SkyPilot flow (auto-build on launch, rsync copy-back); clarify HNSW auto non-compact when no-recompute * feat: auto compact for hnsw when recompute * reader: non-destructive portability (relative hints + fallback); fix comments; sky: refine yaml * cli: unify flags to --recompute/--no-recompute for build/search/ask; docs: update references * chore: remove * hnsw: move pruned/no-recompute assertion into backend; api: drop global assertion; docs: will adjust after benchmarking * cli: use argparse.BooleanOptionalAction for paired flags (--recompute/--compact) across build/search/ask * docs: a real example on recompute * benchmarks: fix and extend HNSW+DiskANN recompute vs no-recompute; docs: add fresh numbers and DiskANN notes * benchmarks: unify HNSW & DiskANN into one clean script; isolate groups, fixed ports, warm-up, param complexity * docs: diskann recompute * core: auto-cleanup for LeannSearcher/LeannChat (__enter__/__exit__/__del__); ensure server terminate/kill robustness; benchmarks: use searcher.cleanup(); docs: suggest uv run * fix: hang on warnings * docs: boolean flags * docs: leann help	2025-08-15 12:03:19 -07:00
yichuan520030910320	00eeadb9dd	upd pkg	2025-08-14 14:39:45 -07:00
yichuan520030910320	42c8370709	add chunk size in leann build& fix batch size in oai& docs	2025-08-14 13:14:14 -07:00
Andy Lee	fafdf8fcbe	feat(core,diskann): robust embedding server (no-hang) + DiskANN fast mode (graph partition) (#29 ) * feat: Add graph partition support for DiskANN backend - Add GraphPartitioner class for advanced graph partitioning - Add partition_graph_simple function for easy-to-use partitioning - Add pybind11 dependency for C++ executable building - Update __init__.py to export partition functions - Include test scripts for partition functionality The partition functionality allows optimizing disk-based indices for better search performance and memory efficiency. * chore: Update DiskANN submodule to latest with graph partition tools - Update DiskANN submodule to commit b2dc4ea - Includes graph partition tools and CMake integration - Enables graph partitioning functionality in DiskANN backend * merge * ruff * add a path related fix * fix: always use relative path in metadata * docs: tool cli install * chore: more data * fix: diskann building and partitioning * tests: diskann and partition * docs: highlight diskann readiness and add performance comparison * docs: add ldg-times parameter for diskann graph locality optimization * fix: update pre-commit ruff version and format compliance * fix: format test files with latest ruff version for CI compatibility * fix: pin ruff version to 0.12.7 across all environments - Pin ruff==0.12.7 in pyproject.toml dev dependencies - Update CI to use exact ruff version instead of latest - Add comments explaining version pinning rationale - Ensures consistent formatting across local, CI, and pre-commit * fix: use uv tool install for ruff instead of uv pip install - uv tool install is the correct way to install CLI tools like ruff - uv pip install --system is for Python packages, not tools * debug: add detailed logging for CI path resolution debugging - Add logging in DiskANN embedding server to show metadata_file_path - Add debug logging in PassageManager to trace path resolution - This will help identify why CI fails to find passage files * fix: force install local wheels in CI to prevent PyPI version conflicts - Change from --find-links to direct wheel installation with --force-reinstall - This ensures CI uses locally built packages with latest source code - Prevents uv from using PyPI packages with same version number but old code - Fixes CI test failures where old code (without metadata_file_path) was used Root cause: CI was installing leann-backend-diskann v0.2.1 from PyPI instead of the locally built wheel with same version number. * debug: add more CI diagnostics for DiskANN module import issue - Check wheel contents before and after auditwheel repair - Verify _diskannpy module installation after pip install - List installed package directory structure - Add explicit platform tag for auditwheel repair This helps diagnose why ImportError: cannot import name '_diskannpy' occurs * fix: remove invalid --plat argument from auditwheel repair - Remove '--plat linux_x86_64' which is not a valid platform tag - Let auditwheel automatically determine the correct platform - Based on CI output, it will use manylinux_2_35_x86_64 This was causing auditwheel repair to fail, preventing proper wheel repair * fix: ensure CI installs correct Python version wheel packages - Use --find-links with --no-index to let uv select correct wheel - Prevents installing wrong Python version wheel (e.g., cp310 for Python 3.11) - Fixes ImportError: _diskannpy.cpython-310-x86_64-linux-gnu.so in Python 3.11 The issue was that .whl glob matched all Python versions, causing uv to potentially install a cp310 wheel in a Python 3.11 environment. fix: ensure venv uses correct Python version from matrix - Explicitly specify Python version when creating venv with uv - Prevents mismatch between build Python (e.g., 3.10) and test Python - Fixes: _diskannpy.cpython-310-x86_64-linux-gnu.so in Python 3.11 error The issue: uv venv was defaulting to Python 3.11 regardless of matrix version * fix: resolve dependency issues in CI package installation - Ubuntu: Install all packages from local builds with --no-index - macOS: Install core packages from PyPI, backends from local builds - Remove --no-index for macOS backend installation to allow dependency resolution - Pin versions when installing from PyPI to ensure consistency Fixes error: 'leann-core was not found in the provided package locations' * fix: Python 3.9 compatibility - replace Union type syntax - Replace 'int \| None' with 'Optional[int]' everywhere - Replace 'subprocess.Popen \| None' with 'Optional[subprocess.Popen]' - Add Optional import to all affected files - Update ruff target-version from py310 to py39 - The '\|' syntax for Union types was introduced in Python 3.10 (PEP 604) Fixes TypeError: unsupported operand type(s) for \|: 'type' and 'NoneType' * ci: build all packages on all platforms; install from local wheels only - Build leann-core and leann on macOS too - Install all packages via --find-links and --no-index across platforms - Lower macOS MACOSX_DEPLOYMENT_TARGET to 12.0 for wider compatibility This ensures consistency and avoids PyPI drift while improving macOS compatibility. * ci: allow resolving third-party deps from index; still prefer local wheels for our packages - Remove --no-index so numpy/scipy/etc can be resolved on Python 3.13 - Keep --find-links to force our packages from local dist Fixes: dependency resolution failure on Ubuntu Python 3.13 (numpy missing) * ci(macOS): set MACOSX_DEPLOYMENT_TARGET back to 13.3 - Fix build failure: 'sgesdd_' only available on macOS 13.3+ - Keep other CI improvements (local builds, find-links installs) * fix(py39): replace union type syntax in chat.py - validate_model_and_suggest: str \| None -> Optional[str] - OpenAIChat.__init__: api_key: str \| None -> Optional[str] - get_llm: dict[str, Any] \| None -> Optional[dict[str, Any]] Ensures Python 3.9 compatibility for CI macOS 3.9. * style: organize imports per ruff; finish py39 Optional changes - Fix import ordering in embedding servers and graph_partition_simple - Remove duplicate Optional import - Complete Optional[...] replacements * fix(py39): replace remaining '\| None' in diskann graph_partition (module-level function) * fix(py39): remove zip(strict=...) usage in api; Python 3.9 compatibility * style: organize imports; fix process-group stop for embedding server * chore: keep embedding server stdout/stderr visible; still use new session and pg-kill on stop * fix: add timeout to final wait() in stop_server to prevent infinite hang * fix: prevent hang in CI by flushing print statements and redirecting embedding server output - Add flush=True to all print statements in convert_to_csr.py to prevent buffer deadlock - Redirect embedding server stdout/stderr to DEVNULL in CI environment (CI=true) - Fix timeout in embedding_server_manager.stop_server() final wait call * fix: resolve CI hanging by removing problematic wait() in stop_server * fix: remove hardcoded paths from MCP server and documentation * feat: add CI timeout protection for tests * fix: skip OpenAI test in CI to avoid failures and API costs - Add CI skip for test_document_rag_openai - Test was failing because it incorrectly used --llm simulated which isn't supported by document_rag.py * feat: add simulated LLM option to document_rag.py - Add 'simulated' to the LLM choices in base_rag_example.py - Handle simulated case in get_llm_config() method - This allows tests to use --llm simulated to avoid API costs * feat: add comprehensive debugging capabilities with tmate integration 1. Tmate SSH Debugging: - Added manual workflow_dispatch trigger with debug_enabled option - Integrated mxschmitt/action-tmate@v3 for SSH access to CI runner - Can be triggered manually or by adding [debug] to commit message - Detached mode with 30min timeout, limited to actor only - Also triggers on test failure when debug is enabled 2. Enhanced Pytest Output: - Added --capture=no to see real-time output - Added --log-cli-level=DEBUG for maximum verbosity - Added --tb=short for cleaner tracebacks - Pipe output to tee for both display and logging - Show last 20 lines of output on completion 3. Environment Diagnostics: - Export PYTHONUNBUFFERED=1 for immediate output - Show Python/Pytest versions at start - Display relevant environment variables - Check network ports before/after tests 4. Diagnostic Script: - Created scripts/diagnose_hang.sh for comprehensive system checks - Shows processes, network, file descriptors, memory, ZMQ status - Automatically runs on timeout for detailed debugging info This allows debugging CI hangs via SSH when needed while providing extensive logging by default. * fix: add diagnostic script (force add to override .gitignore) The diagnose_hang.sh script needs to be in git for CI to use it. Using -f to override .sh rule in .gitignore. test: investigate hanging [debug] * fix: move tmate debug session inside pytest step to avoid hanging The issue was that tmate was placed before pytest step, but the hang occurs during pytest execution. Now tmate starts inside the test step and provides connection info before running tests. * debug: trigger tmate debug session [debug] * fix: debug variable values and add commit message [debug] trigger - Add debug output to show variable values - Support both manual trigger and [debug] in commit message * fix: force debug mode for investigation branch - Auto-enable debug mode for debug/clean-state-investigation branch - Add more debug info to troubleshoot trigger issues - This ensures tmate will start regardless of trigger method * fix: use github.head_ref for PR branch detection For pull requests, github.ref is refs/pull/N/merge, but github.head_ref contains the actual branch name. This should fix debug mode detection. * fix: FORCE debug mode on - no more conditions Just always enable debug mode on this branch. We need tmate to work for investigation! * fix: improve tmate connection info retrieval - Add proper wait and retry logic for tmate initialization - Tmate needs time to connect to servers before showing SSH info - Try multiple times with delays to get connection details * fix: ensure OpenMP is found during DiskANN build on macOS - Add OpenMP environment variables directly in build step - Should fix the libomp.dylib not found error on macOS-14 * fix: simplify macOS OpenMP configuration to match main branch - Remove complex OpenMP environment variables - Use simplified configuration from working main branch - Remove redundant OpenMP setup in DiskANN build step - Keep essential settings: OpenMP_ROOT, CMAKE_PREFIX_PATH, LDFLAGS, CPPFLAGS 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: revert DiskANN submodule to stable version The debug branch had updated DiskANN submodule to a version with hardcoded OpenMP paths that break macOS 13 builds. This reverts to the stable version used in main branch. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: update faiss submodule to latest stable version 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * refactor: remove upterm/tmate debug code and clean CI workflow - Remove all upterm/tmate SSH debugging infrastructure - Restore clean CI workflow from main branch - Remove diagnostic script that was only for SSH debugging - Keep valuable DiskANN and HNSW backend improvements This provides a clean base to add targeted pytest hang debugging without the complexity of SSH sessions. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * debug: increase timeouts to 600s for comprehensive hang investigation - Increase pytest timeout from 300s to 600s for thorough testing - Increase import testing timeout from 60s to 120s - Allow more time for C++ extension loading (faiss/diskann) - Still provides timeout protection against infinite hangs This gives the system more time to complete imports and tests while still catching genuine hangs that exceed reasonable limits. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: remove debug_enabled parameter from build-and-publish workflow - Remove debug_enabled input parameter that no longer exists in build-reusable.yml - Keep workflow_dispatch trigger but without debug options - Fixes workflow validation error: 'debug_enabled is not defined' 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * debug: fix YAML syntax and add post-pytest cleanup monitoring - Fix Python code formatting in YAML (pre-commit fixed indentation issues) - Add comprehensive post-pytest cleanup monitoring - Monitor for hanging processes after test completion - Focus on teardown phase based on previous hang analysis This addresses the root cause identified: hang occurs after tests pass, likely during cleanup/teardown of C++ extensions or embedding servers. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * debug: add external process monitoring and unbuffered output for precise hang detection * fix * feat: add comprehensive hang detection for pytest CI debugging - Add Python faulthandler integration with signal-triggered stack dumps - Implement periodic stack dumps at 5min and 10min intervals - Add external process monitoring with SIGUSR1 signal on hang detection - Use debug_pytest.py wrapper to capture exact hang location in C++ cleanup - Enhance CPU stability monitoring to trigger precise stack traces This addresses the persistent pytest hanging issue in Ubuntu 22.04 CI by providing detailed stack traces to identify the exact code location where the hang occurs during test cleanup phase. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * CI: move pytest hang-debug script into scripts/ci_debug_pytest.py; sort imports and apply ruff suggestion; update workflow to call the script * fix: improve hang detection to monitor actual pytest process * fix: implement comprehensive solution for CI pytest hangs Key improvements: 1. Replace complex monitoring with simpler process group management 2. Add pytest conftest.py with per-test timeouts and aggressive cleanup 3. Skip problematic tests in CI that cause infinite loops 4. Enhanced cleanup at session start/end and after each test 5. Shorter timeouts (3min per test, 10min total) with better monitoring This should resolve the hanging issues by: - Preventing individual tests from running too long - Automatically cleaning up hanging processes - Skipping known problematic tests in CI - Using process groups for more reliable cleanup 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: correct pytest_runtest_call hook parameter in conftest.py - Change invalid 'puretest' parameter to proper pytest hooks - Replace problematic pytest_runtest_call with pytest_runtest_setup/teardown - This fixes PluginValidationError preventing pytest from starting - Remove unused time import 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: prevent wrapper script from killing itself in cleanup - Remove overly aggressive pattern 'python.pytest' that matched wrapper itself - Add current PID check to avoid killing wrapper process - Add exclusion for wrapper and debug script names - This fixes exit code 137 (SIGKILL) issue where wrapper killed itself Root cause: cleanup function was killing the wrapper process itself, causing immediate termination with no output in CI. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> fix: prevent wrapper from detecting itself as remaining process - Add PID and script name checks in post-test verification - Avoid false positive detection of wrapper process as 'remaining' - This prevents unnecessary cleanup calls that could cause hangs - Root cause: wrapper was trying to clean up itself in verification phase 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: implement graceful shutdown for embedding servers - Replace daemon threads with coordinated shutdown mechanism - Add shutdown_event for thread synchronization - Implement proper ZMQ resource cleanup - Wait for threads to complete before exit - Add ZMQ timeout to allow periodic shutdown checks - Move signal handlers into server functions for proper scope access - Fix protobuf class names and variable references - Simplify resource cleanup to avoid variable scope issues Root cause: Original servers used daemon threads + direct sys.exit(0) which interrupted ZMQ operations and prevented proper resource cleanup, causing hangs during process termination in CI environments. This should resolve the core pytest hanging issue without complex wrappers. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: simplify embedding server process management - Remove start_new_session=True to fix signal handling issues - Simplify termination logic to use standard SIGTERM/SIGKILL - Remove complex process group management that could cause hangs - Add timeout-based cleanup to prevent CI hangs while ensuring proper resource cleanup - Give graceful shutdown more time (5s) since we fixed the server shutdown logic - Remove unused signal import This addresses the remaining process management issues that could cause startup failures and hanging during termination. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: increase CI test timeouts to accommodate model download Analysis of recent CI failures shows: - Model download takes ~12 seconds - Embedding server startup + first search takes additional ~78 seconds - Total time needed: ~90-100 seconds Updated timeouts: - test_readme_basic_example: 90s -> 180s - test_backend_options: 60s -> 150s - test_llm_config_simulated: 75s -> 150s Root cause: Initial model download from huggingface.co in CI environment is slower than local development, causing legitimate timeouts rather than actual hanging processes. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * debug: preserve stderr in CI to debug embedding server startup failures Previous fix revealed the real issue: embedding server fails to start within 120s, not timeout issues. The error was hidden because both stdout and stderr were redirected to DEVNULL in CI. Changes: - Keep stderr output in CI environment for debugging - Only redirect stdout to DEVNULL to avoid buffer deadlock - This will help us see why embedding server startup is failing 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * fix(embedding-server): ensure shutdown-capable ZMQ threads create/bind their own REP sockets and poll with timeouts; fix undefined socket causing startup crash and CI hangs on Ubuntu 22.04 * style(hnsw-server): apply ruff-format after robustness changes * fix(hnsw-server): be lenient to nested [[ids]] for both distance and embedding requests to match client expectations; prevents missing ID lookup when wrapper nests the list * refactor(hnsw-server): remove duplicate legacy ZMQ thread; keep single shutdown-capable server implementation to reduce surface and avoid hangs * ci: simplify test step to run pytest uniformly across OS; drop ubuntu-22.04 wrapper special-casing * chore(ci): remove unused pytest wrapper and debug runner * refactor(diskann): remove redundant graph_partition_simple; keep single partition API (graph_partition) * refactor(hnsw-convert): remove global print override; rely on default flushing in CI * tests: drop custom ci_timeout decorator and helpers; rely on pytest defaults and simplified CI * tests: remove conftest global timeouts/cleanup; keep test suite minimal and rely on simplified CI + robust servers * tests: call searcher.cleanup()/chat.cleanup() to ensure background embedding servers terminate after tests * tests: fix ruff warnings in minimal conftest * core: add weakref.finalize and atexit-based cleanup in EmbeddingServerManager to ensure server stops on interpreter exit/GC * tests: remove minimal conftest to validate atexit/weakref cleanup path * core: adopt compatible running server (record PID) and ensure stop_server() can terminate adopted processes; clear server_port on stop * ci/core: skip compatibility scanning in CI (LEANN_SKIP_COMPAT=1) to avoid slow/hanging process scans; always pick a fresh available port * core: unify atexit to always call _finalize_process (covers both self-launched and adopted servers) * zmq: set SNDTIMEO=1s and LINGER=0 for REP sockets to avoid send blocking during shutdown; reduces CI hang risk * tests(ci): skip DiskANN branch of README basic example on CI to avoid core dump in constrained runners; HNSW still validated * diskann(ci): avoid stdout/stderr FD redirection in CI to prevent aborts from low-level dup2; no-op contextmanager on CI * core: purge dead helpers and comments from EmbeddingServerManager; keep only minimal in-process flow * core: fix lint (remove unused passages_file); keep per-instance reuse only * fix: keep backward-compat --------- Co-authored-by: yichuan520030910320 <yichuan_wang@berkeley.edu> Co-authored-by: Claude <noreply@anthropic.com>	2025-08-14 01:02:24 -07:00
Andy Lee	46565b9249	docs: follows #34 , patch leann backends into tool environment	2025-08-12 17:56:02 -07:00
GitHub Actions	3dad76126a	chore: release v0.2.9	2025-08-12 23:00:12 +00:00
GitHub Actions	609fa62fd5	chore: release v0.2.8	2025-08-12 19:04:51 +00:00
Yichuan Wang	eab13434ef	feat: support multiple input formats for --docs argument (#39 )	2025-08-12 10:30:31 -07:00
yichuan520030910320	b2390ccc14	[Ollama] fix ollama recompute	2025-08-12 00:24:20 -07:00
Andy Lee	e8fca2c84a	fix: detect and report Ollama embedding dimension inconsistency (#37 ) - Add validation for embedding dimension consistency in Ollama mode - Provide clear error message with troubleshooting steps when dimensions mismatch - Fail fast instead of silent fallback to prevent data corruption Fixes #31	2025-08-11 17:41:52 -07:00

1 2 3 4 5

206 Commits