LEANN/apps at 00770aebbb5af5d1952a44bbf856018fb8805bc0 - LEANN - Gitea: Git with a cup of tea

Files

T

History

Yichuan Wang 00770aebbb [Multi-vector]Add timing instrumentation and multi-dataset support for multi-vector… (#161 )

* Add timing instrumentation and multi-dataset support for multi-vector retrieval

- Add timing measurements for search operations (load and core time)
- Increase embedding batch size from 1 to 32 for better performance
- Add explicit memory cleanup with del all_embeddings
- Support loading and merging multiple datasets with different splits
- Add CLI arguments for search method selection (ann/exact/exact-all)
- Auto-detect image field names across different dataset structures
- Print candidate doc counts for performance monitoring

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

* update vidore

* reproduce docvqa results

* reproduce docvqa results and add debug file

---------

Co-authored-by: Claude <noreply@anthropic.com>

2025-12-03 00:55:42 -08:00

..

Feature/imessage rag support (#131 )

2025-10-02 10:40:57 -07:00

metadata reveal for ast-chunking; smart detection of seq length in ollama; auto adjust chunk length for ast to prevent silent truncation (#157 )

2025-11-08 17:37:31 -08:00

Feature/imessage rag support (#131 )

2025-10-02 10:40:57 -07:00

refactor: Unify examples interface with BaseRAGExample (#12 )

2025-08-03 23:06:24 -07:00

[Fix] Enable AST chunking when installed (package chunking utils) (#101 )

2025-09-17 18:44:00 -07:00

Feature/imessage rag support (#131 )

2025-10-02 10:40:57 -07:00

multimodal/vision-based-pdf-multi-vector

[Multi-vector]Add timing instrumentation and multi-dataset support for multi-vector… (#161 )

2025-12-03 00:55:42 -08:00

semantic_file_search

Implement FileSystem wide semantic file search engine with temporal awareness using LEANN. (#103 )

2025-10-05 17:26:48 -07:00

Fix CI: improve security fix and add link checker configuration

2025-11-13 13:05:00 -08:00

feat: Add MCP integration support for Slack and Twitter (#134 )

2025-10-07 02:18:32 -07:00

__init__.py

refactor: Unify examples interface with BaseRAGExample (#12 )

2025-08-03 23:06:24 -07:00

base_rag_example.py

fixing chunking token issues within limit for embedding models

2025-10-31 17:15:00 -07:00

browser_rag.py

fix bug introduce in #58

2025-08-22 02:35:09 -07:00

chatgpt_rag.py

Feature/imessage rag support (#131 )

2025-10-02 10:40:57 -07:00

claude_rag.py

Feature/imessage rag support (#131 )

2025-10-02 10:40:57 -07:00

code_rag.py

Add AST-aware code chunking for better code understanding (#58 )

2025-08-19 23:35:31 -07:00

document_rag.py

Add AST-aware code chunking for better code understanding (#58 )

2025-08-19 23:35:31 -07:00

email_rag.py

fix bug introduce in #58

2025-08-22 02:35:09 -07:00

imessage_rag.py

Feature/imessage rag support (#131 )

2025-10-02 10:40:57 -07:00

slack_rag.py

Fix/twitter bookmarks anchor link (#143 )

2025-10-19 11:47:29 -07:00

twitter_rag.py

feat: Add MCP integration support for Slack and Twitter (#134 )

2025-10-07 02:18:32 -07:00

wechat_rag.py

refactor: Unify examples interface with BaseRAGExample (#12 )

2025-08-03 23:06:24 -07:00