LEANN/examples at 1f6c7f2f5ab432828283d87f4018cdf342461c64 - LEANN - Gitea: Git with a cup of tea

Files

T

History

Andy Lee 274bbb19ea feat: Add chunk-size parameters and improve file type filtering

- Add --chunk-size and --chunk-overlap parameters to all RAG examples
- Preserve original default values for each data source:
  - Document: 256/128 (optimized for general documents)
  - Email: 256/25 (smaller overlap for email threads)
  - Browser: 256/128 (standard for web content)
  - WeChat: 192/64 (smaller chunks for chat messages)
- Make --file-types optional filter instead of restriction in document_rag
- Update README to clarify interactive mode and parameter usage
- Fix LLM default model documentation (gpt-4o, not gpt-4o-mini)

2025-07-29 18:31:56 -07:00

..

fix ruff errors and formatting

2025-07-27 02:22:54 -07:00

fix: resolve all ruff linting errors and add lint CI check

2025-07-26 22:38:13 -07:00

fix ruff errors and formatting

2025-07-27 02:22:54 -07:00

base_rag_example.py

feat: Address review comments

2025-07-29 16:59:24 -07:00

browser_rag.py

feat: Add chunk-size parameters and improve file type filtering

2025-07-29 18:31:56 -07:00

compare_faiss_vs_leann.py

fix: resolve all ruff linting errors and add lint CI check

2025-07-26 22:38:13 -07:00

document_rag.py

feat: Add chunk-size parameters and improve file type filtering

2025-07-29 18:31:56 -07:00

document_search.py

fix ruff errors and formatting

2025-07-27 02:22:54 -07:00

email_rag.py

feat: Add chunk-size parameters and improve file type filtering

2025-07-29 18:31:56 -07:00

faiss_only.py

fix: resolve all ruff linting errors and add lint CI check

2025-07-26 22:38:13 -07:00

mail_reader_llamaindex.py

fix ruff errors and formatting

2025-07-27 02:22:54 -07:00

multi_vector_aggregator.py

fix ruff errors and formatting

2025-07-27 02:22:54 -07:00

openai_hnsw_example.py

fix: resolve all ruff linting errors and add lint CI check

2025-07-26 22:38:13 -07:00

resue_index.py

fix: resolve all ruff linting errors and add lint CI check

2025-07-26 22:38:13 -07:00

run_evaluation.py

fix: resolve all ruff linting errors and add lint CI check

2025-07-26 22:38:13 -07:00

simple_demo.py

fix: resolve all ruff linting errors and add lint CI check

2025-07-26 22:38:13 -07:00

wechat_rag.py

feat: Add chunk-size parameters and improve file type filtering

2025-07-29 18:31:56 -07:00