From cefaa2a4cc71c9c3f92d811fb6153a16db0d6b8b Mon Sep 17 00:00:00 2001 From: yichuan520030910320 Date: Mon, 21 Jul 2025 23:13:38 -0700 Subject: [PATCH] upd readme --- README.md | 6 ++++-- 1 file changed, 4 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index 97d4968..fccd5aa 100755 --- a/README.md +++ b/README.md @@ -132,7 +132,9 @@ LEANN supports RAGing a lot of data sources, like .pdf, .txt, .md, and also supp ### Process Any Documents (.pdf, .txt, .md) -Above we showed the Python API, while this CLI script demonstrates the same concepts while directly processing PDFs and documents. +Above we showed the Python API, while this CLI script demonstrates the same concepts while directly processing PDFs and documents, and even any directory that stores your personal files! + +The following scripts use Ollama `qwen3:8b` by default, so you need `ollama pull qwen3:8b` first. For other models: `--llm openai --model gpt-4o` (requires `OPENAI_API_KEY` environment variable) or `--llm hf --model Qwen/Qwen3-4B`. ```bash # Drop your PDFs, .txt, .md files into examples/data/ @@ -143,7 +145,7 @@ source .venv/bin/activate python ./examples/main_cli_example.py ``` -Uses Ollama `qwen3:8b` by default. For other models: `--llm openai --model gpt-4o` (requires `OPENAI_API_KEY` environment variable) or `--llm hf --model Qwen/Qwen3-4B`. + **Works with any text format** - research papers, personal notes, presentations. Built with LlamaIndex for document parsing.