From f13bd02fbd66348522782a114665ed5667e4dba4 Mon Sep 17 00:00:00 2001 From: aakash Date: Sat, 6 Dec 2025 03:28:08 -0800 Subject: [PATCH] docs: Add ColQwen multimodal PDF retrieval to README Add brief introduction and usage guide for ColQwen integration, similar to other RAG application sections in the README. - Quick start examples for building, searching, and interactive Q&A - Setup instructions with prerequisites - Model options (ColQwen2 vs ColPali) - Link to detailed ColQwen guide --- README.md | 48 ++++++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 48 insertions(+) diff --git a/README.md b/README.md index 0b0e59f..4bc3627 100755 --- a/README.md +++ b/README.md @@ -379,6 +379,54 @@ python -m apps.code_rag --repo-dir "./my_codebase" --query "How does authenticat +### 🎨 ColQwen: Multimodal PDF Retrieval with Vision-Language Models + +Search through PDFs using both text and visual understanding with ColQwen2/ColPali models. Perfect for research papers, technical documents, and any PDFs with complex layouts, figures, or diagrams. + +> **🍎 Mac Users**: ColQwen is optimized for Apple Silicon with MPS acceleration for faster inference! + +```bash +# Build index from PDFs +python -m apps.colqwen_rag build --pdfs ./my_papers/ --index research_papers + +# Search with text queries +python -m apps.colqwen_rag search research_papers "How does attention mechanism work?" + +# Interactive Q&A +python -m apps.colqwen_rag ask research_papers --interactive +``` + +
+📋 Click to expand: ColQwen Setup & Usage + +#### Prerequisites +```bash +# Install dependencies +uv pip install colpali_engine pdf2image pillow matplotlib qwen_vl_utils einops seaborn +brew install poppler # macOS only, for PDF processing +``` + +#### Build Index +```bash +python -m apps.colqwen_rag build \ + --pdfs ./pdf_directory/ \ + --index my_index \ + --model colqwen2 # or colpali +``` + +#### Search +```bash +python -m apps.colqwen_rag search my_index "your question here" --top-k 5 +``` + +#### Models +- **ColQwen2** (`colqwen2`): Latest vision-language model with improved performance +- **ColPali** (`colpali`): Proven multimodal retriever + +For detailed usage, see the [ColQwen Guide](COLQWEN_GUIDE.md). + +
+ ### 📧 Your Personal Email Secretary: RAG on Apple Mail! > **Note:** The examples below currently support macOS only. Windows support coming soon.