aakash
be17980114
Add ChatGPT RAG support - resolves #40
- Implement ChatGPTReader for parsing HTML/ZIP exports from ChatGPT
- Add chatgpt_rag.py following BaseRAGExample pattern
- Support both concatenated conversations and individual messages
- Handle multiple input formats (.html, .zip, directories)
- Include comprehensive error handling and user guidance
- Add metadata extraction (titles, timestamps, roles)
- Integrate with existing LEANN chunking and embedding systems
Features:
✅ HTML parsing from ChatGPT exports
✅ ZIP file extraction support
✅ Conversation detection and structuring
✅ Message role identification (user/assistant)
✅ Metadata extraction and preservation
✅ Dual processing modes
✅ Command-line interface with all LEANN options
✅ Comprehensive error handling
✅ Multiple input format support
Usage:
python -m apps.chatgpt_rag --export-path chatgpt_export.html
python -m apps.chatgpt_rag --export-path chatgpt_export.zip --query 'Python help'
2025-09-29 01:44:32 -07:00
..
2025-09-29 01:44:32 -07:00
2025-09-17 18:44:00 -07:00
2025-08-03 23:06:24 -07:00
2025-09-17 18:44:00 -07:00
2025-08-03 23:06:24 -07:00
2025-08-20 17:31:46 -07:00
2025-08-22 02:35:09 -07:00
2025-09-29 01:44:32 -07:00
2025-08-19 23:35:31 -07:00
2025-08-19 23:35:31 -07:00
2025-08-22 02:35:09 -07:00
2025-08-03 23:06:24 -07:00