Commit Graph

40 Commits

Author SHA1 Message Date
Andy Lee
8b4654921b fix: run faiss in subprocess to prevent kmp 2025-07-14 00:29:21 -07:00
Andy Lee
711fb4a775 feat: compare faiss 2025-07-13 22:44:16 -07:00
Andy Lee
3b5a185e60 refactor: check if current emb_server has correct passages/embedder 2025-07-13 22:43:51 -07:00
yichuan520030910320
b8e5728e6a fix wechat application 2025-07-13 22:29:54 -07:00
yichuan520030910320
d038319d8b upd readme wechat application 2025-07-13 22:00:49 -07:00
yichuan520030910320
c611d0f30f upd readme mail application 2025-07-13 21:48:57 -07:00
yichuan520030910320
c17899662f upd readme mail application 2025-07-13 21:30:08 -07:00
yichuan520030910320
6fa9512a64 fix wechat path 2025-07-13 18:23:31 -07:00
Andy Lee
53c58fa755 perf: switch to traditional pdf reader 2025-07-13 17:04:23 -07:00
yichuan520030910320
c69afb56e4 Resolve submodule conflict - update to af2a264 2025-07-13 17:03:42 -07:00
yichuan520030910320
0fa8a9191f add wechat history extract app 2025-07-13 16:52:08 -07:00
Andy Lee
71ef4b7d4c fix: reproducible dpr on mac 2025-07-12 18:13:22 -07:00
Andy Lee
ecab43e307 feat: dataset for evaluation 2025-07-12 23:43:10 +00:00
Fangzhou66
88ca09440d fix some hf problem 2025-07-12 16:13:15 -07:00
yichuan520030910320
74ffd7ec64 add email test code 2025-07-11 23:59:47 -07:00
Andy Lee
eb6f504789 Datastore reproduce (#3)
* fix: diskann zmq port and passages

* feat: auto discovery of packages and fix passage gen for diskann

* docs: embedding pruning

* refactor: passage structure

* feat: reproducible research data, rpj_wiki & dpr

* refactor: chat and base searcher

* feat: chat on mps
2025-07-11 23:37:23 -07:00
yichuan520030910320
8239bbb48f add google history api 2025-07-11 21:21:36 -07:00
yichuan520030910320
16ee9d0422 add traverse all dict interface 2025-07-10 15:59:16 -07:00
yichuan520030910320
8a961f8ab3 align the llamaindex result w leann& test attachment 2025-07-09 21:42:15 -07:00
yichuan520030910320
558126c46e add leann and llamaindex email infra, and need to align the results 2025-07-09 16:27:11 -07:00
yichuan520030910320
04c9684488 add email test code 2025-07-09 15:06:31 -07:00
yichuan520030910320
dafb2aacab update macos env 2025-07-08 14:37:41 -07:00
yichuan520030910320
6497e17671 add gpu chunk embedd and add complexity in hnsw 2025-07-08 18:40:52 +00:00
yichuan520030910320
dfca00c21b add mac support in this repo 2025-07-07 18:22:24 -07:00
yichuan520030910320
637dab379e add workaround code 2025-07-07 23:13:47 +00:00
yichuan520030910320
6fc57eb48e add reuse code 2025-07-07 21:07:00 +00:00
yichuan520030910320
95a653993a rm useless 2025-07-06 06:47:20 +00:00
yichuan520030910320
af0959818d rm useless 2025-07-06 05:21:05 +00:00
Andy Lee
cf17c85607 Make DiskANN and HNSW work on main example (#2)
* fix: diskann zmq port and passages

* feat: auto discovery of packages and fix passage gen for diskann
2025-07-05 22:18:12 -07:00
yichuan520030910320
df63526503 merge main 2025-07-06 00:50:58 +00:00
yichuan520030910320
e92deee1e8 fix larger file read and add faq 2025-07-06 00:48:57 +00:00
Andy Lee
910927a405 feat: support more embedders 2025-07-06 00:35:07 +00:00
Andy Lee
0aa84e147b feat: hnsw embedding server and csr format 2025-07-05 23:04:41 +00:00
yichuan520030910320
368474d036 fix larger file read and add faq 2025-07-03 23:25:36 +00:00
yichuan520030910320
a627abe794 fix file path bug still compatible bug in hnsw search 2025-07-03 02:02:42 +00:00
yichuan520030910320
44815ee7fd add configurable funcname 2025-07-02 05:18:00 +00:00
yichuan520030910320
371e3de04e add configurable funcname 2025-07-01 05:02:01 +00:00
yichuan520030910320
b81b5d0f86 256 cannot work but increase chunk size can 2025-07-01 04:09:18 +00:00
yichuan520030910320
ee507bfe7a Initial commit 2025-06-30 11:01:12 +00:00
yichuan520030910320
46f6cc100b Initial commit 2025-06-30 09:05:05 +00:00