[ICML'26] LEMUR reduces multi-vector retrieval for late interaction models such as ColBERT into regular single-vector retrieval.
embeddings nearest-neighbor-search approximate-nearest-neighbor-search colbert multi-vector late-interaction multi-vector-embeddings
-
Updated
Jun 21, 2026 - C++