GeorgeCurtis 3 days ago

We're going to add BM25. Currently it's dense search only. Coming very soon.
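
For anyone unfamiliar with what BM25 adds on top of dense search, here is a rough sketch of Okapi BM25 scoring. It is only an illustration, not the project's implementation: the function name `bm25_scores`, the whitespace tokenizer, and the defaults `k1=1.5`, `b=0.75` are all assumptions made for the example.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score each document in `docs` against `query` with Okapi BM25.

    Hypothetical helper: tokenization is a naive lowercase whitespace split.
    """
    tokenized = [d.lower().split() for d in docs]
    n_docs = len(tokenized)
    avg_len = sum(len(toks) for toks in tokenized) / n_docs

    # Document frequency: how many documents contain each term.
    df = Counter(term for toks in tokenized for term in set(toks))

    scores = []
    for toks in tokenized:
        tf = Counter(toks)
        score = 0.0
        for term in query.lower().split():
            if term not in tf:
                continue
            # Lucene-style IDF variant, which never goes negative.
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term-frequency saturation with document-length normalization.
            norm = tf[term] + k1 * (1 - b + b * len(toks) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


docs = ["graph vector database", "dense vector search", "cooking recipes"]
scores = bm25_scores("vector search", docs)
```

The whitespace split is the weak point for languages without space-delimited words, which is one reason a tokenizer choice matters as much as the scoring formula.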

elpalek 3 days ago

Have you thought about SPLADE models? e.g. https://arxiv.org/abs/2109.10086

GeorgeCurtis 3 days ago

Looks really interesting, I'll have a proper read. What would be your reasoning to incorporate this if we already have vector functionality and semantic search?

elpalek 3 days ago

My project deals w/ non-English text, and BM25 performance is middling. A language-specific sparse model helps.

xavcochran 3 days ago

We will definitely look into it. The SPLADE models look promising!

xavcochran 3 days ago

SPLADE*