Binary Vector Search at 350GB/S Using ARM Neon

7 points by Equiet a year ago · 2 comments

Reader

MarekDlugos a year ago

re: optimization for 1024b vectors — do you pad shorter ones, or fallback to a more general kernel?

marekgalovic a year ago

We do a projection of the original vectors so that it matches one of our optimized kernel. This generally gives us better recall vs. simple padding since all bits are utilized.

Settings