Why Nostr? What is Njump?
2024-05-29 23:22:05

Sarah Jamie Lewis on Nostr: New Paper: On the application of Bloom Filter Hierarchies representing Sub-word Token ...

New Paper: On the application of Bloom Filter Hierarchies representing
Sub-word Token Bigram Occurrence to Probabilistic Full Text Search

This is a note regarding a prototype I've been working on for a few months in the domain of Decentralized Search (and Indexing)

It covers a data structure with interesting properties that I've been playing with, and documents some experiments regarding naive full text search performance.

Comments/questions/critique welcome.

PDF: https://sarahjamielewis.com/decentralization/search/ftsbloom.pdf
Author Public Key
npub14mfj9wu5ujvu6rxj8w62dvkvqj7myqc6kz0upz3kuw3wx4dz9vgstvp58n