Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
-
Updated
Dec 1, 2025 - Python
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
𝗜𝗻𝗳𝗼𝗿𝗺𝗮𝘁𝗶𝗼𝗻 𝗥𝗲𝘁𝗿𝗶𝗲𝘃𝗮𝗹 | 𝗖𝗦𝟲𝟬𝟬𝟵𝟮 | 𝗕𝗼𝗼𝗹-𝗦𝗲𝗮𝗿𝗰𝗵, 𝗥𝗮𝗻𝗸𝗲𝗿, 𝗪𝗼𝗿𝗱𝗡𝗲𝘁 𝗦𝘂𝗺𝗺𝗮𝗿𝗶𝘇𝗲𝗿
A two-stage information retrieval model using baseline TF-IDF model and refined BM25.
Parse HTML pages. Create inverted index. Search for pages
A basic and intuitive Python module for (Vector Space) IR system. (Focuses on simplicity and understandability)
IR ranking system based on Okapi BM25 and blind feedback
Content specific search engine with the aim to retrieve movies information given the content of the user's query.
Ranked document retrieval on a large text corpus.
Création d'un moteur de recherche (Parsing de la collection, Index + Index inversé, Ordonnancement, Ranking)
This project implements an in-memory search engine for indexing and retrieving documents from a CSV file using Python and NLTK. It preprocesses text, builds an inverted index, and ranks documents based on relevance to a query using the Okapi BM25 algorithm.
Add a description, image, and links to the okapi-bm25 topic page so that developers can more easily learn about it.
To associate your repository with the okapi-bm25 topic, visit your repo's landing page and select "manage topics."