PISA is a text search engine able to run on large-scale collections of documents. It allows researchers to experiment with state-of-the-art techniques, allowing an ideal environment for rapid development.
Some features of PISA are listed below:
- Written in C++ for performance;
- Indexing & Parsing & Sharding capabilities;
- Many index compression methods implemented;
- Many query processing algorithms implemented;
- Implementation of document reordering;
- Free and open-source with permissive license;
The best way to get started is by reading the official documentation.
If you want to get involved with PISA, please check out our Contributing page.
PISA is a fork of the ds2i project started by Giuseppe Ottaviano.