This project aims to be very Postgres centered, so that we can run several indexing processes at the same time. Just keep track of which process indexed what, and this should be lightning fast.
- PostgreSQL (>= 9.5) 😍
- NodeJS 💊
- Something to index 📕
- Patience 😣
- Turn off
synchronous_commitif you can. - Increase your usual settings like
shared_buffersandwork_mem. - One instance of Markov runs one Postgresql connection. Either run multiple instances or switch to node pg pooling.
- You can remove the fulltext index while inserting your data.