It seems really powerful and could maybe be used for full-text search? https://github.com/deepdoctection/deepdoctection