Support for enabling full-text index and compression on demand #269

parvit · 2022-07-25T16:08:21Z

This change responds to issue #243.

Requires openzim/python-scraperlib#88 to work correctly.

Changes the default behavior to not require compression and full-text indexing by default which can be an issue with big sites.

Introduces two new commandline flags to support enabling the features back:

--make_fulltext_index : boolean default false, activates the fulltext indexing of xapian
--compression : string which corresponds to the requested zim compression algorithm (eg. lzma)

…m file

kelson42 · 2022-07-25T16:24:36Z

@parvit Text compression and ft indexing ahoukd be activated per default. This is what users expect and a "standard" in all our scraper. I'm not informed about problems by doing so.

rgaudin · 2022-07-25T16:31:46Z

Let's verify first #243 (comment) that disabling them would significantly improve the MEM situation. In such a case, we may introduce the opposite option (disable) as a temporary measure until we fix the root cause as both are definitely wanted features.

parvit · 2022-07-25T16:38:37Z

@kelson42 Sure if you want you can check the data i've provided in issue 243 that indicate that those two features can create memory usage problems (in the scenario of big sites) and at least allowing to disable them should be considered.

parvit · 2022-07-26T18:30:39Z

Seen that the other PR was closed than this too has no reason to be left open.

parvit added 2 commits July 25, 2022 07:54

Fix for impossibility to disable fulltext index and compression of zi…

0dcaef6

…m file

added command-line flags for clean operation

e1a2f68

kelson42 requested a review from rgaudin July 25, 2022 16:22

parvit closed this Jul 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Support for enabling full-text index and compression on demand #269

Support for enabling full-text index and compression on demand #269

Uh oh!

parvit commented Jul 25, 2022

Uh oh!

kelson42 commented Jul 25, 2022

Uh oh!

rgaudin commented Jul 25, 2022

Uh oh!

parvit commented Jul 25, 2022 •

edited

Loading

Uh oh!

parvit commented Jul 26, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Uh oh!

Support for enabling full-text index and compression on demand #269

Support for enabling full-text index and compression on demand #269

Uh oh!

Conversation

parvit commented Jul 25, 2022

Uh oh!

kelson42 commented Jul 25, 2022

Uh oh!

rgaudin commented Jul 25, 2022

Uh oh!

parvit commented Jul 25, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

parvit commented Jul 26, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

parvit commented Jul 25, 2022 •

edited

Loading