DEV-10: replace kakadu with grok#95

Merged
aelkiss merged 1 commit into master from DEV-10-grok on Apr 18, 2022
aelkiss commented Apr 11, 2022

  • Uses grok for compressing incoming contone TIFFs to JPEG2000 and for recompressing lossless JPEG2000 images
  • Removes unused kdu_munge code (see comments)
  • Tests compression from contone TIFF and re-compression of lossless JPEG2000
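
As a rough illustration of the compression step described above, here is a minimal sketch that builds a grok command line for a single image. The helper name `grk_compress_args` is hypothetical, and the flags (`-i`, `-o`, `-q` for a target PSNR in dB) are assumptions about the `grk_compress` CLI that should be checked against `grk_compress -h`; the parameters actually used in this PR are not shown in the conversation.

```python
def grk_compress_args(src, dst, psnr_db=32):
    """Build an argument list for compressing one image with grok.

    Hypothetical helper: the flag names are assumptions, not taken
    from this PR. psnr_db is the requested quality in decibels.
    """
    return [
        "grk_compress",
        "-i", str(src),      # input image, e.g. a contone TIFF (assumed flag)
        "-o", str(dst),      # output JPEG2000 file (assumed flag)
        "-q", str(psnr_db),  # requested PSNR in dB (assumed flag)
    ]
```

The resulting list could be passed to `subprocess.run(args, check=True)`; building it as a list rather than a shell string avoids quoting problems with unusual file names.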

aelkiss commented Apr 11, 2022

This produces valid images; still to-do is to pick a PSNR that seems reasonable for our images (requires testing on sample images)

aelkiss commented Apr 14, 2022

> This produces valid images; still to-do is to pick a PSNR that seems reasonable for our images (requires testing on sample images)

Done, using a requested PSNR of 32. Across a variety of scanned book images, this yields an average PSNR similar to that of Kakadu 6.4.0, as measured by ImageMagick (compare -metric PSNR), albeit with about 33% larger total compressed size.
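
For reference, PSNR is 10 * log10(MAX^2 / MSE), where MSE is the mean squared difference between the original and compressed samples and MAX is the largest possible sample value. The sketch below computes it for flat sequences of 8-bit samples. It illustrates the metric only; the function name and the default of 255 are assumptions, and it is not the ImageMagick implementation used for the comparison above.

```python
import math


def psnr(reference, candidate, max_value=255):
    """Peak signal-to-noise ratio in dB between two equal-length
    sequences of sample values (assumed 8-bit by default)."""
    if len(reference) != len(candidate) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and the same length")
    # Mean squared error over all samples.
    mse = sum((r - c) ** 2 for r, c in zip(reference, candidate)) / len(reference)
    if mse == 0:
        return math.inf  # identical inputs: no noise at all
    return 10 * math.log10(max_value ** 2 / mse)
```

A higher value means the compressed image is closer to the original, so comparing the average PSNR across a sample set gives a rough way to match quality settings between two encoders.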

* Uses grok for compressing incoming contone TIFFs to JPEG2000 and for
recompressing lossless JPEG2000 images, using parameters that produce
similar quality output to Kakadu across a sample of digitized book
images

* Remove unused kdu_munge code (see comments)

* Update tool versions for PREMIS (should really get these via dpkg)

* Test that XMP is preserved when remediating images

aelkiss commented Apr 18, 2022

This adds a couple of tests around preserving XMP. @respinos I think this should be good to go once the tests pass, but let me know if anything else comes to mind regarding using grok to create JPEG2000 images.

aelkiss merged commit 838a9c9 into master Apr 18, 2022
aelkiss deleted the DEV-10-grok branch April 18, 2022 19:46
niquerio added a commit to mlibrary/divp that referenced this pull request Jan 15, 2026
relevant HT PR: hathitrust/feed#95
grok is available in the mlibrary apt repo
niquerio added a commit to mlibrary/divp that referenced this pull request Jan 15, 2026
relevant HT PRs:
* hathitrust/feed#95
* hathitrust/feed#172

grokj2k is available in the mlibrary apt repo