Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fixity Auditing in Islandora #1931

Open
kstapelfeldt opened this issue Oct 19, 2021 · 14 comments
Open

Fixity Auditing in Islandora #1931

kstapelfeldt opened this issue Oct 19, 2021 · 14 comments
Labels
Type: use case proposes a new feature or function for the software using user-first language.

Comments

@kstapelfeldt
Copy link
Member

Title (Goal) Fixity Auditing in Islandora
Primary Actor Collection Manager
Scope Preservation
Story As a collection manager I wish to be able to perform fixity audits on objects in the repository and see reports pertaining to the health of objects in my repository.
@kstapelfeldt kstapelfeldt added the Type: use case proposes a new feature or function for the software using user-first language. label Oct 19, 2021
@rosiel
Copy link
Member

rosiel commented Oct 20, 2021

Is this the same as "As a collection manager, I wish to generate checksums for the media in my repository."?

@kstapelfeldt
Copy link
Member Author

They could be added together, but generating checksums would be discrete from checking checksums and seeing reports about it, I think?

@rosiel
Copy link
Member

rosiel commented Oct 20, 2021

Possibly. Do you think there's a use case for "collection managers" to generate checksums that nothing happens with?

This was from:

  • Feature Parity, line 84, phrased as "Enables checksums on datastreams".
  • ICG Report, "Include and document digital preservation features in Islandora."

It sounds like this is part of a larger, "Preservation" interest/suite of features. I wonder if it'd be worth a dedicated interest group (or, "Initiative" which seem to be a bit different from our version of IGs)?

@mjordan
Copy link
Contributor

mjordan commented Oct 20, 2021

Fixity auditing it verifying a file's checksum periodically over time to ensure that the file has not changed inadvertently. So generating checksums is not the same thing as fixity auditing, it's a precondition for fixity auditing.

@kstapelfeldt
Copy link
Member Author

What Mark said. I think the use case would be that I have something else that checks the checksums. They could go together.

I think an interest group could be a great idea, but we know about immediate gaps in the toolset, so I think we can proceed to build things like checksum generation, and fixity checking and reporting (and ostensibly, a lot already exists in @mjordan work that could be embraced more officially)

@mjordan
Copy link
Contributor

mjordan commented Oct 20, 2021

I am happy to contribute Riprap and Islandora Riprap Integration to the IF (and remain its maintainer). It currently generates checksums, performs the periodic verification of checksums, and reports checksum mismatches found during that periodic verification.

FYI Islandora already requires the Filehash contrib module, which generates checksums. AFAIK it doesn't do anything specific with them other than store them in a db table.

@mjordan
Copy link
Contributor

mjordan commented Oct 20, 2021

@kstapelfeldt we have a Preservation IG but it has been inactive for some time. Could revive it.

@kstapelfeldt
Copy link
Member Author

I think that could be a good longer-term goal for guiding preservation work in Islandora, so long as there is sufficient interest. For me, RipRap would be an excellent addition to the official repositories of Islandora.

@DonRichards
Copy link
Member

@mjordan Is it possible to run RipRap without Fedora? Like storing a hash into a field or something 🤷

@kstapelfeldt
Copy link
Member Author

I'm not the tagged person, but FWIW we're running RipRap without Fedora. It's definitely possible and Mjordan wrote this so it will work with Drupal.

@mjordan
Copy link
Contributor

mjordan commented Nov 29, 2021

@kstapelfeldt is correct. You need to use the PluginFetchDigestFromDrupal plugin instead of the PluginFetchDigestFromFedoraAPI plugin.

@rosiel
Copy link
Member

rosiel commented Dec 13, 2022

The Drupal Filehash module is already part of islandora, and we could improve how we use it!

  • available since filehash 2.0, we can keep an "original" checksum?
  • integrate with a local drupal checksum checker? Could it be a view?
  • Current behaviour of islandora derivatives "replaces" files on disk, which would cause checksum check errors!
  • Discuss best practices around duplicate checking
  • Discuss best practices around hashing algorithms
  • Make our hardcoded sha-1 check not rely on by luck having the sha-1 algorithm selected.

@dannylamb
Copy link
Contributor

Yep, i think i've turned the 'original checksum' thing on a site or two. We're totally already keeping hashes of everything in the DB on drupal side. There's a lot to expand on there. Exciting!

@rosiel
Copy link
Member

rosiel commented Dec 21, 2022

(Tech call) DGI has also released a Fixity Check module: https://github.com/discoverygarden/dgi_fixity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Type: use case proposes a new feature or function for the software using user-first language.
Projects
Development

No branches or pull requests

5 participants