-
Notifications
You must be signed in to change notification settings - Fork 71
CLAW Migration Sprint Kickoff
This meeting is a hybrid teleconference and IRC chat. Anyone is welcome to join. Here is the info:
- Time: 11:00pm Eastern Daylight Time US (UTC-4)
- Zoom link: https://zoom.us/j/203396828
- IRC:
- Join the #islandora chat room via Freenode Web IRC (enter a unique nick)
- Or point your IRC client to #islandora on irc.freenode.net
- Adam Soroka
- Benjamin Rosner
- Ann McShane
- Carolyn Moritz
- Seth Shaw
- Rosie LeFaive
- Andrija Sagic
- LSU (Jason, Cara, Scott)
- Jared Whiklo 🌠
- Paul Pound
- Michael Bolam
- Rachel Tillay
- Marcus Barnes
- Jonathan Green
- Patrick Dunlavey
- Yamil Suarez
- Best way to test?
- Full list of tickets: https://github.com/Islandora-CLAW/CLAW/issues?q=is%3Aissue+is%3Aopen+label%3Amigration
- https://github.com/Islandora-CLAW/CLAW/issues/889
- https://github.com/Islandora-CLAW/CLAW/issues/893
- https://github.com/Islandora-CLAW/CLAW/issues/894
- https://github.com/Islandora-CLAW/CLAW/issues/895
- https://github.com/Islandora-CLAW/CLAW/issues/896
- https://github.com/Islandora-CLAW/CLAW/issues/897
- https://github.com/Islandora-CLAW/CLAW/issues/876
- https://github.com/Islandora-CLAW/CLAW/issues/898
- Any missing tickets we need to make?
-
Discussed on Friday, what is the best way to test a migration from 7.x to CLAW. Jared has a 7.x vagrant and a CLAW vagrant. Would spinning up some sandboxes be beneficial
Rachel suggests that the decision be held until the 12:00 meeting as the people on that call.
Rachel can go and see if she could create some sample packages from some OSS content. Danny suggests that trying to hit some of the weird issues we are going to hit, large images, weird size images, complex MODS records.
-
Go through tickets to determine any issues ahead of time.
-
https://github.com/Islandora-CLAW/CLAW/issues/889
- If we extract name elements from MODS, then they will have a role.
- Seth doesn't think we can assign a taxonomy to a field, instead you'd need a compound field which has sub-fields. These sub-fields would hold the "role" (which is a link to the appropriate taxonomy term) and the "name" (which is a link to the appropriate Agent resource which is the person).
- Looking for a generic solution for this.
-
https://github.com/Islandora-CLAW/CLAW/issues/893
- Scrap in the list of MARC Relators and generate a taxonomy vocabulary. The list returned is just of the URI so we need to get all the URIs and then hit each URI to get the term details.
- May need to be updated once a year.
- Instead we are pulling it in as a text file with pipes (|) between the text and URI which will be used in the above mentioned compound field.
-
https://github.com/Islandora-CLAW/CLAW/issues/894
- Make it a simple field on the resource
- Re-use code from above pulling of MARC relators list and use it here.
- Can be a good starter issue to try a migration from a small csv.
- As of MODS 3.7 you are allowed to use other lists to add terms, we are wide open.
- Perhaps import it as a configuration entity and then allow people to edit that configuration entity later to add new terms.
-
https://github.com/Islandora-CLAW/CLAW/issues/895
- Lots of changes in CLAW, make sure this doesn't die because of them.
-
https://github.com/Islandora-CLAW/CLAW/issues/896
- Create fields for all the various elements described in the MIG spreadsheet.
- https://docs.google.com/spreadsheets/d/18u2qFJ014IIxlVpM3JXfDEFccwBZcoFsjbBGpvL0jJI/edit#gid=0
-
https://github.com/Islandora-CLAW/CLAW/issues/897
- Much like 894, in that we need to generate a new content type to hold the information for an "Agent"
-
https://github.com/Islandora-CLAW/CLAW/issues/898
- Determine how we might use OpenRefine in a manual or automated process, possibly as two (or more) separate processes.
- Lots of volunteers to test this out (awesome).
- There is a PHP client we could possibly used to automated some cleaning.
-
https://github.com/Islandora-CLAW/CLAW/issues/876
- How do we want to identify an "object"
- Lots of the work we are doing would allow for this, but some repositories don't support Bibframe.
- Perhaps once we have more time to do this properly.
-
Should we generate a new ticket for automated OpenRefine work?
- Maybe we'll wait on a new ticket and try to make sure that the module in migrate_7x_claw allows us to input from CSV or automate some of the processing.
-
What type of date formats should we handle?
- As long as we account for complex dates (ie. 19??, circa 1800)
-
You may be looking for the islandora-community wiki · new to islandora? · community calendar · interest groups · roadmap