Fix-It Ticket for Smithsonian Institution Integration (original #397) #1784
Labels
🟩 priority: low
Low priority and doesn't need to be rushed
🧱 stack: catalog
Related to the catalog and Airflow DAGs
This issue has been migrated from the CC Search Frontend repository
We know that some museums in SI have discrepancies between field names, e.g. in some museums they use "summary" and in others "description" to describe the result.
To improve the results of SI objects, we need to go through each museum within SI to look for any missing metadata mapping/potential improvements.
Original Comments:
annatuma commented on Mon May 18 2020:
ChariniNana commented on Fri Jul 24 2020:
An initial analysis on missing metadata mapping is as follows:
The numbers and percentages of missing creators:-
The numbers and percentages of missing descriptions in the meta data field:-
The reason for missing the creator value is because the field from which to get it is not yet included in the CREATOR_TYPES dictionary and the description is missing since it's not yet covered in DESCRIPTION_TYPES as defined in the Smithsonian script.
Other findings:-
We entirely lose the following museums due to unavailability of the mandatory value
foreign_landing_url
and/or due to not knowing whether they have the CC0 licenserecord_link
andguid
fields from which we get theforeign_landing_url
are missing.record_link
andguid
fields from which we get theforeign_landing_url
are missing.record_link
andguid
fields from which we get theforeign_landing_url
are missing. Theusage
->access
fields from which we determine whether images are CC0 licensed are also missing.record_link
andguid
fields from which we get theforeign_landing_url
are missing. Theusage
->access
fields from which we determine whether images are CC0 licensed are also missing.source
ChariniNana commented on Mon Jul 27 2020:
For populating the description information, it was noted that the
freetext -> notes -> Notes
field would be appropriate for NMNH.source
ChariniNana commented on Fri Jul 31 2020:
ChariniNana commented on Fri Jul 31 2020:
The text was updated successfully, but these errors were encountered: