Description
openedon Feb 18, 2023
Summary
Rekognition data in the form of object labels was collected for roughly 100m records in the Openverse catalog.
These labels should be sanitized for suitability in the Openverse project and applied to records in the Openverse Catalog as tags.
Description
Some exploratory work was done to assess the quality of these labels. The team generally felt positive about them, given we would blanket remove a subset of them (e.g. ones that assume a gender). We will need to do a broader analysis to determine if there are more labels we would want to exclude, and then incorporate them into the existing tags for each record in the catalog. The automated tags include a confidence score associated with the tag value, and we should also incorporate those values into the overall document score for relevant searches.
Best guess at list of implementation plans:
- Strategy for filtering then upserting the tags into their associated records.
- Determining whether/how to surface these tags in the frontend and differentiate them from provider-supplied tags
Documents
- Project Proposal
- Implementation Plan: Determine and design how machine-generated tags will be displayed/conveyed in the API
- Implementation Plan: Determine and design how machine-generated tags will be displayed/conveyed in the Frontend
- Implementation Plan: Augment the catalog database with suitable Rekognition tags
- Reviewed Rekognition label list
Issues
- Project Proposal: Incorporate Rekognition data into the catalog #3896
- Implementation Plan: Determine and design how machine-generated tags will be displayed/conveyed in the API #4038
- Implementation Plan: Determine and design how machine-generated tags will be displayed/conveyed in the Frontend #4039
- Implementation Plan: Augment the catalog database with suitable Rekognition tags #4040
Milestone
Metadata
Assignees
Labels
Type
Projects
Status
⏸ On Hold