Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reorganize ingest_metadata field #748

Open
shirey opened this issue Oct 4, 2024 · 1 comment
Open

Reorganize ingest_metadata field #748

shirey opened this issue Oct 4, 2024 · 1 comment
Assignees
Labels
P Pitt dev team

Comments

@shirey
Copy link
Member

shirey commented Oct 4, 2024

The Dataset.ingest_metadata field will be reorganized to accommodate:

  • Easier access to metadata for searching
  • Adding fields to accommodate the Workflow Tracking effort
  • Adding the Dataset.calculated_metadata field to allow storage of calculated/derived metadata

This work is specified in the ingest_metadata field reorganization document

From the above document a summary of the changes that need to be made are:

  • MOVE:

    • Dataset.ingest_metadata.metadata → Dataset.metadata
    • Dataset.ingest_metadata.files → Dataset.files
  • DELETE:

    • Dataset.ingest_metadata.thumbnail_file_abs_path
    • Dataset.ingest_metadata.extra_metadata
  • ADD:

    • Dataset.ingest_metadata.workflow_description (as String/Text)
    • Dataset.ingest_metadata.workflow_version (as String/Text)
    • Dataset.calculated_metadata (as JSON text)
  • RENAME (Elasticsearch/Search API only):

    • Dataset.metadata → Dataset.ingest_metadata (Elasticsearch/Search API only)
@tjmadonna
Copy link

Relevant SenNet PR

sennetconsortium/entity-api#510

Script for updating datasets

https://gist.github.com/tjmadonna/48f7d00d84ee0b36094bf337e7cabc87

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
P Pitt dev team
Projects
Status: In Progress
Development

No branches or pull requests

4 participants