Skip to content

Roadmap Tracking Issue - EPIC #5

@jqnatividad

Description

@jqnatividad

OVERALL VISION: To increase the utility and performance of the CKAN Datastore:

  • by enriching resources, so that right after a file is pushed by DP+, it does a lot of data-wrangling tasks that are typically done manually:
    • a lot of metadata is inferred, so the Data Publisher does not have to laboriously enter it in
    • descriptive statistics are computed, allowing the Data Publisher and the end-user to better understand the resource
    • location information is automatically normalized and geocoded
    • related datasets/resources are automatically inferred
    • auto-tagging
  • by taking advantage of PostgreSQL native features
    • also use it as a Document Database leveraging JSONB?
    • partitioning/sharding?
  • by tapping into the rich PostgreSQL extensions ecosystem (in particular - PostGIS, Timescale, Citus, CartoDB, Apache Age and ZomboDB)
  • give it "Data Lake"-like capabilities
  • enable Datastore API users to issue performant, reliable SQL queries

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions