URL format for linking to resources inside ZIP files #137

@hubgit

I have a situation where some resources (e.g. a PDF file) are in the same directory as the datapackage.json file, and some other files (e.g. the source files used to generate the PDF file) are in a nearby ZIP archive.

I would like the resources section of the data package description file to describe resources contained in the ZIP archive, so that a preview can be shown without having to fetch and extract the ZIP file (which may be large) to get to a data package description file inside it.

To do this, the resource URLs need to link to locations inside the ZIP file. There are a few existing approaches to this, using URL fragments or similar:

  • http://example.org/package.zip#path=path/to/the/file: similar to the way individual pages of PDF files can be referenced, but using a path parameter name.
  • http://example.org/package.zip#path/to/the/file: I can't think of any use for a fragment identifier on a ZIP file URL other than linking to a path within the archive, so this seems cleaner than using the path parameter. This is also how PHP's stream wrappers address paths within ZIP files.
  • jar:http://example.org/package.zip!/path/to/the/file: the JAR URL scheme, which takes the form jar:<url>!/<entry> and is said to work "for any ZIP based file".
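
To make the comparison concrete, here is a rough Python sketch of how a consumer might split each of the three candidate formats into an archive location and a path inside the archive. The function name split_zip_url is hypothetical and is not part of any Data Package tooling; the sketch only illustrates that all three forms carry the same two pieces of information.

```python
from urllib.parse import parse_qs, unquote, urlsplit


def split_zip_url(url):
    """Split a resource URL into (archive, inner_path).

    Hypothetical helper: handles the three candidate formats only.
    """
    if url.startswith("jar:"):
        # JAR-style: everything before "!" is the archive, the rest is the entry.
        archive, sep, inner = url[len("jar:"):].partition("!")
        if not sep:
            raise ValueError("jar: URL has no '!' separator")
        return archive, inner.lstrip("/")

    parts = urlsplit(url)
    if not parts.fragment:
        raise ValueError("URL has no fragment naming a path in the archive")

    # Rebuild the archive URL without the fragment.
    archive = parts._replace(fragment="").geturl()

    if parts.fragment.startswith("path="):
        # Parameter-style fragment: #path=path/to/the/file
        inner = parse_qs(parts.fragment)["path"][0]
    else:
        # Bare fragment: #path/to/the/file
        inner = unquote(parts.fragment)
    return archive, inner.lstrip("/")


if __name__ == "__main__":
    for candidate in (
        "http://example.org/package.zip#path=path/to/the/file",
        "http://example.org/package.zip#path/to/the/file",
        "jar:package.zip!path/to/the/file",
    ):
        print(split_zip_url(candidate))
```

One thing the sketch exposes: with the bare-fragment form, a file inside the archive whose path itself begins with "path=" would be ambiguous with the parameter form, so a spec would need to pick exactly one of the two fragment styles.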

Is there any preference for which of these formats should be used in the resources section of Data Package JSON?
