How to make a small zarr file from a large VCF? #365

Open
@hyanwong

For testing, it can be useful to work with a small subset of the zarr file that is created from a huge VCF. Is there a way to restrict the samples and/or sites in the VCF to a smaller workable subset before conversion, or is the approved way to subset the zarr file afterwards? If the latter, is there any example code to show how to do this?

I don't want to do this on-the-fly during analysis using masks, because the zarr file itself is too large to place in permanent storage (larger files in personal storage areas get deleted).
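One possible approach to the first option, subsetting the VCF before conversion so the full-size zarr is never written, is sketched below. It is not a confirmed project recommendation. It assumes `bcftools` is installed and that the input VCF is bgzipped and indexed (region filtering with `-r` needs an index). The helper `bcftools_subset_command`, the file names, the sample IDs and the region are all hypothetical and only illustrate the idea.

```python
import shlex


def bcftools_subset_command(vcf_in, vcf_out, samples=None, region=None):
    """Build a `bcftools view` command that keeps only some samples and sites.

    Hypothetical helper: it only assembles the argument list, it does not
    run bcftools.
    """
    cmd = ["bcftools", "view"]
    if samples:
        # -s takes a comma-separated list of sample IDs to keep
        cmd += ["-s", ",".join(samples)]
    if region:
        # -r restricts output to a region such as "chr20:1-1000000"
        # (requires an indexed input file)
        cmd += ["-r", region]
    # -Oz writes bgzip-compressed VCF, -o names the output file
    cmd += ["-Oz", "-o", vcf_out, vcf_in]
    return cmd


# Placeholder file names, sample IDs and region, for illustration only
cmd = bcftools_subset_command(
    "huge.vcf.gz",
    "small.vcf.gz",
    samples=["S1", "S2"],
    region="chr20:1-1000000",
)
print(shlex.join(cmd))
```

The printed command can be run in a shell, or passed to `subprocess.run(cmd, check=True)`. The resulting `small.vcf.gz` would then be indexed and converted to zarr in the usual way, so only the small store ever needs to live in permanent storage.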
