move external data sets to ExperimentHub #114

Open
jayhesselberth opened this issue Oct 30, 2022 · 3 comments

@jayhesselberth
Member

CRAN / Bioconductor won't allow packages to download external data; anything that does won't pass their checks, so we can't use download.file anywhere.

Need to:

  1. Generate a smaller data set (a sample of the data being downloaded) that can be included in the package and used for the README / vignettes
  2. Put larger data sets in an ExperimentHub
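
For step 2, a minimal sketch of how a larger data set might be looked up once it lives in ExperimentHub. `find_hub_records` and the search term `"djvdj"` are assumptions for illustration only; nothing in this thread confirms that such records exist yet.

```r
# Hypothetical helper: search ExperimentHub for records matching `term`.
# Needs the Bioconductor packages ExperimentHub / AnnotationHub and
# network access at call time (not at definition time).
find_hub_records <- function(term) {
  eh <- ExperimentHub::ExperimentHub()
  AnnotationHub::query(eh, term)
}

# Not run here: "djvdj" is an assumed search term, not a known record.
# hits <- find_hub_records("djvdj")
# obj  <- hits[[1]]  # would download and cache the first matching resource
```

Because hub resources are cached locally after the first retrieval, vignettes could fetch them this way instead of calling download.file directly.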

@sheridar
Member

I'll downsample the current vignette data so we can include it in the package

@jayhesselberth
Member Author

jayhesselberth commented Oct 30, 2022

this seems to work:

# sample 1,000 cells from the splen_so object
library(Seurat)

download.file(
  "https://djvdj-data.s3.us-west-2.amazonaws.com/splenocytes.zip",
  "splenocytes.zip",
  quiet = TRUE
)

unzip("splenocytes.zip", overwrite = FALSE)

# Load Seurat object
load("splenocytes/splen_so.rda")

set.seed(42)
# https://github.com/satijalab/seurat/issues/3108#issuecomment-685975338
splen_so_tiny <- splen_so[, sample(colnames(splen_so), size = 1000, replace = FALSE)]

# xz provides better compression than bzip2 default
usethis::use_data(splen_so_tiny, compress = 'xz')

We would then need to take these cell barcodes and use them to filter the 10x files
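
A rough sketch of that filtering step, assuming the 10x file is a contig annotation CSV with a `barcode` column. `filter_contigs` is a hypothetical helper, and the toy barcodes below are made up. Barcodes in the Seurat object may carry a sample prefix or suffix that would need to be stripped before matching.

```r
# Hypothetical helper: keep only the rows of a 10x CSV whose barcode
# is in `keep_barcodes`, then write the filtered table to `out_csv`.
filter_contigs <- function(contig_csv, keep_barcodes, out_csv) {
  contigs <- read.csv(contig_csv, stringsAsFactors = FALSE)
  stopifnot("barcode" %in% colnames(contigs))
  kept <- contigs[contigs$barcode %in% keep_barcodes, , drop = FALSE]
  write.csv(kept, out_csv, row.names = FALSE)
  invisible(kept)
}

# Toy demonstration with made-up barcodes (not real data)
toy <- data.frame(
  barcode = c("AAAC-1", "AAAG-1", "AAAT-1", "AAAC-1"),
  chain   = c("IGH", "IGK", "TRB", "IGK")
)
in_csv  <- tempfile(fileext = ".csv")
out_csv <- tempfile(fileext = ".csv")
write.csv(toy, in_csv, row.names = FALSE)

kept <- filter_contigs(in_csv, c("AAAC-1", "AAAT-1"), out_csv)
```

With the real data, `keep_barcodes` would be `colnames(splen_so_tiny)` and `contig_csv` the path to the 10x file to be trimmed.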

@jayhesselberth
Member Author

You should shoot for 5 MB or less of packaged data. splen_so_tiny above is ~1.8 MB.
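
One way to keep an eye on that budget is to total the file sizes under `data/`. This is only a sketch: `packaged_data_mb` is a hypothetical helper, and the demonstration writes a throwaway object to a temporary directory rather than touching the package.

```r
# Hypothetical helper: total size (in MB) of all files in a data directory
packaged_data_mb <- function(data_dir = "data") {
  files <- list.files(data_dir, full.names = TRUE, recursive = TRUE)
  sum(file.size(files)) / 1024^2
}

# Toy demonstration in a temporary directory
tmp_dir <- file.path(tempdir(), "data_size_demo")
dir.create(tmp_dir, showWarnings = FALSE)
toy_obj <- data.frame(x = 1:1000, y = rep(c("a", "b"), 500))
save(toy_obj, file = file.path(tmp_dir, "toy_obj.rda"), compress = "xz")

size_mb <- packaged_data_mb(tmp_dir)
within_budget <- size_mb <= 5
```

`tools::checkRdaFiles("data")` gives a per-file view, including the compression each .rda file was saved with.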

sheridar added a commit that referenced this issue Nov 16, 2022
* drop magrittr dependency
* add downsampled data for vignettes

part of #101 and #114