runner.aws_batch: Download .snakemake/metadata/ too #374

Merged
merged 1 commit into from
Jun 17, 2024

tsibley
Member

@tsibley tsibley commented Jun 17, 2024

Snakemake stores per-input/output state information in .snakemake/metadata/ and uses it to determine whether it needs to re-run rules. It seems akin to the file mtimes, which we already take care to preserve on upload/download. Additionally, the recorded metadata is used in Snakemake's report generation and is generally useful for looking at workflow statistics.

Continue to not download all of .snakemake/ en masse because it can potentially contain files that interfere with local usage and/or are large and unnecessary.
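
As a rough illustration of the selection rule described above, the following is a minimal Python sketch of a path filter that keeps files under .snakemake/metadata/ while skipping everything else under .snakemake/. The function name `should_download` and the allow-list constant are hypothetical and chosen for this example only; they are not the actual patterns or code used by the Nextstrain CLI.

```python
from pathlib import PurePosixPath

# Hypothetical allow-list for illustration only; the real CLI may express
# this rule differently (e.g. as include/exclude patterns).
SNAKEMAKE_DIR = ".snakemake"
ALLOWED_SNAKEMAKE_SUBDIRS = {"metadata"}


def should_download(path: str) -> bool:
    """Return True if a remote file path (relative to the build dir) is kept."""
    parts = PurePosixPath(path).parts

    # Anything outside of a top-level .snakemake/ is unaffected by this rule.
    if not parts or parts[0] != SNAKEMAKE_DIR:
        return True

    # Inside .snakemake/, keep only files within allow-listed subdirectories.
    return len(parts) >= 3 and parts[1] in ALLOWED_SNAKEMAKE_SUBDIRS


if __name__ == "__main__":
    for candidate in [
        "results/metadata.tsv",
        ".snakemake/metadata/some-record",
        ".snakemake/log/run.log",
        ".snakemake/conda/env/bin/python",
    ]:
        print(candidate, should_download(candidate))
```

An allow-list fits the stated intent: new subdirectories that Snakemake may add under .snakemake/ stay excluded by default, so only state known to be safe and useful locally is brought back.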

Resolves: #373
Related-to: nextstrain/docker-base#220

Checklist

  • Checks pass

@tsibley tsibley force-pushed the trs/aws-batch/download-snakemake-metadata branch 2 times, most recently from 131c55a to 1a3ba39 Compare June 17, 2024 21:50
@tsibley tsibley requested a review from joverlee521 June 17, 2024 21:51
@tsibley
Member Author

tsibley commented Jun 17, 2024

To test this out, install the CLI from the build for this PR:

curl -fsSL --proto '=https' https://nextstrain.org/cli/installer/mac | bash -s pr-build/374

then submit an AWS Batch build with --image nextstrain/base:branch-trs-upload-snakemake-metadata.

Contributor

@joverlee521 joverlee521 left a comment

Worked as expected! 🎉

I ran the zika ingest workflow via AWS Batch:

nextstrain build \
    --image nextstrain/base:branch-trs-upload-snakemake-metadata \
    --aws-batch \
    ingest

Then generated the Snakemake report locally:

nextstrain build ingest --report

And saw the expected stats:

[Screenshot: Snakemake report statistics, 2024-06-17 at 3:52:49 PM]

tsibley added a commit to nextstrain/docker-base that referenced this pull request Jun 17, 2024
Snakemake stores state information per input/output here and uses it to
determine if it needs to re-run rules or not.  It seems akin to the file
mtimes which we already take care to preserve on upload/download.
Additionally, the metadata recorded is used in Snakemake's report
generation and is generally useful for looking at workflow statistics.

Continue to not upload all of .snakemake/ en masse because it can
potentially contain files that interfere with local usage and/or are
large and unnecessary.

Resolves: <nextstrain/cli#373>
Related-to: <nextstrain/cli#374>
@tsibley tsibley force-pushed the trs/aws-batch/download-snakemake-metadata branch from 1a3ba39 to b14ae65 Compare June 17, 2024 23:53
@tsibley tsibley merged commit 245c99f into master Jun 17, 2024
6 of 7 checks passed
@tsibley tsibley deleted the trs/aws-batch/download-snakemake-metadata branch June 17, 2024 23:53
Linked issue that merging this pull request may close: aws-batch: support Snakemake --report