Move accession to the first column of metadata_all.tsv #36
j23414 authored Feb 24, 2024
2 parents c710ca1 + 3631e90 commit 86012b2
Showing 1 changed file with 8 additions and 1 deletion.
9 changes: 8 additions & 1 deletion phylogenetic/rules/merge_sequences_usvi.smk
@@ -22,7 +22,13 @@ This part of the workflow usually includes the following steps:
"""

rule append_usvi:
-"""Appending USVI sequences"""
+"""Appending USVI sequences
+Notable columns:
+- accession: Either the GenBank accession or USVI accession.
+- genbank_accession: GenBank accession for Auspice to generate a URL to the NCBI GenBank record. Empty for USVI sequences.
+- url: URL used in Auspice, to either link to the USVI github repo (https://github.com/blab/zika-usvi/) or link to the NCBI GenBank record ('https://www.ncbi.nlm.nih.gov/nuccore/*')
+"""
input:
sequences = "data/sequences.fasta",
metadata = "data/metadata.tsv",
@@ -43,5 +49,6 @@ rule append_usvi:
-n accession \
-e '$genbank_accession' \
| csvtk concat -tl - {input.usvi_metadata} \
+| tsv-select -H -f accession --rest last \
> {output.metadata}
"""

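For readers unfamiliar with `tsv-select`, the added pipeline step (`tsv-select -H -f accession --rest last`) appears to treat the first line as a header, output the `accession` column first, and then output all remaining columns in their original order. The Python sketch below mimics that reordering on a small in-memory table. It is an illustration only and not part of the workflow: the function name `move_column_first` and the sample rows are hypothetical, and the column set shown is an assumption based on the docstring above.

```python
def move_column_first(tsv_text, column="accession"):
    """Return tsv_text with `column` moved to the first position.

    Hypothetical helper that mirrors the effect of
    `tsv-select -H -f accession --rest last` on a tab-separated table.
    """
    lines = tsv_text.splitlines()
    header = lines[0].split("\t")
    idx = header.index(column)  # raises ValueError if the column is missing
    # Selected column first, then every other column in its original order.
    order = [idx] + [i for i in range(len(header)) if i != idx]
    reordered = []
    for line in lines:
        fields = line.split("\t")
        reordered.append("\t".join(fields[i] for i in order))
    return "\n".join(reordered) + "\n"


# Hypothetical rows: one GenBank-style record and one USVI-style record
# (the USVI row has an empty genbank_accession, as the docstring describes).
SAMPLE = (
    "strain\taccession\tgenbank_accession\turl\n"
    "sampleA\tACC0001\tACC0001\thttps://www.ncbi.nlm.nih.gov/nuccore/ACC0001\n"
    "sampleB\tUSVI_0001\t\thttps://github.com/blab/zika-usvi/\n"
)

if __name__ == "__main__":
    print(move_column_first(SAMPLE), end="")
```

Because only the column order changes, downstream steps that look up columns by name should be unaffected; the change mainly helps tools that assume the first column is the record ID.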