Skip to content

Commit

Permalink
Update for version 0.1
Browse files Browse the repository at this point in the history
  • Loading branch information
iwc-workflows-bot committed May 2, 2021
0 parents commit 5b43d4a
Show file tree
Hide file tree
Showing 8 changed files with 2,373 additions and 0 deletions.
7 changes: 7 additions & 0 deletions .dockstore.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,7 @@
version: 1.2
workflows:
- name: 'COVID-19-CONSENSUS-CONSTRUCTION'
primaryDescriptorPath: /consensus-from-variation.ga
subclass: Galaxy
testParameterFiles:
- /consensus-from-variation-test.yml
3 changes: 3 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
0.1
---------
- Initial version of COVID-19: consensus construction
28 changes: 28 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,28 @@
COVID-19: consensus construction
--------------------------------

This workflow aims at generating reliable consensus sequences from variant
calls according to transparent criteria that capture at least some of the
complexity of variant calling.

It takes a collection of VCFs and a collection of the corresponding
aligned reads (for the purpose of calculating genome-wide coverage) such as
produced by any of the four variant calling workflows in
https://github.com/galaxyproject/iwc/tree/main/workflows/sars-cov-2-variant-calling
and generates a collection of viral consensus sequences and a multisample FASTA
of all these sequences.

Each consensus sequence is guaranteed to capture all called, filter-passing
variants as defined in the VCF of its sample that reach a user-defined
consensus allele frequency threshold.

Filter-failing variants and variants below a second user-defined minimal
allele frequency threshold will be ignored.

Genomic positions of filter-passing variants with an allele frequency in
between the two thresholds will be hard-masked (with N) in the consensus
sequence of their sample.

Genomic positions with a coverage (calculated from the read alignments input)
below another user-defined threshold will be hard-masked, too, unless they are
consensus variant sites.
23 changes: 23 additions & 0 deletions consensus-from-variation-test.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,23 @@
- doc: Test consensus building from called variants
job:
Reference genome:
class: File
location: 'https://zenodo.org/record/4555735/files/NC_045512.2_reference.fasta?download=1'
aligned reads data for depth calculation:
class: Collection
collection_type: 'list'
elements:
- identifier: SRR11578257
class: File
path: test-data/aligned_reads_for_coverage.bam
Variant calls:
class: Collection
collection_type: 'list'
elements:
- identifier: SRR11578257
class: File
path: test-data/final_snpeff_annotated_variants.vcf
outputs:
multisample_consensus_fasta:
file: test-data/masked_consensus.fa

Loading

0 comments on commit 5b43d4a

Please sign in to comment.