-
Notifications
You must be signed in to change notification settings - Fork 260
Closed
Description
when running:
bcftools consensus --output results/SPFL100652/references/consensus.bcftools.fasta --chain results/SPFL100652/references/consensus.bcftools.chain --fasta-ref references/concat_EPI_ISL_412866_EPI_ISL_1653999.fasta --mark-del - --mask results/SPFL100652/references/coverage_mask_lowcoverage.bed --mask-with n -H A -i "INFO/AD[0]<INFO/AD[*]" results/SPFL100652/references/consensus.bcftools.bcf.gzThe resulting .fasta and .chain files only contain a consensus of the first sequence (the first chrom) - EPI_ISL_412866 (RSV-A) - present in the reference file. The second sequence - EPI_ISL_1653999 (RSV-B) - is missing from the output, even if it is present in the reference fasta, the pileup bcf and the mask bed inputs.
It happens in both bcftools 1.20 (that we currently use) and bcftools 1.22 (the most recent avilable on bioconda).
This ZIP file contains all input and outputs mentionned above: SPFL100652.zip
Am I doing something wrong? Passing the wrong parameters or something?
(This could be overlapping with #2052 maybe ? That issues also mentions some sequence not being present in the output FASTA)
Context:
- This is the last step of the rule in V-pipe that genereates consensus sequences using bcftools.
- These are swab test samples of RSV viruses (sub-group unknown at this stage) and we're aligning the reads against a fasta file with references that includes sequence for both sub-group A and sub-group B RSV.
- The expected behaviour is that we get a consensus with two sequences (potentially one sequence being mostly or entirely
ndepending on which sub-group was present in the swab sample) - Currently we only get EPI_ISL_412866 (RSV-A) outputs.
Metadata
Metadata
Assignees
Labels
No labels