Ancient DNA postprocessing pipeline (Human)

A nextflow pipeline for the basic post-processing of human shotgun or capture libraries (not metagenomics samples). See the overview below for the workflow.

Prerequisites

The pipeline runs with

Nextflow v22.10 or larger
Singularity or Docker

Note: To run nextflow+singularity, your kernel needs to support user-namespaces (see here or here). NOTE: The pipeline is configured to work within the computational environment of the Max-Planck-Institute for Evolutionary Anthropology.

RUN

NXF_VER=24.04.4
nextflow run mpieva/postprocessing -r v0.5 --split SPLIT -profile PROFILE [OPTIONS]

use the -r v0.5 flag to run a specific version of the pipeline

SPLIT

The pipeline starts with a directory of already demultiplexed and mapped BAM-files, provided with the --split flag. Unmapped and Paired sequences are removed in the analyzeBAM step.

OPTIONS

--help                                             Display the HELP Text
--split                        PATH     [required] A directory with demultiplexed and mapped BAM-files
                                                   (mapped to the genome specified in --reference_file)
--reference_file               FASTA    [required] Reference genome used for mapping, required for 'samtools calmd'
--reference_name               NAME     [required] The folder/name of the reference in '/mnt/solexa/Genomes/' (e.g. hg19_evan).
                                                   Used for naming output files and for double-checking the bam-header 
--target_file                  BED      [optional] Targetfile (BED) for subsetting BAM to 'ontarget' sequences
--target_name                  NAME     [optional] Name of the target for naming output-files (default: shotgun)

--bamfilter_minlength          N        [optional] Minimum length of retained sequences (default: 35)
--bamfilter_minqual            N        [optional] Minimum mapping quality of retained sequences (default: 25)
--bamfilter_keep_vendorfail             [optional] Keep reads in bamfile that have the "vendor failed" flag

--bamrmdup_cheap                        [optional] Bam-rmdup 'cheap' computation: skip the consensus calling
--bamrmdup_circular            CHR:LEN  [optional] Bam-rmdup 'circular' option - CHR is circular with length LEN

PROFILES

Profiles can be set with the -profile flag (only one dash!!). They preserve parameters (target-file and references) for different common analyses.

The follwing profiles are available

shotgun {
  reference_file = "/mnt/solexa/Genomes/hg19_evan/whole_genome.fa"
  reference_name = "hg19_evan"
  target_name    = "shotgun"
  target_file    = false
}
AA108_AA115_archaicAdmixture {
  reference_file = "/mnt/solexa/Genomes/hg19_evan/whole_genome.fa"
  reference_name = "hg19_evan"
  target_name    = "AA108_AA115_archaicAdmixture"
  target_file    = "/home/public/AncientDNA/probe_designs/AA108-115_archaic_admixture/Archaic.align.noN.sorted.bed"
}
Reich_1240k {
  reference_file = "/mnt/solexa/Genomes/hg19_evan/whole_genome.fa"
  reference_name = "hg19_evan"
  target_name    = "Reich_1240k"
  target_file    = "/mnt/archgen/Reference_Genomes/Human/hs37d5/SNPCapBEDs/1240K.pos.list_hs37d5.0based.bed"
}

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
assets		assets
conf		conf
modules/local		modules/local
src		src
workflows		workflows
CHANGELOG.md		CHANGELOG.md
LICENSE.md		LICENSE.md
README.md		README.md
main.nf		main.nf
nextflow.config		nextflow.config

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Ancient DNA postprocessing pipeline (Human)

Prerequisites

RUN

SPLIT

OPTIONS

PROFILES

Pipeline Overview

Contributions

About

Uh oh!

Releases

Packages

Languages

License

mpieva/postprocessing

Folders and files

Latest commit

History

Repository files navigation

Ancient DNA postprocessing pipeline (Human)

Prerequisites

RUN

SPLIT

OPTIONS

PROFILES

Pipeline Overview

Contributions

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages