🦠 Viral Detection Pipeline

This repository contains a reproducible pipeline for detecting viral sequences from sequencing data.
It can be run using Docker (recommended for full reproducibility) or alternatively with Conda if Docker is unavailable.

📋 Table of Contents

Overview
Option 1: Run with Docker (Recommended)
Option 2: Run with Conda
Input Configuration
Output
Updating or Removing Environments

🧬 Overview

This pipeline performs the following key steps:

Quality control and alignment of sequencing reads
Viral sequence identification using bowtie2 and STAR
Variant calling with bcftools
Optional downstream analysis and summary statistics

The codebase supports:

Docker for fully containerized execution
Conda for systems where Docker cannot be used

🐋 Option 1: Run with Docker (Recommended)

Docker ensures complete reproducibility with all dependencies pre-installed.

Prerequisites

Docker installed
Internet connection for image building (first time only)

Clone this repository

git clone https://github.com/nickcjacobs/ViralDetection.git
cd ViralDetection

Build the Docker image

From the repository root:

docker build -t viral_detection .

Run the pipeline

docker run --rm -v $(pwd):/app -w /app viral_detection \
    bash bin/viral_detection.sh config/pipeline_input.txt

Explanation:

-v $(pwd):/app mounts your current directory into the container

-w /app sets the working directory inside the container

The pipeline reads parameters from config/pipeline_input.txt

🧫 Option 2: Run with Conda

If Docker is not available, you can run the same pipeline in a Conda environment.

Prerequisites

Miniconda or Mambaforge

Step 1: Clone this repository

git clone https://github.com/nickcjacobs/ViralDetection.git
cd ViralDetection

Step 2: Create and activate the environment

conda env create -f environment.yml
conda activate viral_detection

This installs:

samtools, bcftools, bowtie2, STAR, seqtk, parallel, pysam, and other required tools.

Step 3: Run the pipeline

bash bin/viral_detection.sh config/pipeline_input.txt

⚙️ Input Configuration

All input file paths and settings are specified in:

config/pipeline_input.txt

Ensure this file includes the correct paths to your FASTQ files, reference genome, and other required inputs before running the pipeline.

📊 Output

The pipeline produces:

Processed and aligned reads

Detected viral sequences

Variant calls (.vcf files)

Summary and log files in the designated output folder

Output locations and naming conventions are controlled by your configuration file.

🔄 Updating or Removing Environments

If you modify environment.yml and want to apply updates:

conda env update -f environment.yml --prune

To remove the Conda environment entirely:

conda remove --name viral_detection --all

Pull requests are welcome! If you’d like to add new features or improve existing ones:

Fork this repository

Create a feature branch

Submit a pull request describing your changes

For major updates, please open an issue first to discuss proposed modifications.

Maintainer: Nick Jacobs Repository: github.com/KlugerLab/ViralDetection

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
bin		bin
config		config
data		data
Dockerfile		Dockerfile
README.md		README.md
environment.yml		environment.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

🦠 Viral Detection Pipeline

📋 Table of Contents

🧬 Overview

🐋 Option 1: Run with Docker (Recommended)

Prerequisites

Clone this repository

Build the Docker image

Run the pipeline

🧫 Option 2: Run with Conda

Prerequisites

Step 1: Clone this repository

Step 2: Create and activate the environment

Step 3: Run the pipeline

⚙️ Input Configuration

📊 Output

🔄 Updating or Removing Environments

About

Uh oh!

Releases

Packages

Languages

Uh oh!

Uh oh!

KlugerLab/ViralDetection

Folders and files

Latest commit

History

Repository files navigation

🦠 Viral Detection Pipeline

📋 Table of Contents

🧬 Overview

🐋 Option 1: Run with Docker (Recommended)

Prerequisites

Clone this repository

Build the Docker image

Run the pipeline

🧫 Option 2: Run with Conda

Prerequisites

Step 1: Clone this repository

Step 2: Create and activate the environment

Step 3: Run the pipeline

⚙️ Input Configuration

📊 Output

🔄 Updating or Removing Environments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages