This repository contains a Bash script for automating various steps in a metagenomic analysis pipeline. The script merges .fq files, performs adapter trimming, and classifies reads using Kraken.
Clone this repository to your local machine.
git clone https://github.com/Zubair2021/metagenome_analysis.git- Bash: Make sure you have Bash shell installed on your system.
- Kraken2: Download and install from Kraken2 GitHub Repository.
- Bracken: Download and install from Bracken GitHub Repository.
You can typically install Kraken2 and Bracken using the following commands:
# For Kraken2
git clone https://github.com/DerrickWood/kraken2.git
cd kraken2
./install_kraken2.sh $PWD
# For Bracken
git clone https://github.com/jenniferlu717/Bracken.git
cd Bracken
./install_bracken.sh-
Kraken2 Database: You can build a Kraken2 database following the instructions here.
kraken2-build --standard --threads 12 --db $KRAKEN_DB_NAME -
Bracken Database: After building the Kraken2 database, you can build the Bracken database.
bracken-build -d $KRAKEN_DB_NAME -k 35 -l 150
Note: Replace $KRAKEN_DB_NAME with the name you wish to give to your Kraken2 database.
BASE_DIR: The base directory containing nested folders with.fqfiles.TARGET_DIR: Target directory to store merged.fqfiles and output files.TRUSEQ_ADAPTERS: Path to the adapter sequences in.faformat.KRAKEN_DB: Path to the Kraken database.
To run the script, navigate to the directory containing the script and execute:
bash metagenome_full_pipeline.shBASE_DIR="/path/to/raw_data"
TARGET_DIR="/path/to/output"
TRUSEQ_ADAPTERS="/path/to/adapters.fa"
KRAKEN_DB="/path/to/krakenDB"The output directory will contain the following:
- Merged
.fqfiles - Trimmed
.fqfiles - Kraken classification reports
If you wish to contribute, please open an issue or submit a pull request.