Skip to content

Geize/Processing-16S

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

8 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

16S rRNA analysis pipeline


The propose of this page is the reproducibility of the 16S data described in my publications (e.g.,Tomazetto et al.) or those are coming up soon. If you are looking for a tutorial, here is the best one you can have.

However, you are more than welcome to reach out to me if you need help. Do not worry, I won't jump into your work. But I'll be glad if you can cite one of my works as a reference. 😊


1. Login to the server:

  • Install qiime2.
  • Run the activate step.
  • Test if qiime2 successfully activated πŸ”₯- very important step.
qiime --help

  • Before you exit the server, create a working directory.
  • Now exit, please. 😊

2. Transfer the fastq.gz files from your computer/HDD to the server.

(base) ➜ ~  scp data/*.fastq.gz  mylogin@login:/home/geize/16S/

3. Create the sample-metadata file.

I usually create metadata files according to the project and what is need to compare. However, below you find of an example metadata file. You can also add other information as much as you wish.

πŸ”₯Important note.
Do not forget to save the file as tab-delimited - it's the best option. Validate your metadata using a Keemei.

#SampleID   Description   Condition   ...
Sample01    Soil    Ref      ...
Sample02    Cons    A        ...
Sample03    Cons    B        ...
and dadada...


4.Back to the server to create the manifest file.

The following python script can help you to create a manifest file from any directory. Just type the command bellow to get more instructions - very fancy. 😁

(base) ➜ ~  python Write_manifestFile.py -help


5. Processing the data.

Once you have the manifest file, DrT_Qiime2.sh, and of course, fastq.gz in a directory, you can run the script. All the steps are wrapped in a single script, including the importing the data, denoising, trimming, joining, table ASVs, tree, and basic diverstiy analysis. The script also has the steps to convert QIIME2 artifacts into visualization files, which you can view using qiime tools view or here.

Check out the output files and directories!

πŸ”₯1. Important note.
The taxonomic classification was set up for amplicon libraries constructed using 515F/806R primer pair - the V4 region.

πŸ”₯2. Important note.

Open the file qiime2_OhNO.err. Try to find a message indicating error. Hope, you won't find anything. πŸ˜…

The qiime tools validate command was added after some critical steps. Open the qiime2_GreatJOb.out file, and check if there are similar message "Result importing.qza appears to be valid at level=max." after each step.


6. Plotting some graphics.

Here, you also find a R script to plot some graphics. Of course, you can plot different graphics emphasizing "little this or little that". It is up to you.


All the best for us! πŸ€“


About

Scripts for processing of 16S rRNA dataset

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages