The following need to be installed in order to work properly:
- fastqc - fastq quality checking tool
- STAR - rna-seq alignment tool
- cufflinks - outputs gene structure information from rna-seq reads
- samtools - sort SAM file and output to BAM or SAM
Useful:
Put the inputs (reference fasta and rna-seq read fastq) inside of the inputs folder
Run chmod 755 pipeline.sh
in order to give read/execution permissions to others
Run ./pipeline.sh –c
in order to clear the outputs of the previous run.
Run ./pipeline.sh
or sh pipeline.sh
in order to run the pipeline.
The fastqc outputs can be seen in the fastqc_results
folder.
The cufflinks results can be seen in the cufflinks_results
folder.
The star results will be outputted in the folder that pipeline.sh resides.
This pipeline can take a long time as STAR can take a long time to run especially on large sets of data.