Project of the Master of Bioinformatics (AMI2B) of the University Paris-Saclay realized by :
The goal is to reproduce parts of the analysis described in these papers (to read):
They performed RNA-Seq in samples from patients with uveal melanoma. Some samples are mutated in SF3B1 . We want to analyze this data in order to find differentially expressed genes, i.e. genes that are more (or less) expressed in one condition (SF3B1 mutated samples) compared to another (SF3B1 non mutated samples).
To do this, we have designed and implemented a reproductible workflow.
- A Directory Tools to build all containers (using Singularity) with the tools that will be used in the workflow.
- A Directory Workflow with the rules and files (using Snakemake) needed for the workflow.
- A Script run.sh to execute the workflow.
- A README file
"French Institute of Bioinformatics (IFB) provides life scientists with a federation of clouds, Biosphere, and bioinformatics cloud services to analyze life science data. Biosphere is used for scientific production in the life sciences, developments, and to support events like cloud and scientific training sessions, hackathons or workshops."
For the moment the pipeline work for a VM BioPipes "ifb..mxlarge (16 vCPU, 64Go GB RAM, 400Go local Disk)"
It should also work for a VM of 8 CPUs, below 8 the indexing of the whole genome is impossible.
./run.sh