Integration of proteomics with genomics and transcriptomics increases the diagnosis rate of Mendelian disorders

This project contains different scripts to automatize and visualize analysis performed for the "Integration of proteomics with genomics and transcriptomics increases the diagnosis rate of Mendelian disorders" paper.

Webserver, produced as one of the outputs of the pipeline.

Project structure

This project is setup as a wBuild workflow. This is an automatic build tool for R reports based on snakemake.

The wbuild.yaml is the main configuration file to setup up the workflow
The Scripts folder contains scripts which will be rendered as HTML reports
The src folder contains additional helper functions and scripts
The Output folder will contain all files produced in the analysis pipeline
- Output/html contains the final HTML report

Data and prerequisites

This project depends on the packages wBuild and PROTRIDER, developed by Gagneur Lab

The pipeline starts with the series of files available via Zenodo: DOI:10.5281/zenodo.4501904

raw_data
- proteomics_annotation.tsv - sample annotation
- proteomics_not_normalized.tsv - Proteomics intensity matrix
- raw_counts.tsv - RNA-seq count matrix
- Patient_HPO_phenotypes.tsv - Phenotype data recorded using HPO terms for diagnosed cases.
- enrichment_proportions_variants.tsv - Results of rare variant enrichment/proportion analysis calculated on the full dataset.
- patient_variant_hpo_data.tsv - Gene annotation for all individuals. Since the genetic data are not publicly shareable, we provide only gene-level information for outlier genes only.
datasets
- disease_genes.tsv - List of Mendelian disease genes aggregated from several studies.
- HGNC_mito_groups.tsv - Subset of HGNC gene groups related to mitochondria.
  
  Downloaded automatically:
- gencode.v29lift37.annotation.gtf.gz - Gene-level model based on the GENCODE 29 transcript model
- Table_S1_gene_info_at_protein_level.xlsx - Supplementary Tble1 from GTEx proteomics study Jiang et al, 2020, Cell Data is available at the GTEx page
- allComplexes.txt - CORUM protein complexes, available at CORUM web page

The proteomic raw data and MaxQuant search files have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository and can be accessed using the dataset identifier PXD022803

Repository setup

First download the repo and its dependencies:

# analysis code
git clone https://github.com/prokischlab/omicsDiagnostics
cd omicsDiagnostics

and install wbuild using pip by running.

pip install wBuild
wbuild init

Since wBuild init will reset the current Snakefile, readme.md, and wbuild.yaml we have to revert them again with git.

git checkout Snakefile
git checkout wbuild.yaml
git checkout readme.md

Next clone outrider2 branch from original OUTRIDER repository. OUTRIDER2 includes implementation of protrider algorithm.

# OUTRIDER2 to detect outliers in proteomics data 
git clone --branch outrider2 https://github.com/gagneurlab/OUTRIDER.git

Specify correct file and folder locations in the wbuild.yaml. For higher stability we recommend specifying of full paths.

Install dependencies

Create Conda environment
- conda env create --name omicsDiagnosticsMinimal --file=environment.yml
R packages
- Make sure that data.table is installed or install with install.packages("data.table")
- Rscript src/installRPackages.R src/requirementsR.txt

Run the full pipeline

To run the full pipeline, execute the following commands with 10 cores in parallel:

conda activate omicsDiagnosticsMinimal
snakemake graph
snakemake -c 10

If dag doesn't work run: snakemake --snakefile Snakefile.dag --dag | dot -Tpng > dag.png

omicsDiagnostics

A comprehensive tool for analyzing and visualizing multi-omics data in the context of rare disease diagnostics.

Features

Integration of RNA and protein expression data
Visualization of patient-specific omics profiles
Interactive exploration of genetic variants
Phenotype similarity analysis
Protein complex analysis

Shiny App

The application is available online at: https://prokischlab.shinyapps.io/omicsDiagnosticsAPP/

Installation

Clone this repository

Install required R packages:

install.packages(c("shiny", "data.table", "plotly", "DT", "yaml", "gganatogram", 
                  "shinyjs", "shinybusy", "shinyWidgets", "shinythemes", "tippy", "bslib"))

Usage

Run the app locally:
```
shiny::runApp("omicsDiagnosticsAPP")
```
Or use the online version at https://prokischlab.shinyapps.io/omicsDiagnosticsAPP/

Data Preparation

The app uses pre-processed data stored in the shiny_data directory. To prepare the data:

Run the data preparation script:

source("omicsDiagnosticsAPP/prepare_shiny_data.R")

Deployment

To deploy the app to ShinyApps.io:

Make sure you have the rsconnect package installed
Run the deployment script:
```
source("omicsDiagnosticsAPP/deploy.R")
```

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contact

For questions or support, please contact the development team.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Integration of proteomics with genomics and transcriptomics increases the diagnosis rate of Mendelian disorders

Project structure

Data and prerequisites

Repository setup

Install dependencies

Run the full pipeline

omicsDiagnostics

Features

Shiny App

Installation

Usage

Data Preparation

Deployment

License

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
Scripts		Scripts
omicsDiagnosticsAPP		omicsDiagnosticsAPP
rsconnect/shinyapps.io/prokischlab		rsconnect/shinyapps.io/prokischlab
src		src
.Rbuildignore		.Rbuildignore
.gitignore		.gitignore
LICENSE		LICENSE
Snakefile		Snakefile
Snakefile.dag		Snakefile.dag
app.R		app.R
config_protrider.yml		config_protrider.yml
dag.png		dag.png
deploy.R		deploy.R
environment.yml		environment.yml
manifest.json		manifest.json
readme.md		readme.md
requirements.txt		requirements.txt
wbuild.yaml		wbuild.yaml

License

prokischlab/omicsDiagnostics

Folders and files

Latest commit

History

Repository files navigation

Integration of proteomics with genomics and transcriptomics increases the diagnosis rate of Mendelian disorders

Project structure

Data and prerequisites

Repository setup

Install dependencies

Run the full pipeline

omicsDiagnostics

Features

Shiny App

Installation

Usage

Data Preparation

Deployment

License

Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages