ChemRICH

Chemical Similarity Enrichment analysis of metabolomics datasets.

Assuming statistical independence among metabolites is incorrect for metabolomics datasets because of 1) metabolic pathways, 2) a common origin, 3) genetic regulation of metabolism, and 4) chemical similarity among metabolites. Therefore, any p-value adjustment that corrects the raw entity-level p-values for multiple hypothesis testing causes false negative results, leading to missed biological insights from metabolomics datasets.

Classical pathway analyses provide a narrow interpretation of metabolomics datasets for two major reasons: 1) biochemical databases are incomplete for metabolomics, so the analysis is biased towards only a handful of selected compounds, and 2) a hypergeometric test relies on a background database, which does not exist for metabolomics datasets.

Alternatively, we can define data-driven and chemistry-driven compound sets. Those sets can be used by a test that does not depend on a background database, such as the Kolmogorov–Smirnov test (KS test), to obtain the set-level significance. There are several other ways to define chemical sets, including chemical ontologies such as MeSH.
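The set-level test described above can be sketched in a few lines of Python. This is an illustration only, not the ChemRICH code (which is written in R): the function name `ks_uniform_pvalue`, the set names, and the p-values are all made up for the example, and the p-value uses the asymptotic Kolmogorov series with a common small-sample correction, so it is approximate for small sets.

```python
import math

def ks_uniform_pvalue(pvalues):
    """One-sample KS test of a set's p-values against Uniform(0, 1).

    Returns (D, approximate p-value). Hypothetical helper for illustration;
    the p-value is an asymptotic approximation, not an exact small-sample value.
    """
    x = sorted(pvalues)
    n = len(x)
    if n == 0:
        raise ValueError("a compound set needs at least one p-value")
    # D: largest gap between the empirical CDF and the uniform CDF F(p) = p
    d = max(max((i + 1) / n - p, p - i / n) for i, p in enumerate(x))
    # Small-sample correction applied to sqrt(n) * D (an approximation)
    lam = (math.sqrt(n) + 0.12 + 0.11 / math.sqrt(n)) * d
    if lam < 0.2:
        # The series converges slowly here and the p-value is effectively 1
        return d, 1.0
    # Kolmogorov survival function: 2 * sum_{k>=1} (-1)^(k-1) * exp(-2 k^2 lam^2)
    total = sum((-1) ** (k - 1) * math.exp(-2.0 * k * k * lam * lam)
                for k in range(1, 101))
    return d, min(1.0, max(0.0, 2.0 * total))

# Made-up compound sets mapping a set name to metabolite-level p-values
compound_sets = {
    "class_A": [0.001, 0.002, 0.004, 0.01, 0.03],  # skewed towards zero
    "class_B": [0.1, 0.3, 0.5, 0.7, 0.9],          # evenly spread
}

set_level = {name: ks_uniform_pvalue(p)[1] for name, p in compound_sets.items()}
for name, p in set_level.items():
    print(name, p)
```

No background database is needed here: each set is compared only with the uniform distribution, so the sets can come from any source, such as chemical classes, correlation modules, or ontology terms.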

ChemRICH is an approach that defines chemical classes for metabolites and then runs a KS test to obtain the set-level p-values. It uses chemical similarity against the MeSH database to obtain chemical classes, but users can also provide their own chemical classes to run the KS test. The underlying hypothesis is that "P-values under the null hypothesis are uniformly distributed between 0 and 1" (https://www.mv.helsinki.fi/home/mjxpirin/GWAS_course/material/GWAS2.html).
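The uniformity assumption can be checked with a small simulation. The sketch below is not part of ChemRICH: it assumes a simple two-sided z-test as a stand-in for whatever metabolite-level test is used, and the names `two_sided_p`, `null_p`, and `fractions` are hypothetical. If the p-values are uniform, the fraction falling below any threshold t should be close to t.

```python
import math
import random

def two_sided_p(z):
    # Two-sided p-value for a standard normal test statistic
    return math.erfc(abs(z) / math.sqrt(2.0))

# Simulate test statistics with no true effect (the null hypothesis holds)
rng = random.Random(0)
null_p = [two_sided_p(rng.gauss(0.0, 1.0)) for _ in range(20000)]

# For uniform p-values, the share below each threshold t is approximately t
fractions = {t: sum(p < t for p in null_p) / len(null_p)
             for t in (0.05, 0.25, 0.5)}
for t, frac in fractions.items():
    print(f"fraction below {t}: {frac:.3f}")
```

A chemical class whose p-values pile up near zero departs from this flat pattern, and that departure is what the KS test measures.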

A new version of ChemRICH has been developed in R (version 4.0).

Visit the new ChemRICH site at ChemRICH.idsl.me (updated August 2020)

ChemRICH is maintained and developed by the Integrated Data Science Laboratory for Metabolomics and Exposomics (IDSL.ME)

Repository files:

.Rproj.user
20181015_KOMP_chemrich_input.xlsx
README.md
chemrich_1_ChemRICH_results.xlsx
chemrich_chemical_classes.R
chemrich_input_any_set_regression.xlsx
chemrich_input_chemical_classes.xlsx
chemrich_input_correlation_modules.xlsx
chemrich_input_lipids.xlsx
chemrich_input_mesh_classes.xlsx
chemrich_input_mesh_prediction.xlsx
chemrich_input_metabolon_subpathway.xlsx
chemrich_input_minimum.xlsx
chemrich_minimum_analysis.R
chemrich_multiple_groups.R
cidmesh_smiles_fpbit.RData
correlation_modules.R
correlation_modules_prediction_regression.xlsx
iarc_features_for_correg_modules.xlsx
iarc_features_input_chemrich.xlsx
mesh_bit_loc_list.RData
predict_mesh_chemical_class.R
treenames.df.RData
