Alan Murphy and Nathan Skene 2021-04-08
The MungeSumstats package is designed to facilitate the standardisation of GWAS summary statistics as utilised in our Nature Genetics paper.1
The package is designed to handle the lack of standardisation of output files by the GWAS community. There is a group who have now manually standardised many GWAS: R interface to the IEU GWAS database API • ieugwasr and gwasvcf but because a lot of GWAS remain closed access, these repositories are not all encompassing.
MungeSumstats provides a framework to standardise the format fo any GWAS summary statistics enabling downstream integration and analysis. The package works by addressing the most common discrepancies across summary statistics.
The MungeSumstats package is available from github. To be able to install the package one needs to run the following lines of code:
if (!require("devtools")) {
install.packages("devtools")
}
devtools::install_github("neurogenomics/MungeSumstats")
You can then load the package and data package:
library(MungeSumstats)
See the vignette for use cases of MungeSumstats.
If you have any problems please do file an issue here on github.
If you use the MungeSumstats package as well then please cite
1. Nathan G. Skene, T. E. B., Julien Bryois. Genetic identification of brain cell types underlying schizophrenia. Nature Genetics (2018). doi:10.1038/s41588-018-0129-5