Y. David Chen, Ph.D.
Manuscript title: Chen et al. Extensive epigenomic dysregulation is a hallmark of homologous recombination deficiency in triple-negative breast cancer
dataproc/
Essential bioinformatic data preprocessing procedures e.g. MethylationEPIC processingmain/
Core statistical analyses and associated data visualization, e.g. differential methylation analysissuppl/
Supplemental analyses, including machine learning (ML) modelingutils.R
andplotThemes.R
contain reusable, global helpers and constants for importing in any R scripts
Scripts used for data download/acquisition, data cleansing, exploratory data analysis (EDA) not included in the final paper, and constants/PATHs/CONFIGs storage may be excluded/masked.
- MethylationEPIC QC, preprocessing, and data prep
- Preprocessing and extraction of segmented copy number from methylation arrays
- Study population summarization (Table One)
- Fraction of Genome Altered
- Methylation Dysregulation Index
- Promoter hypermethylation of BRCA1 and RAD51C
- Global methylation clusters with RPMM
- Methylation at transcriptional regulatory regions
- Age-adjusted EWAS
- Genomic context enrichment
- Methylation heatmap visualization at various levels
- Supervised prediction of subject race with SNP probes
- Supervised learning of continuous HRD probability scores and biological discovery on TCGA
Materials from this repository may be subject to copyright. Use the materials responsibly and at your own risk.
If you referenced the datasets, findings, or methods including code from this work, please include the appropriate citation(s).