Clusterization of the MDS disease by mutation and cytogenetics features and study of the clinical features distribution across the 11 so-found clusters.
Built with R version 3.5.1
-
Nicola Roberts HDP R package (GitHub link) which implements the hierarchical Dirichlet process:
devtools::install_github("nicolaroberts/hdp", build_vignettes = TRUE)
-
Various R libraries:
ggplot2
reshape2
gridExtra
tibble
survminer
survival
glmnet
RColorBrewer
IRdisplay
- improve heatmap plot
- axes and title for each plot
- split genes by hotspots
- top genes by component
- try dataset with all genes and cytogenetics
- understand theory better
- try other clustering method
- play with hdp parameters