Proposed Analysis: Chromothripsis analysis with ShatterSeek #1007
Description
What are the scientific goals of the analysis?
- Identify candidate chromothripsis regions based on SV and CNV calls
- Classify tumors as +/- chromothripsis
- Plot chromothripsis incidence by tumor subtype
What methods do you plan to use to accomplish the scientific goals?
ShatterSeek (already installed in OpenPBTA Docker). I plan to build off of the analysis started for #393. I will create a new analysis module, but I may adapt code from the existing sv-analysis module. I will most likely apply 01-process-sv-file.R, which was reviewed here, for preprocessing. Other code may also be adapted depending on its interpretability with limited documentation.
What input data are required for this analysis?
pbta-sv-manta.tsv.gz
pbta-cnv-consensus.seg.gz
How long do you expect is needed to complete the analysis? Will it be a multi-step analysis?
I aim to complete this within a week, but it may depend on how accurately chromothripsis regions can be identified based on criteria recommended by the authors. A subset of candidate regions will be manually inspected to evaluate accuracy.
Who will complete the analysis (please add a GitHub handle here if relevant)?
What relevant scientific literature relates to this analysis?
Cortés-Ciriano, I., Lee, J.JK., Xi, R. et al. Comprehensive analysis of chromothripsis in 2,658 human cancers using whole-genome sequencing. Nat Genet 52, 331–341 (2020). https://doi.org/10.1038/s41588-019-0576-7
Activity