Many Datasets Include Genes that are not in genes.csv #407

Open
@jjacobson95

Many dataset files include genes that are not present in the genes.csv file. These non-overlapping genes should be dropped.

This was originally identified in #405, but is much more widespread than that.

| Dataset | Datatype | Overlap | Non-Overlap |
| --- | --- | --- | --- |
| beataml | mutations | 3064 | 1 |
| beataml | proteomics | 6850 | 1 |
| beataml | transcriptomics | 19141 | 1 |
| bladderpdo | copy_number | 9019 | 2121 |
| bladderpdo | mutations | 265 | 1 |
| bladderpdo | transcriptomics | 19338 | 0 |
| ccle | copy_number | 18929 | 0 |
| ccle | mutations | 18364 | 736 |
| ccle | proteomics | 11454 | 0 |
| ccle | transcriptomics | 18870 | 0 |
| cptac | copy_number | 19033 | 1 |
| cptac | mutations | 18002 | 864 |
| cptac | proteomics | 14706 | 1 |
| cptac | transcriptomics | 19023 | 1 |
| ctrpv2 | copy_number | 18929 | 0 |
| ctrpv2 | mutations | 18489 | 949 |
| ctrpv2 | proteomics | 11454 | 0 |
| ctrpv2 | transcriptomics | 18870 | 0 |
| fimm | copy_number | 18928 | 0 |
| fimm | mutations | 11466 | 84 |
| fimm | proteomics | 11298 | 0 |
| fimm | transcriptomics | 18870 | 0 |
| gcsi | copy_number | 18929 | 0 |
| gcsi | mutations | 18436 | 765 |
| gcsi | proteomics | 11454 | 0 |
| gcsi | transcriptomics | 18870 | 0 |
| gdscv1 | copy_number | 17715 | 0 |
| gdscv1 | mutations | 18707 | 0 |
| gdscv1 | proteomics | 8333 | 0 |
| gdscv1 | transcriptomics | 19016 | 0 |
| gdscv2 | copy_number | 17715 | 0 |
| gdscv2 | mutations | 18698 | 0 |
| gdscv2 | proteomics | 8333 | 0 |
| gdscv2 | transcriptomics | 19016 | 0 |
| hcmi | copy_number | 19026 | 0 |
| hcmi | mutations | 17160 | 542 |
| hcmi | transcriptomics | 19012 | 0 |
| mpnst | copy_number | 19058 | 0 |
| mpnst | mutations | 5709 | 0 |
| mpnst | proteomics | 8801 | 0 |
| mpnst | transcriptomics | 5299 | 0 |
| mpnstpdx | copy_number | 19058 | 0 |
| mpnstpdx | mutations | 13343 | 0 |
| mpnstpdx | proteomics | 8801 | 0 |
| mpnstpdx | transcriptomics | 5613 | 0 |
| nci60 | copy_number | 18929 | 0 |
| nci60 | mutations | 16421 | 242 |
| nci60 | proteomics | 11371 | 0 |
| nci60 | transcriptomics | 18870 | 0 |
| pancpdo | copy_number | 17680 | 0 |
| pancpdo | mutations | 15610 | 0 |
| pancpdo | transcriptomics | 19012 | 0 |
| prism | copy_number | 18929 | 0 |
| prism | mutations | 18414 | 746 |
| prism | proteomics | 11454 | 0 |
| prism | transcriptomics | 18870 | 0 |
| sarcpdo | mutations | 76 | 0 |
| sarcpdo | transcriptomics | 17895 | 0 |
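
For reference, here is a minimal sketch of how the overlap counts above could be reproduced and how the non-overlapping genes could be dropped. It is not the project's actual build code. It assumes that genes.csv and each data file share a gene identifier column, here called `entrez_id` (an assumption, as is `sample_id`), and that the counts refer to distinct gene identifiers rather than rows. The helper names and the inline sample data are hypothetical and only illustrate the idea.

```python
import csv
import io


def load_gene_ids(genes_file, id_column="entrez_id"):
    """Return the set of gene identifiers listed in a genes.csv-style file."""
    return {row[id_column] for row in csv.DictReader(genes_file)}


def count_overlap(rows, known_ids, id_column="entrez_id"):
    """Count distinct gene IDs in `rows` that are / are not in `known_ids`."""
    seen = {row[id_column] for row in rows}
    return len(seen & known_ids), len(seen - known_ids)


def drop_unknown_genes(rows, known_ids, id_column="entrez_id"):
    """Keep only the rows whose gene ID appears in `known_ids`."""
    return [row for row in rows if row[id_column] in known_ids]


# Tiny made-up inputs standing in for genes.csv and one data file.
# Column names are assumptions; the values are not real dataset contents.
genes_csv = io.StringIO("entrez_id,gene_symbol\n1,A1BG\n2,A2M\n")
data_csv = io.StringIO(
    "entrez_id,sample_id,value\n"
    "1,10,0.5\n"
    "2,10,0.7\n"
    "999,10,0.1\n"
    "1,11,0.2\n"
)

known = load_gene_ids(genes_csv)
rows = list(csv.DictReader(data_csv))

overlap, non_overlap = count_overlap(rows, known)
filtered = drop_unknown_genes(rows, known)

print(f"overlap={overlap} non_overlap={non_overlap} rows_kept={len(filtered)}")
```

Identifiers are compared as strings here, so both files would need to format them the same way (for example `1` versus `1.0`) for the comparison to be meaningful.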
