Skip to content
This repository was archived by the owner on Mar 11, 2019. It is now read-only.

Commit 28ef2db

Browse files
Luis Francisco Hernández SánchezLuis Francisco Hernández Sánchez
Luis Francisco Hernández Sánchez
authored and
Luis Francisco Hernández Sánchez
committed
Added statistics for proteoforms with at least one modification.
1 parent dfde0cd commit 28ef2db

File tree

1 file changed

+57
-1
lines changed

1 file changed

+57
-1
lines changed

docs/statistics.md

Lines changed: 57 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -141,7 +141,7 @@ WITH DISTINCT protein, size(collect(ptmSet)) as ptmSetCount
141141
RETURN min(ptmSetCount), avg(ptmSetCount), max(ptmSetCount)
142142
~~~~
143143

144-
* Get all proteoforms: 13812
144+
* Get all proteoforms: 13879
145145
~~~~
146146
MATCH (pe:PhysicalEntity{speciesName:'Homo sapiens'})-[:referenceEntity]->(re:ReferenceEntity{databaseName:'UniProt'})
147147
WITH DISTINCT pe, re
@@ -153,6 +153,62 @@ WITH DISTINCT pe, re, ptms
153153
RETURN DISTINCT CASE WHEN re.variantIdentifier IS NOT NULL THEN re.variantIdentifier ELSE re.identifier END as protein, collect(DISTINCT pe.stId) as equivalentPe, ptms
154154
~~~~
155155

156+
* Get all proteoforms with at least one post translational modification: 1227 (8.84% of the proteoforms)
157+
~~~~
158+
MATCH (pe:PhysicalEntity{speciesName:'Homo sapiens'})-[:referenceEntity]->(re:ReferenceEntity{databaseName:'UniProt'})
159+
WITH DISTINCT pe, re
160+
OPTIONAL MATCH (pe)-[:hasModifiedResidue]->(tm:TranslationalModification)-[:psiMod]->(mod:PsiMod)
161+
WITH DISTINCT pe.stId AS physicalEntity,
162+
re.identifier AS protein,
163+
re.variantIdentifier AS isoform,
164+
tm.coordinate as coordinate,
165+
mod.identifier as type ORDER BY type, coordinate
166+
WITH DISTINCT physicalEntity,
167+
protein,
168+
CASE WHEN isoform IS NOT NULL THEN isoform ELSE protein END as isoform,
169+
COLLECT(type + ":" + CASE WHEN coordinate IS NOT NULL THEN coordinate ELSE "null" END) AS ptms
170+
WHERE size(ptms) > 1
171+
RETURN DISTINCT isoform, ptms
172+
ORDER BY isoform, ptms
173+
~~~~
174+
If we do not distinguish between isoforms then it is 1218 proteoforms.
175+
176+
* Number of proteins with at least a post translational modification: 611 (5.67% of the proteins in Reactome).
177+
~~~~
178+
MATCH (pe:PhysicalEntity{speciesName:'Homo sapiens'})-[:referenceEntity]->(re:ReferenceEntity{databaseName:'UniProt'})
179+
WITH DISTINCT pe, re
180+
OPTIONAL MATCH (pe)-[:hasModifiedResidue]->(tm:TranslationalModification)-[:psiMod]->(mod:PsiMod)
181+
WITH DISTINCT pe.stId AS physicalEntity,
182+
re.identifier AS protein,
183+
re.variantIdentifier AS isoform,
184+
tm.coordinate as coordinate,
185+
mod.identifier as type ORDER BY type, coordinate
186+
WITH DISTINCT physicalEntity,
187+
protein,
188+
CASE WHEN isoform IS NOT NULL THEN isoform ELSE protein END as isoform,
189+
COLLECT(type + ":" + CASE WHEN coordinate IS NOT NULL THEN coordinate ELSE "null" END) AS ptms
190+
WHERE size(ptms) > 1
191+
RETURN DISTINCT protein, collect(ptms)
192+
~~~~
193+
194+
* Number of proteins with at least a post translational modification distinguishing isoforms: 637
195+
~~~~
196+
MATCH (pe:PhysicalEntity{speciesName:'Homo sapiens'})-[:referenceEntity]->(re:ReferenceEntity{databaseName:'UniProt'})
197+
WITH DISTINCT pe, re
198+
OPTIONAL MATCH (pe)-[:hasModifiedResidue]->(tm:TranslationalModification)-[:psiMod]->(mod:PsiMod)
199+
WITH DISTINCT pe.stId AS physicalEntity,
200+
re.identifier AS protein,
201+
re.variantIdentifier AS isoform,
202+
tm.coordinate as coordinate,
203+
mod.identifier as type ORDER BY type, coordinate
204+
WITH DISTINCT physicalEntity,
205+
protein,
206+
CASE WHEN isoform IS NOT NULL THEN isoform ELSE protein END as isoform,
207+
COLLECT(type + ":" + CASE WHEN coordinate IS NOT NULL THEN coordinate ELSE "null" END) AS ptms
208+
WHERE size(ptms) > 1
209+
RETURN DISTINCT isoform, collect(ptms)
210+
~~~~
211+
156212
* Number of reactions of each proteoform:
157213
~~~~
158214
MATCH (pe:PhysicalEntity{speciesName:'Homo sapiens'})-[:referenceEntity]->(re:ReferenceEntity{databaseName:'UniProt'})

0 commit comments

Comments
 (0)