Skip to content

Commit 2c926f0

Browse files
authored
Update readme.md
1 parent a661fdc commit 2c926f0

File tree

1 file changed

+3
-5
lines changed

1 file changed

+3
-5
lines changed

readme.md

Lines changed: 3 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -8,8 +8,8 @@
88
* **Methods**: In this work, we present an attempt to imitate the AlphaFold system for protein prediction architecture [[3](http://predictioncenter.org/casp13/doc/presentations/Pred_CASP13-DeepLearning-AlphaFold-Senior.pdf)]. We use 1-D Residual Networks (ResNets) to predict dihedral torsion angles and 2-D ResNets to predict distance maps between the protein amino-acids[[4](https://arxiv.org/abs/1512.03385)]. We use the CASP7 ProteinNet dataset section for training and evaluation of the model [[5](https://arxiv.org/abs/1902.00249)]. An open-source implementation of the system described can be found [here](https://github.com/EricAlcaide/MiniFold).
99
* **Results**: We are able to obtain distance maps and torsion angle predictions for a protein given it's sequence and PSSM. Our angle prediction model scores a 0.39 of MAE (Mean Absolute Error), and 0.39 and 0.43 R^2 coefficients for Phi and Psi respectively, whereas SoTA is around 0.69 (Phi) and 0.73 (Psi). Our methods do not include post-processing of Deep Learning outputs, which can be very noisy.
1010
* **Conclusion**: We have shown the potential of Deep Learning methods and its possible application to solve the Protein Folding Problem. Despite technical limitations, Neural Networks are able to capture relations between the data. Although our visually pleasant results, our system lacks components such as the protein structure prediction from both dihedral torsion angles and the distance map of a given protein and the post-processing of our predictions in order to reduce noise.
11-
* **Post processing of predictions**: Presently the post processing of the predictions is done using a python script which converts the predicted results into RR format known as Residue-Residue contact format. This format represents the probability of contact between pairwise residues. Data in this format are inserted between MODEL and END records of the submission file. The prediction starts with the sequence of the predicted target splitted.The sequence is followed by the list of contacts in the five-column format as represented below:-
12-
11+
* **Post processing of predictions**: Presently the post processing of the predictions is done using a python script which converts the predicted results into RR format known as Residue-Residue contact prediction format. This format represents the probability of contact between pairwise residues. Data in this format are inserted between MODEL and END records of the submission file. The prediction starts with the sequence of the predicted target splitted.The sequence is followed by the list of contacts in the five-column format as represented below:-
12+
1313
PFRMAT RR
1414
TARGET T0999
1515
AUTHOR 1234-5678-9000
@@ -32,9 +32,7 @@
3232
5 15 0 8 0.307
3333
7 14 0 8 0.30
3434
END
35-
36-
37-
The predictions in this format can then be utilised as input to build 3D models using structure modelling softwares.
35+
The predictions in this format can then be utilised as input to build 3D models using structure modelling softwares.
3836

3937
#### Citation
4038
```

0 commit comments

Comments
 (0)