yhgon/rawMSA

rawMSA: end-to-end Deep Learning makes protein sequence profiles and feature extraction obsolete

This is a fork of the official source.

This repository contains all the information about the datasets and models used in the paper: Claudio Mirabello, Björn Wallner, "rawMSA: End-to-end Deep Learning Makes Protein Sequence Profiles and Feature Extraction Obsolete", doi: https://doi.org/10.1101/394437

  • The folder datasets contains the lists of proteins used in the 5-fold cross-validation and the scripts needed to produce the alignments and input files in the ".num" format
  • The folder scripts contains the Python and Bash scripts to run predictions and ensembling from the models
  • The folder models contains .h5 models for Keras/TensorFlow for both the CMAP and SS-RSA networks. These models are binary files that might not load on some Keras/TensorFlow versions. Send us an email if that is the case.
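Because the .h5 models are binary files that may fail to load on some Keras/TensorFlow versions, it can help to separate a damaged or incomplete file from a genuine version mismatch before writing to the authors. The following is a minimal sketch, not part of the original repository: the helper names `looks_like_hdf5` and `try_load_model` are hypothetical, and it assumes the models were written without an HDF5 user block, so the file signature sits at offset 0.

```python
# The 8-byte signature that opens an HDF5 file when no user block is present.
HDF5_SIGNATURE = b"\x89HDF\r\n\x1a\n"


def looks_like_hdf5(path):
    """Return True if the file at `path` begins with the HDF5 signature."""
    with open(path, "rb") as handle:
        return handle.read(len(HDF5_SIGNATURE)) == HDF5_SIGNATURE


def try_load_model(path):
    """Load a Keras .h5 model, naming the installed TensorFlow version on failure.

    Hypothetical helper for illustration; it is not one of the repository's scripts.
    """
    if not looks_like_hdf5(path):
        raise ValueError(f"{path} does not start with the HDF5 signature")
    # Imported lazily so the signature check also works without TensorFlow installed.
    import tensorflow as tf
    try:
        # compile=False skips restoring the optimizer state, which inference does not need.
        return tf.keras.models.load_model(str(path), compile=False)
    except Exception as err:
        raise RuntimeError(
            f"Could not load {path} with TensorFlow {tf.__version__}"
        ) from err
```

Calling `try_load_model` on one of the files in the models folder raises a `ValueError` if the file is not HDF5 at all (for example, a truncated download), and a `RuntimeError` that names the installed TensorFlow version if the file is valid HDF5 but the loader rejects it. The version string is useful information to include when emailing the authors.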

The full HDF5 dataset containing the SS and RSA classes, as well as the MSA inputs to the SS and RSA models, is too large to be kept in the Git repository (approximately 150 GB) and can be found here: http://duffman.it.liu.se/rawmsa/

Contact (original authors): claudio (dot) mirabello [at] liu (dot) se

About

Forked from https://bitbucket.org/clami66/rawmsa.git
