A high-speed Disordered Flexible Linker (DFL) predictor trained on DLD domain Linker dataset and Disprot Database.
LINKER-Pred-Lite project belongs to a series of LINKER-Pred-Suite projects which focus on the Structure and Function prediction of Disordered Flexible Linkers (DFLs). Currently we have LINKER-Pred2 for fast DFL prediction and LINKER-Pred2 for more accurate DFL prediction. LINKER-Pred-Lite is trained on more than 2000 DFL linker datasets from our DLD dataset and Disprot, its performance is better than TOP predictors on the CAID2 Linker dataset. There is no MSA searching needed for the prediction. Therefore, compared to LINKER-Pred2, the prediction may be slightly worse, but it only cost ~0.1 second for one prediction.
This predictor requires sequences embedded with ProtTrans. Note,
- File format should be
[SEQUENCE_NAME/ID].npy, replace SEQUENCE_NAME/ID with the actual sequence ID, it should be the same as the name from the.fastafile. - Matrix shape:
ProtTrans:(1, SEQUENCE_LENGTH, 1024)
📣
(We maintain this separation due to the requirements from CAID3, but we may edit or merge them in the future.)
- Pull the Docker image from DockerHub
docker pull dimeng851/punch_linker_light:v2
- RUN the following command:
Replace
CONTAINER_NAME- any name you like;
PATH_TO_INPUT_FASTA- path to input file, which is ONE FASTA file including all query sequences;
PATH_TO_PROTTRANS- a folder which includes all protTrans embedded sequences;
PATH_OUTPUT- a folder which will be used to save all outputs, including: a. timings.csv; b. disorder folder, where we will keep all the prediction results.docker run -d \ -it \ --name [CONTAINER_NAME] \ --mount type=bind,source=[PATH_TO_INPUT_FASTA],target=/punch_linker_light/data/input.fasta \ --mount type=bind,source=[PATH_TO_PROTTRANS],target=/punch_linker_light/data/protTrans \ --mount type=bind,source=[PATH_OUTPUT],target=/punch_linker_light/output \ dimeng851/punch_linker_light:v2
An example:
docker run -d \ -it \ --name punch_linker_light_con \ --mount type=bind,source=data/input.fasta,target=/punch_linker_light/data/input.fasta \ --mount type=bind,source=data/protTrans,target=/punch_linker_light/data/protTrans \ --mount type=bind,source=data/results,target=/punch_linker_light/output \ dimeng851/punch_linker_light:v2
- Find the results in OUTPUT folder.
Email Di.
di.meng@ucdconnect.ie
📬 Di Meng - di.meng@ucdconnect.ie
📬 Juliana Glavina - jglavina@iib.unsam.edu.ar
📬 Gianluca Pollastri - gianluca.pollastri@ucd.ie
📬 Lucía Beatriz Chemes - lchemes@iib.unsam.edu.ar