Protein Peeling

The objective of this tool is to divide a protein into the small compact units that make up its three-dimensional structure. These fragments, called protein units (PUs), are a level of description introduced by Jean-Christophe Gelly's team to better understand and analyze the organization of protein structures.

The method relies only on the contact probability matrix computed from inter-Cα distances. Distances are transformed into contact probabilities with a logistic function. The original program splits the protein by hierarchical clustering.
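As an illustration of the distance-to-probability step, here is a minimal sketch using NumPy. The function name is hypothetical, and the default values `d0=8.0` Å and `delta=1.5` Å are assumptions chosen for the example; check Reference 1 and the source code for the values actually used by the tool.

```python
import numpy as np

def contact_probabilities(ca_coords, d0=8.0, delta=1.5):
    """Turn C-alpha coordinates (N x 3) into an N x N contact probability matrix.

    d0 (midpoint distance, in angstroms) and delta (steepness) are assumed
    values for this sketch, not parameters confirmed by the tool.
    """
    coords = np.asarray(ca_coords, dtype=float)
    # Pairwise Euclidean distances between all C-alpha atoms.
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    # Logistic function: close to 1 for short distances, close to 0 for long ones,
    # and exactly 0.5 when the distance equals d0.
    return 1.0 / (1.0 + np.exp((dist - d0) / delta))
```

The resulting matrix is symmetric, and any splitting criterion can then be computed from it without going back to the coordinates.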

Reference 1: Jean-Christophe Gelly, Alexandre G. de Brevern, and Serge Hazout. 'Protein Peeling': an approach for splitting a 3D protein structure into compact fragments. Bioinformatics (Oxford, England), 22(2):129–133, January 2006.

The original version of Protein Peeling, developed by Jean-Christophe Gelly's team, is available here: Protein Peeling 3D

The aim of this university project is to propose a new Protein Peeling method that does not use hierarchical segmentation but instead searches iteratively for the most compact and independent regions of the protein.

The compactness (κ) and the separation (σ) are defined in this paper:

Reference 2: Guillaume Postic, Yassine Ghouzam, Romain Chebrek, and Jean-Christophe Gelly. An ambiguity principle for assigning protein structural domains. Science Advances, 3(1), 2017.
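To give an intuition for what a compactness and a separation score measure, the sketch below computes two simple proxies for a contiguous segment from a contact probability matrix. These proxies are assumptions made for illustration only: they are not the κ and σ formulas of Reference 2, and the function name is hypothetical.

```python
import numpy as np

def segment_scores(prob, start, end):
    """Return (compactness_proxy, separation_proxy) for residues start..end.

    Indices are 0-based and inclusive. Both scores are illustrative proxies,
    NOT the published definitions of kappa and sigma.
    """
    prob = np.asarray(prob, dtype=float)
    inside = np.zeros(prob.shape[0], dtype=bool)
    inside[start:end + 1] = True

    # Contact probability between distinct residues of the segment
    # (each pair counted once, diagonal excluded).
    block = prob[np.ix_(inside, inside)]
    intra = (block.sum() - np.trace(block)) / 2.0
    # Contact probability between the segment and the rest of the chain.
    inter = prob[np.ix_(inside, ~inside)].sum()

    n = int(inside.sum())
    n_pairs = n * (n - 1) / 2
    # Proxy 1: mean contact probability over residue pairs inside the segment.
    compactness = intra / n_pairs if n_pairs else 0.0
    # Proxy 2: share of the segment's contacts that stay inside the segment.
    total = intra + inter
    separation = intra / total if total else 0.0
    return compactness, separation
```

With these proxies, a segment that is densely connected internally and weakly connected to the rest of the chain scores close to 1 on both values.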

Requirements

PyMOL

 sudo apt-get install pymol

Python libraries

pip install numpy
pip install pandas
pip install matplotlib
pip install biopython

Run

Please launch the tool from the root of the repository.

Run the program

 python3 Protein_Peeling.py PDBfile CHAIN MIN MAX

With:

PDBfile: the path to your PDB file; we recommend putting your PDB file in the datas/ folder
CHAIN: the chain of your protein (ex: A)
MIN: minimum size of a PU (ex: 10)
MAX: maximum size of a PU (ex: 50)
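To illustrate how MIN and MAX bound the search space, the following sketch enumerates every contiguous residue segment whose length lies between the two values. It is only an assumption about how candidate PUs could be listed; the function name is hypothetical and the tool's actual search may proceed differently.

```python
def candidate_units(n_residues, min_size, max_size):
    """Yield (start, end) pairs, 0-based and inclusive, for every contiguous
    segment whose length is between min_size and max_size."""
    if not 1 <= min_size <= max_size:
        raise ValueError("expected 1 <= MIN <= MAX")
    for start in range(n_residues):
        for size in range(min_size, max_size + 1):
            end = start + size - 1
            if end >= n_residues:
                # Longer segments starting here would run past the chain end.
                break
            yield start, end
```

For a chain of N residues this produces on the order of N × (MAX − MIN + 1) candidates, which is why narrowing the MIN/MAX range reduces the amount of work.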

Example usage

 python3 Protein_Peeling.py /datas/1atn.pdb A 10 50

The tool creates a folder in results/ named after the PDB entry (ex: 1atn).
Inside results/1atn, one folder "PUx" (x: number of the splitting) is created for each possible way of splitting the protein (ex: results/1atn/PU1/).
Inside each of these folders, PDB files and a PyMOL representation are generated (the representation is made with src/visu_all_pdbs.py and PyMOL).
The tool prints in the terminal the folder that contains the best splitting and plots the contact map. The file ranking_spitting.txt contains the rank and the information of each splitting.

**Protein 1atn with the best splitting (foldname:PU14) colored by its PUs**

PU example (residues:202-251) from splitting 12 (foldname:PU14)

Contact map divided according to all the splitting combinations

Documentation

The documentation is generated with pydoc.

pydoc -w ProteinPeeling
