This program generates an inventory of amino acids present on the surface of peripheral proteins of a same family.
(See report.pdf and presentation.pdf for more details)
UNIX/shell environment
Python3, R, PyMOL
$./launch.sh
Then choose a CATH superfamily (3.20.20.190 or 2.29.30.29)
- Display the structural alignment colored by secondary structure
- Display the structural alignment in PyMOL
- Study of a fold/loop (gives out a csv file of all amino acids)
- Add sequences to the structural alignment
- Generate a plot (amino acid inventory)
- Delete unused files and Quit
- Choose a CATH superfamily (http://www.cathdb.info/) and create a directory /superfamilies/NEW-FAMILY
- Provide a structural alignment file (FASTA format) /NEW-FAMILY/fasta/cath/alignment.fasta
- Provide the pdb files corresponding /NEW-FAMILY/pdb
- Provide a file containing sequences to add (FASTA format) : /NEW-FAMILY/fasta/new/seq_to_align.fasta
- In the header of the python script (/scripts/script.py) indicate the superfamily studied, the model chosen and the different secondary structures to analyse (name, type and number in the sequence)
- Hélène Kabbech - Bioinformatics Master student (University of Paris Diderot)
- Internship supervised by Prof. Nathalie Reuter (CBU Bergen)