-
Notifications
You must be signed in to change notification settings - Fork 11
implementing PDB to FASTA file format conversion in multiple languages
License
kad-ecoli/pdb2fasta
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
This is a collection of implemention of the "pdb2fasta" program for
converting PDB format protein structure to FASTA format amino acid sequence.
On Mac or Linux, you can run the programs with:
pdb2fasta pdb.pdb > seq.fasta
, where "pdb.pdb" is the input PDB file pdb.pdb to FASTA sequence file seq.fasta.
The following implements are available: Python2 (.py), Perl5 (.pl),
Matlab (.m), R (.R), C (.c), C++ (.cpp), BASH (.sh), Java (.java).
These programs have consistent behavior over the following scenarios:
[1] If atoms have alternative locations (e.g. 3b2c), only those atom with
alternative location identifier ' ' (space) or 'A' will be considered.
[2] If a protein contain non-standard amino acids (e.g. 1a62), "MSE" residues
will be converted to "MET", while other non-standard amino acids are
ignored.
[3] If a residue have insetion code (e.g. 2p83), it will still be considered.
[4] If a PDB contains multi-models (e.g. 2m9l), only the first model will be
considered.
About
implementing PDB to FASTA file format conversion in multiple languages
Topics
Resources
License
Stars
Watchers
Forks
Packages 0
No packages published