PASTA

This is an implementation of the PASTA (Practical Alignment using SATe and TrAnsitivity) algorithm. This version is currently for testing purposes only.

All questions and inquiries should be addressed to our user email group: pasta-users@googlegroups.com

Acknowledgment

The current version of this code is heavily based on the SATe code (http://phylo.bio.ku.edu/software/sate/sate.html). Refer to the sate-doc directory for documentation of the SATe code.

INSTALLATION

The current version of PASTA has been developed and tested entirely on Linux. It has also been tested on Mac, but less extensively. Windows is not currently supported, but support is planned for future versions.

You need to have:

Python
DendroPy (http://packages.python.org/DendroPy/)
Java (for OPAL)
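
Before installing, it can help to confirm that the prerequisites above are visible from the Python interpreter you plan to use. The following is a minimal sketch, not part of PASTA: the function name missing_prerequisites is hypothetical, and the script assumes Python 3.4 or later (for importlib.util.find_spec and shutil.which) and that the Java runtime is exposed as an executable named java on your PATH.

```python
import importlib.util
import shutil


def missing_prerequisites():
    """Return the names of prerequisites that cannot be found (hypothetical helper)."""
    missing = []
    # DendroPy is imported under the lowercase module name "dendropy".
    if importlib.util.find_spec("dendropy") is None:
        missing.append("DendroPy")
    # OPAL needs Java; assume the runtime is an executable called "java" on PATH.
    if shutil.which("java") is None:
        missing.append("Java")
    return missing


if __name__ == "__main__":
    problems = missing_prerequisites()
    if problems:
        print("Missing: " + ", ".join(problems))
    else:
        print("All prerequisites found.")
```

Run it with the same interpreter you will use for the setup.py command, since DendroPy may be installed for one Python installation and not for another.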

LINUX:

Download the zip file, extract it, and cd into pasta.[version]/pasta. Then run:

python setup.py develop

You will probably need to prefix that command with sudo. If you don't have root access, use --prefix to install to a different location; that location must be listed in your PYTHONPATH environment variable. Email pasta-users@googlegroups.com about installation issues.

MAC:

Download the zip file, extract it, and cd into pasta.[version]. Installing on a Mac requires some additional steps. Inside pasta.[version], clone the following git repository:

https://github.com/sate-dev/sate-tools-mac

You can use the git clone command to clone the repository, or just download the zip file (https://github.com/sate-dev/sate-tools-mac/archive/master.zip).

Then run the setup.py command as described under Linux.

EXECUTION

To run, use: python run_pasta.py -i input_fasta -t starting_tree
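
For scripted or batch runs, the command above can be assembled in Python and passed to subprocess. This is only a sketch: pasta_command is a hypothetical helper, and it uses just the -i and -t options shown above. Treating -t as optional is an assumption based on the note below that PASTA builds its own starting tree from the input.

```python
import sys


def pasta_command(input_fasta, starting_tree=None, script="run_pasta.py"):
    """Build the argument list for a PASTA run (hypothetical helper)."""
    # Use the current interpreter so PASTA runs under the Python it was installed for.
    cmd = [sys.executable, script, "-i", input_fasta]
    # Assumption: -t may be omitted, in which case PASTA computes a starting tree.
    if starting_tree is not None:
        cmd += ["-t", starting_tree]
    return cmd
```

The returned list can be handed to subprocess.run(cmd, check=True), which raises an error if PASTA exits with a non-zero status.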

NOTE that the current version of the PASTA code does NOT compute the starting tree through the process described in the paper. Instead, it runs FastTree directly on the input if the input is already aligned; otherwise it first aligns the input with MAFFT and then runs FastTree. The approach described in the paper for obtaining the starting tree is very simple:

1. Choose a random subset of 100 of your sequences.
2. Compute a SATe alignment of this subset (this requires installing SATe; alternatively, just run PASTA on it).
3. Build a HMMER model from the alignment of the 100-sequence subset.
4. Use hmmalign to align the remaining sequences to the alignment of the small subset.
5. Run FastTree on the output of step 4.
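
The first step of this procedure, drawing the random subset, can be sketched in plain Python. This is an illustrative sketch rather than code from PASTA or UPP: the helper names (read_fasta, split_random_subset, write_fasta) are hypothetical, and the input is assumed to be an ordinary FASTA file. The remaining steps rely on external tools and are not shown.

```python
import random


def read_fasta(path):
    """Return a list of (header, sequence) pairs from a FASTA file."""
    records = []
    header, chunks = None, []
    with open(path) as handle:
        for line in handle:
            line = line.strip()
            if line.startswith(">"):
                # A new header closes the previous record, if there is one.
                if header is not None:
                    records.append((header, "".join(chunks)))
                header, chunks = line[1:], []
            elif line and header is not None:
                chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def split_random_subset(records, size=100, seed=None):
    """Split records into a random subset of `size` and the remaining records."""
    rng = random.Random(seed)
    # If there are fewer than `size` records, the subset is the whole input.
    size = min(size, len(records))
    chosen = set(rng.sample(range(len(records)), size))
    subset = [rec for i, rec in enumerate(records) if i in chosen]
    rest = [rec for i, rec in enumerate(records) if i not in chosen]
    return subset, rest


def write_fasta(records, path):
    """Write (header, sequence) pairs to a FASTA file, one line per sequence."""
    with open(path, "w") as handle:
        for header, seq in records:
            handle.write(">%s\n%s\n" % (header, seq))
```

A typical use would be to read the input, call split_random_subset(records, size=100), write the subset to one file for alignment in step 2, and write the remainder to a second file to be added in step 4. Passing a fixed seed makes the choice of subset reproducible.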

We do have a separate program that computes this simple starting tree. See https://github.com/smirarab/sepp and use UPP. Make sure you set -A 100 -P 100 to get the starting tree described in the PASTA paper.

LICENSE

PASTA uses the same license as SATe (GNU General Public License).
