Efficient and versatile phylogenomic software by maximum likelihood http://www.iqtree.org
The IQ-TREE software was created as the successor of IQPNNI and TREE-PUZZLE (thus the name IQ-TREE). IQ-TREE was motivated by the rapid accumulation of phylogenomic data, leading to a need for efficient phylogenomic software that can handle a large amount of data and provide more complex models of sequence evolution. To this end, IQ-TREE can utilize multicore computers and distributed parallel computing to speed up the analysis. IQ-TREE automatically performs checkpointing to resume an interrupted analysis.
As input IQ-TREE accepts all common sequence alignment formats including PHYLIP, FASTA, Nexus, Clustal and MSF. As output IQ-TREE will write a self-readable report file (name suffix `.iqtree`) and a NEWICK tree file (`.treefile`), which can be visualized by tree viewer programs such as FigTree, Dendroscope or iTOL.
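As a minimal sketch of how the output suffixes above can be used in a script, the hypothetical helper below (`expected_outputs` is not part of IQ-TREE) derives the report and tree file names from an alignment path. It assumes that the alignment file name serves as the output prefix; adjust it if your run uses a different prefix.

```python
from pathlib import Path


def expected_outputs(alignment):
    """Return the report and tree file paths expected for an alignment.

    Assumption: output files are named by appending the suffix to the
    full alignment file name (e.g. example.phy -> example.phy.iqtree).
    """
    base = str(Path(alignment))
    return {
        "report": Path(base + ".iqtree"),   # self-readable report file
        "tree": Path(base + ".treefile"),   # NEWICK tree file
    }


if __name__ == "__main__":
    for kind, path in expected_outputs("example.phy").items():
        print(kind, path)
```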
- Efficient search algorithm: Fast and effective stochastic algorithm to reconstruct phylogenetic trees by maximum likelihood. IQ-TREE compares favorably to RAxML and PhyML in terms of likelihood while requiring a similar amount of computing time (Nguyen et al., 2015).
- Ultrafast bootstrap: An ultrafast bootstrap approximation (UFBoot) to assess branch supports. UFBoot is 10 to 40 times faster than RAxML rapid bootstrap and obtains less biased support values (Minh et al., 2013).
- Ultrafast model selection: An ultrafast and automatic model selection (ModelFinder) which is 10 to 100 times faster than jModelTest and ProtTest. ModelFinder also finds the best-fit partitioning scheme, like PartitionFinder.
- Phylogenetic testing: Several fast branch tests like SH-aLRT and aBayes test (Anisimova et al., 2011) and tree topology tests like the approximately unbiased (AU) test (Shimodaira, 2002).
The strength of IQ-TREE is the availability of a wide variety of phylogenetic models:
- Common models: All common substitution models for DNA, protein, codon, binary and morphological data with rate heterogeneity among sites and ascertainment bias correction for e.g. SNP data.
- Partition models: Allowing individual models for different genomic loci (e.g. genes or codon positions), mixed data types, mixed rate heterogeneity types, linked or unlinked branch lengths between partitions.
- Mixture models: Fully customizable mixture models and empirical protein mixture models.
IQ-TREE+PoMo is still under development. Please check out `iqtree --help`, especially the section titled POLYMORPHISM AWARE MODELS (PoMo).
POLYMORPHISM AWARE MODELS (PoMo):
PoMo is run when
- a Counts File is used as input file, and/or when
- it is specified in the model string (see below).
-st C[FR] or C[FR]ps Counts File (automatically detected).
Useful to customize the virtual population size `ps`
3 <= ps <= 19; ps has to be an odd number, 2 or 10.
F: Sum over partial likelihoods at the tip of the tree (weighted).
R: Random binomial sampling of PoMo states from data (sampled).
Default is `CF9`.
-m <sm>+<pm>+<ft> Default: `HKY+rP+F`.
<sm>: Substitution model.
DNA: HKY (default), JC, F81, K2P, K3P, K81uf, TN/TrN, TNef,
TIM, TIMef, TVM, TVMef, SYM, GTR, or a 6-digit model
specification (e.g., 010010 = HKY).
<pm>: PoMo model.
- rP (default; reversible PoMo with tree inference).
<ft>: Frequency type (optional; default: +F, counted).
+F or +FO or +FU or +FQ.
Counted, optimized, user-defined, equal state frequency.
This overwrites the specifications of the DNA model.
The default model string is: -m HKY+rP+F.
Until now, only DNA models work with PoMo.
Model testing and rate heterogeneity do not work with PoMo yet.
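The option rules above can be sketched as a small validator. This is an illustration only, not part of IQ-TREE: the helper names `parse_sequence_type` and `pomo_model_string` are hypothetical, and the constraint on `ps` is read here as "any odd value from 3 to 19, plus the special values 2 and 10", which is one interpretation of the help text.

```python
import re

# Assumption: allowed virtual population sizes are the odd values
# 3..19 plus the two special cases 2 and 10 named in the help text.
VALID_PS = {2, 10} | set(range(3, 20, 2))

# Substitution models and frequency types as listed in the help text.
DNA_MODELS = {"HKY", "JC", "F81", "K2P", "K3P", "K81uf", "TN", "TrN", "TNef",
              "TIM", "TIMef", "TVM", "TVMef", "SYM", "GTR"}
FREQ_TYPES = {"F", "FO", "FU", "FQ"}


def parse_sequence_type(st="CF9"):
    """Split a -st value such as 'CF9' or 'CR' into (sampling, ps).

    ps is None when no population size is given in the string.
    """
    match = re.fullmatch(r"C([FR])(\d+)?", st)
    if match is None:
        raise ValueError(f"not a PoMo sequence type: {st!r}")
    sampling = match.group(1)  # 'F' = weighted, 'R' = sampled
    ps = None if match.group(2) is None else int(match.group(2))
    if ps is not None and ps not in VALID_PS:
        raise ValueError(f"unsupported virtual population size: {ps}")
    return sampling, ps


def pomo_model_string(sm="HKY", ft="F"):
    """Compose a -m value of the form <sm>+rP+<ft>."""
    # A 6-digit specification such as 010010 is accepted as well.
    if sm not in DNA_MODELS and re.fullmatch(r"[0-9]{6}", sm) is None:
        raise ValueError(f"unknown DNA substitution model: {sm!r}")
    if ft not in FREQ_TYPES:
        raise ValueError(f"unknown frequency type: {ft!r}")
    return f"{sm}+rP+{ft}"


if __name__ == "__main__":
    print(parse_sequence_type("CF9"))
    print(pomo_model_string("GTR", "FO"))
```

Checking the strings before launching a run catches typos such as an even `ps` early; the resulting values would then be passed to `-st` and `-m` respectively.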
For a quick start you can also try the IQ-TREE web server, which performs online computation using a dedicated computing cluster. It is very easy to use, requiring as few as 3 clicks! Try it out at
http://iqtree.cibiv.univie.ac.at
Please refer to the user documentation and frequently asked questions. If you have further questions, feedback, feature requests, or bug reports, please sign up to the following Google group (if you have not done so yet) and post a topic there:
https://groups.google.com/d/forum/iqtree
The average response time is one working day.
To cite IQ-TREE please use:
- L.-T. Nguyen, H.A. Schmidt, A. von Haeseler, and B.Q. Minh (2015) IQ-TREE: A fast and effective stochastic algorithm for estimating maximum likelihood phylogenies. Mol. Biol. Evol., 32, 268-274. DOI: 10.1093/molbev/msu300
For the ultrafast bootstrap (UFBoot) please cite:
- B.Q. Minh, M.A.T. Nguyen, and A. von Haeseler (2013) Ultrafast approximation for phylogenetic bootstrap. Mol. Biol. Evol., 30, 1188-1195. DOI: 10.1093/molbev/mst024
Some parts of the code were taken from the following packages/libraries: Phylogenetic likelihood library, TREE-PUZZLE, BIONJ, Nexus Class Library, Eigen library, SPRNG library, Zlib library, gzstream library, vectorclass library, GNU scientific library.
IQ-TREE was partially funded by the Austrian Science Fund - FWF (grant no. I760-B17 from 2012-2015 and I 2508-B29 from 2016-2019) and the University of Vienna (Initiativkolleg I059-N).