Genetic algorithm for large-scale maximum parsimony phylogenetic analysis of proteins
Inferring phylogeny is a difficult computational problem. For example, for only 13 taxa, there are more then 13 billion possible unrooted phylogenetic trees. Heuristics are necessary to minimize the time spent evaluating non-optimal trees. We describe here an approach for heuristic searching, using...
Saved in:
| Published in | Biochimica et biophysica acta Vol. 1725; no. 1; pp. 19 - 29 |
|---|---|
| Main Authors | , , , |
| Format | Journal Article |
| Language | English |
| Published |
Netherlands
Elsevier B.V
30.08.2005
|
| Subjects | |
| Online Access | Get full text |
| ISSN | 0304-4165 0006-3002 1872-8006 1872-8006 |
| DOI | 10.1016/j.bbagen.2005.04.027 |
Cover
| Summary: | Inferring phylogeny is a difficult computational problem. For example, for only 13 taxa, there are more then 13 billion possible unrooted phylogenetic trees. Heuristics are necessary to minimize the time spent evaluating non-optimal trees. We describe here an approach for heuristic searching, using a genetic algorithm, that can reduce the time required for weighted maximum parsimony phylogenetic inference, especially for data sets involving a large number of taxa. It is the first implementation of a weighted maximum parsimony criterion using amino acid sequences. To validate the weighted criterion, we used an artificial data set and compared it to a number of other phylogenetic methods. Genetic algorithms mimic the natural selection's ability to solve complex problems. We have identified several parameters affecting the genetic algorithm. Methods were developed to validate these parameters, ensuring optimal performance. This approach allows the construction of phylogenetic trees with over 200 taxa in practical time on a regular PC. |
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
| ISSN: | 0304-4165 0006-3002 1872-8006 1872-8006 |
| DOI: | 10.1016/j.bbagen.2005.04.027 |