Fast gap-free enumeration of conformations and sequences for protein design

ABSTRACT Despite significant successes in structure‐based computational protein design in recent years, protein design algorithms must be improved to increase the biological accuracy of new designs. Protein design algorithms search through an exponential number of protein conformations, protein ense...

Full description

Saved in:

Bibliographic Details
Published in	Proteins, structure, function, and bioinformatics Vol. 83; no. 10; pp. 1859 - 1877
Main Authors	Roberts, Kyle E., Gainza, Pablo, Hallen, Mark A., Donald, Bruce R.
Format	Journal Article
Language	English
Published	United States Blackwell Publishing Ltd 01.10.2015 Wiley Subscription Services, Inc
Subjects	A search Algorithms Amino Acid Sequence Amino acids combinatorial search Computational Biology - methods computational protein design Design Protein Conformation Protein Engineering - methods Sequence Analysis, Protein - methods Software structure-based design A search computational protein design combinatorial search structure-based design
Online Access	Get full text
ISSN	0887-3585 1097-0134 1097-0134
DOI	10.1002/prot.24870

Cover

More Information
Summary:	ABSTRACT Despite significant successes in structure‐based computational protein design in recent years, protein design algorithms must be improved to increase the biological accuracy of new designs. Protein design algorithms search through an exponential number of protein conformations, protein ensembles, and amino acid sequences in an attempt to find globally optimal structures with a desired biological function. To improve the biological accuracy of protein designs, it is necessary to increase both the amount of protein flexibility allowed during the search and the overall size of the design, while guaranteeing that the lowest‐energy structures and sequences are found. DEE/A‐based algorithms are the most prevalent provable algorithms in the field of protein design and can provably enumerate a gap‐free list of low‐energy protein conformations, which is necessary for ensemble‐based algorithms that predict protein binding. We present two classes of algorithmic improvements to the A algorithm that greatly increase the efficiency of A. First, we analyze the effect of ordering the expansion of mutable residue positions within the A tree and present a dynamic residue ordering that reduces the number of A* nodes that must be visited during the search. Second, we propose new methods to improve the conformational bounds used to estimate the energies of partial conformations during the A* search. The residue ordering techniques and improved bounds can be combined for additional increases in A* efficiency. Our enhancements enable all A*‐based methods to more fully search protein conformation space, which will ultimately improve the accuracy of complex biomedically relevant designs. Proteins 2015; 83:1859–1877. © 2015 Wiley Periodicals, Inc.
Bibliography:	istex:5C245ACA54F843322C36BD8AB063D11136EFF0A1 ArticleID:PROT24870 ark:/67375/WNG-JZVHSM9L-W NIH - No. 2R01-GM-78031-05 Kyle E. Roberts and Pablo Gainza contributed equally to this work. ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23
ISSN:	0887-3585 1097-0134 1097-0134
DOI:	10.1002/prot.24870