Fast search algorithms for computational protein design

One of the main challenges in computational protein design (CPD) is the huge size of the protein sequence and conformational space that has to be computationally explored. Recently, we showed that state‐of‐the‐art combinatorial optimization technologies based on Cost Function Network (CFN) processin...

Full description

Saved in:
Bibliographic Details
Published inJournal of computational chemistry Vol. 37; no. 12; pp. 1048 - 1058
Main Authors Traoré, Seydou, Roberts, Kyle E., Allouche, David, Donald, Bruce R., André, Isabelle, Schiex, Thomas, Barbe, Sophie
Format Journal Article
LanguageEnglish
Published United States Blackwell Publishing Ltd 05.05.2016
Wiley Subscription Services, Inc
Wiley
Subjects
Online AccessGet full text
ISSN0192-8651
1096-987X
1096-987X
DOI10.1002/jcc.24290

Cover

More Information
Summary:One of the main challenges in computational protein design (CPD) is the huge size of the protein sequence and conformational space that has to be computationally explored. Recently, we showed that state‐of‐the‐art combinatorial optimization technologies based on Cost Function Network (CFN) processing allow speeding up provable rigid backbone protein design methods by several orders of magnitudes. Building up on this, we improved and injected CFN technology into the well‐established CPD package Osprey to allow all Osprey CPD algorithms to benefit from associated speedups. Because Osprey fundamentally relies on the ability of A* to produce conformations in increasing order of energy, we defined new A* strategies combining CFN lower bounds, with new side‐chain positioning‐based branching scheme. Beyond the speedups obtained in the new A*‐CFN combination, this novel branching scheme enables a much faster enumeration of suboptimal sequences, far beyond what is reachable without it. Together with the immediate and important speedups provided by CFN technology, these developments directly benefit to all the algorithms that previously relied on the DEE/ A* combination inside Osprey* and make it possible to solve larger CPD problems with provable algorithms. © 2016 Wiley Periodicals, Inc. Computational protein design (CPD) through Cost Function Networks (CFN) provides important speedups to explore large sequence‐conformation spaces and provably identifies the sequence with the conformation of optimal stability (Global Minimum Energy Conformation, GMEC). In addition to quickly finding the GMEC of highly complex protein design problems, CFN‐based methods also enable the efficient enumeration of suboptimal solutions. These approaches offer an attractive alternative to the usual CPD methods and were implemented in the well‐established CPD package Osprey.
Bibliography:NIH - No. 2-R01-GM-78031-5
istex:1402265CF7B558610C75984515D271B20DFA6F89
INRA
Agence Nationale de la Recherche - No. ANR-10-BLA-0214; No. ANR-12-MONU-0015-03
Region Midi-Pyrénées
ark:/67375/WNG-FC67T3X6-J
ArticleID:JCC24290
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-1
ObjectType-Feature-2
content type line 23
PMCID: PMC4828276
ISSN:0192-8651
1096-987X
1096-987X
DOI:10.1002/jcc.24290