Rosetta:MSF:NN: Boosting performance of multi-state computational protein design with a neural network
Rational protein design aims at the targeted modification of existing proteins. To reach this goal, software suites like Rosetta propose sequences that introduce the desired properties. Challenging design problems require representing a protein as a structural ensemble. Thus, Rosetta multi-state design (MSD) protocols have been developed in which each state represents one protein conformation. The computational demands of MSD protocols are high because, for each candidate sequence, a costly three-dimensional (3D) model has to be created and assessed for all states. Each of these scores contributes one data point to a complex, design-specific energy landscape. As neural networks (NNs) have proved well suited to learning such solution spaces, we integrated one into the framework Rosetta:MSF, replacing the genetic algorithm used so far, with the aim of reducing computational costs. Like its predecessor, Rosetta:MSF:NN administers a set of candidate sequences and their scores and scans sequence space iteratively. During each iteration, the union of all candidate sequences and their Rosetta scores is used to re-train NNs that possess a design-specific architecture. The enormous speed of the NNs allows an extensive assessment of alternative sequences, which are ranked by the scores predicted by the NN. Costly 3D models are computed only for a small fraction of the best-scoring sequences; these and the corresponding 3D-based scores replace half of the candidate sequences during each iteration. The analysis of two sets of candidate sequences generated for a specific design problem by means of a genetic algorithm confirmed that the NN predicted 3D-based scores quite well; the Pearson correlation coefficient was at least 0.95. Applying Rosetta:MSF:NN:enzdes to a benchmark consisting of 16 ligand-binding problems showed that this protocol converges ten times faster than the genetic algorithm and finds sequences with comparable scores.
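The iteration described in the abstract can be illustrated with a minimal sketch. It is not the Rosetta:MSF:NN implementation: the toy energy table `TABLE`, the function `toy_score` (standing in for the costly 3D-based Rosetta score), the small `TinyMLP` network, the single-point mutation proposals, and the choice to replace the worse-scoring half of the candidates are all assumptions made for illustration. Lower scores are treated as better.

```python
import numpy as np

N_AA = 20  # number of amino acid types; residues are encoded as indices 0..19

# Hypothetical stand-in for the expensive 3D-based score: a fixed random
# per-position energy table for sequences of length 8 (lower is better).
TABLE = np.random.default_rng(1).normal(size=(8, N_AA))

def toy_score(seq):
    return float(TABLE[np.arange(len(seq)), seq].sum())

def one_hot(seqs):
    """Encode an (n, length) array of residue indices as flat one-hot rows."""
    seqs = np.asarray(seqs)
    out = np.zeros((seqs.shape[0], seqs.shape[1], N_AA))
    rows = np.arange(seqs.shape[0])[:, None]
    cols = np.arange(seqs.shape[1])[None, :]
    out[rows, cols, seqs] = 1.0
    return out.reshape(seqs.shape[0], -1)

class TinyMLP:
    """One hidden tanh layer, trained by full-batch gradient descent."""
    def __init__(self, n_in, n_hidden, rng):
        self.w1 = rng.normal(0, 0.1, (n_in, n_hidden))
        self.b1 = np.zeros(n_hidden)
        self.w2 = rng.normal(0, 0.1, n_hidden)
        self.b2 = 0.0

    def predict(self, x):
        return np.tanh(x @ self.w1 + self.b1) @ self.w2 + self.b2

    def fit(self, x, y, epochs=300, lr=0.05):
        for _ in range(epochs):
            h = np.tanh(x @ self.w1 + self.b1)
            err = (h @ self.w2 + self.b2 - y) / len(y)  # gradient of 0.5 * MSE
            grad_h = np.outer(err, self.w2) * (1.0 - h ** 2)
            self.w2 -= lr * (h.T @ err)
            self.b2 -= lr * err.sum()
            self.w1 -= lr * (x.T @ grad_h)
            self.b1 -= lr * grad_h.sum(axis=0)

def surrogate_guided_search(expensive_score, seq_len, pop_size=40,
                            n_iter=10, n_proposals=2000, seed=0):
    rng = np.random.default_rng(seed)
    pop = rng.integers(0, N_AA, (pop_size, seq_len))
    scores = np.array([expensive_score(s) for s in pop])
    seen_x, seen_y = [pop], [scores]          # every sequence scored so far
    best_history = [scores.min()]
    for _ in range(n_iter):
        # Re-train the network on all sequences with known expensive scores.
        x, y = one_hot(np.concatenate(seen_x)), np.concatenate(seen_y)
        net = TinyMLP(x.shape[1], 16, rng)
        net.fit(x, (y - y.mean()) / (y.std() + 1e-9))
        # Cheaply assess many alternatives: single-point mutants of candidates.
        mutants = pop[rng.integers(0, pop_size, n_proposals)].copy()
        pos = rng.integers(0, seq_len, n_proposals)
        mutants[np.arange(n_proposals), pos] = rng.integers(0, N_AA, n_proposals)
        ranked = np.argsort(net.predict(one_hot(mutants)))
        # Compute the expensive score only for the best-predicted fraction.
        top = mutants[ranked[: pop_size // 2]]
        top_scores = np.array([expensive_score(s) for s in top])
        # Assumption: the new sequences replace the worse-scoring half.
        keep = np.argsort(scores)[: pop_size - len(top)]
        pop = np.concatenate([pop[keep], top])
        scores = np.concatenate([scores[keep], top_scores])
        seen_x.append(top)
        seen_y.append(top_scores)
        best_history.append(scores.min())
    return pop, scores, best_history
```

Calling `surrogate_guided_search(toy_score, seq_len=8)` returns the final candidates, their scores, and the best score after each iteration. In this sketch the expensive function is evaluated for only `pop_size // 2` sequences per iteration, while the network ranks `n_proposals` alternatives, which mirrors the cost saving the abstract attributes to the NN.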
- University of Regensburg (UR) Germany
- University of Michigan United States
- University of Michigan–Ann Arbor United States
- University of Michigan–Flint United States
Science, Q, R, Computational Biology, Proteins, Epistasis, Genetic, Mutation, Medicine, Thermodynamics, Amino Acid Sequence, Neural Networks, Computer, Databases, Protein, Algorithms, Research Article
