Document Type
Journal Article
Date of this Version
2001
Publication Source
Journal of Molecular Biology
Volume
306
Issue
3
Start Page
607
Last Page
628
DOI
10.1006/jmbi.2000.4422
Abstract
Combinatorial experiments provide new ways to probe the determinants of protein folding and to identify novel folding amino acid sequences. These types of experiments, however, are complicated both by enormous conformational complexity and by large numbers of possible sequences. Therefore, a quantitative computational theory would be helpful in designing and interpreting these types of experiments. Here, we present and apply a statistically based, computational approach for identifying the properties of sequences compatible with a given main-chain structure. Protein side-chain conformations are included in an atom-based fashion. Calculations are performed for a variety of similar backbone structures to identify sequence properties that are robust with respect to minor changes in main-chain structure. Rather than specific sequences, the method yields the likelihood of each of the amino acids at preselected positions in a given protein structure. The theory may be used to quantify the characteristics of sequence space for a chosen structure without explicitly tabulating sequences. To account for hydrophobic effects, we introduce an environmental energy that is consistent with other simple hydrophobicity scales and show that it is effective for side-chain modeling. We apply the method to calculate the identity probabilities of selected positions of the immunoglobulin light chain-binding domain of protein L, for which many variant folding sequences are available. The calculations compare favorably with the experimentally observed identity probabilities.
Copyright/Permission Statement
© 2001. This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/
Keywords
protein design, combinatorial library, sequence variability, profile, protein L
Recommended Citation
Kono, H., Zou, J., & Saven, J. G. (2001). Statistical Theory of Protein Combinatorial Libraries. Journal of Molecular Biology, 306(3), 607-628. http://dx.doi.org/10.1006/jmbi.2000.4422
Date Posted: 07 December 2016