; ; ; ; ; ; ; ;
1 Department of Electrical Engineering, Technical University of Denmark 2 University of Copenhagen 3 Department of Informatics and Mathematical Modeling, Technical University of Denmark 4 University of Oxford
We propose a method to formulate probabilistic models of protein structure in atomic detail, for a given amino acid sequence, based on Bayesian principles, while retaining a close link to physics. We start from two previously developed probabilistic models of protein structure on a local length scale, which concern the dihedral angles in main chain and side chains, respectively. Conceptually, this constitutes a probabilistic and continuous alternative to the use of discrete fragment and rotamer libraries. The local model is combined with a nonlocal model that involves a small number of energy terms according to a physical force field, and some information on the overall secondary structure content. In this initial study we focus on the formulation of the joint model and the evaluation of the use of an energy vector as a descriptor of a protein's nonlocal structure; hence, we derive the parameters of the nonlocal model from the native structure without loss of generality. The local and nonlocal models are combined using the reference ratio method, which is a well-justified probabilistic construction. For evaluation, we use the resulting joint models to predict the structure of four proteins. The results indicate that the proposed method and the probabilistic models show considerable promise for probabilistic protein structure prediction and related applications. © 2013 Wiley Periodicals, Inc.
Proteins, 2014, Vol 82, Issue 2, p. 288-299
Algorithms; Amino Acid Sequence; Bacterial Proteins; Bayes Theorem; Hydrogen Bonding; Models, Molecular; Models, Statistical; Protein Structure, Secondary; Protein Structure, Tertiary; Structural Homology, Protein; Thermodynamics; IgG Fc-binding protein, Streptococcus; proteins; 04500, Mathematical biology and statistical methods; 10060, Biochemistry studies - General; 10064, Biochemistry studies - Proteins, peptides and amino acids; 10515, Biophysics - Biocybernetics; Computational Biology; Bayesian model mathematical and computer techniques; probabilistic model mathematical and computer techniques; reference ratio method mathematical and computer techniques; statistical protein structure prediction mathematical and computer techniques; Biochemistry and Molecular Biophysics; Models and Simulations; BIOCHEMISTRY; BIOPHYSICS; CONTACT MAP PREDICTION; FORCE-FIELD; STATISTICAL POTENTIALS; FOLDING SIMULATIONS; NEURAL-NETWORKS; CHALLENGES; ENSEMBLES; PROFILES; ENERGIES; DYNAMICS; Bayesian models; Disordered proteins; Protein structure prediction; Reference ratio method; Template-free modeling
Main Research Area: