The contact order of a protein is a measure of the locality of the inter-amino acid contacts in the protein's native state tertiary structure[1]. It is calculated as the average sequence distance between residues that form native contacts in the folded protein divided by the total length of the protein. Higher contact orders indicate longer folding times,[2][3] and low contact order has been suggested as a predictor of potential downhill folding, or protein folding that occurs without a free energy barrier.[4] This effect is thought to be due to the lower loss of conformational entropy associated with the formation of local as opposed to nonlocal contacts.[3]

Relative contact order (CO) is formally defined as:

where N is the total number of contacts, ΔSi,j is the sequence separation, in residues, between contacting residues i and j, and L is the total number of residues in the protein.[2] The value of contact order typically ranges from 5% to 25% for single-domain proteins, with lower contact order belonging to mainly helical proteins, and higher contact order belonging to proteins with a high beta-sheet content.

Protein structure prediction methods are more accurate in predicting the structures of proteins with low contact orders. This may be partly because low contact order proteins tend to be small, but is likely to be explained by the smaller number of possible long-range residue-residue interactions to be considered during global optimization procedures that minimize an energy function.[5] Even successful structure prediction methods such as the Rosetta method overproduce low-contact-order structure predictions compared to the distributions observed in experimentally determined protein structures.[3]

The percentage of the natively folded contact order can also be used as a measure of the "nativeness" of folding transition states. Phi value analysis in concert with molecular dynamics has produced transition-state models whose contact order is close to that of the folded state in proteins that are small and fast-folding.[6] Further, contact orders in transition states as well as those in native states are highly correlated with overall folding time.[7]

In addition to their role in structure prediction, contact orders can themselves be predicted based on a sequence alignment, which can be useful in classifying the fold of a novel sequence with some degree of homology to known sequences.[8]

See also edit

References edit

  1. ^ Plaxco, Kevin W; Simons, Kim T; Baker, David (1998-04-10). "Contact order, transition state placement and the refolding rates of single domain proteins11Edited by P. E. Wright". Journal of Molecular Biology. 277 (4): 985–994. doi:10.1006/jmbi.1998.1645. ISSN 0022-2836.
  2. ^ a b Plaxco, Kevin W; Simons, Kim T; Baker, David (April 1998). "Contact order, transition state placement and the refolding rates of single domain proteins". Journal of Molecular Biology. 277 (4): 985–994. doi:10.1006/jmbi.1998.1645. PMID 9545386.
  3. ^ a b c Bonneau, Richard; Ruczinski, Ingo; Tsai, Jerry; Baker, David (August 2002). "Contact order and ab initio protein structure prediction". Protein Science. 11 (8): 1937–1944. doi:10.1110/ps.3790102. PMC 2373674. PMID 12142448.
  4. ^ Zuo, G; Wang, J; Wang, W (2006). "Folding with downhill behavior and low cooperativity of proteins". Proteins. 63 (1): 165–73. doi:10.1002/prot.20857. PMID 16416404. S2CID 11970404.
  5. ^ Mount DM. (2004). Bioinformatics: Sequence and Genome Analysis 2nd ed. Cold Spring Harbor Laboratory Press: Cold Spring Harbor, NY.
  6. ^ Pandit, AD; Jha, A; Freed, KF; Sosnick, TR (2006). "Small proteins fold through transition states with native-like topologies". J Mol Biol. 361 (4): 755–70. doi:10.1016/j.jmb.2006.06.041. PMID 16876194.
  7. ^ Paci, E; Lindorff-Larsen, K; Dobson, CM; Karplus, M; Vendruscolo, M (2005). "Transition state contact orders correlate with protein folding rates". J Mol Biol. 352 (3): 495–500. doi:10.1016/j.jmb.2005.06.081. PMID 16120445.
  8. ^ Shi, Yi; Zhou, Jianjun; Arndt, David; Wishart, David S.; Lin, Guohui (2008). "Protein contact order prediction from primary sequences". BMC Bioinformatics. 9: 255. doi:10.1186/1471-2105-9-255. PMC 2440764. PMID 18513429.