The complementarity plot (CP) is a graphical tool for structural validation of atomic models for both folded globular proteins and protein-protein interfaces.[1][2][3] It is based on a probabilistic representation of preferred amino acid side-chain orientation, analogous to the preferred backbone orientation of Ramachandran plots). It can potentially serve to elucidate protein folding as well as binding. The upgraded versions of the software suite is available and maintained in github for both folded globular proteins[4] as well as inter-protein complexes.[5] The software is included in the bioinformatic tool suites OmicTools[6] and Delphi tools.[7]

A pictorial description of the Complementarity Plot with its different regions
Distribution of points in the Complementarity Plot corresponding to buried amino acid side-chains from a high resolution protein crystal structure

Background edit

Validation of three dimensional protein crystal structures are traditionally based on a multitude of parameters ranging from (i) the distribution of residues in the Ramachandran plot,[8][9] (ii) deviations from ideality,[10][11] for bond lengths and angles, (iii) atomic short contacts (steric clash scores),[12] (iv) the distribution of the side-chain conformers (rotamers)[13] and, (v) hydrogen bonding parameters.[14] The advent of the complementarity plot as a structural validation tool for proteins essentially provides a conjugation of the traditional approaches. CP detects both local errors in atomic coordinates and also correctly matches an amino acid sequence to its native three dimensional fold situated amid decoys. The Complementarity Plot is based on the combined use of shape and electrostatic complementarity[1] of completely / partially buried residues with respect to their environment constituted by rest of the polypeptide chain and is a sensitive indicator of the harmony or disharmony of interior residues with regard to the short and long range forces sustaining the native fold. The term 'Complementarity Plot' (CP) is perhaps a misnomer as there are actually three plots, each serving a given range of solvent exposure of the plotted residues (CP1, CP2, CP3 for burial bins 1, 2, 3).

Pictorial description edit

The complementarity plot has been largely inspired by the Ramachnadran Plot in its design (but not in its physicochemical attributes). Ramachandran Plot is deterministic in nature, in contrast, CP is probabilistic. Ramachandran plot deals with main-chain torsion angles and errors in such parameters are essentially locally restricted. In contrast, CP deals with geometric and electrostatic fit of the interior side-chains with their local and non-local neighborhood. Disharmony (misfit) in these conjugated parameters may arise due to a plethora of errors coming from bond angles or torsions from effectively the whole folded polypeptide chain. However, analogous to the Ramachandran Plot, the region within the first contour is termed 'probable' (analogous to the 'allowed' region), between the first and second contour, 'less probable' ('partially allowed') and outside the second contour 'improbable' ('disallowed').

Applications edit

CP has a multitude of applications in experimental as well as in computational structural biology.[2] Thorough investigation of the effect of small errors in both main- and side-chain bond angles / torsions on the overall fold shows that the CP is effective in the detection of these errors even while failure of the other already existing parameters based on prohibition of local steric overlap and deviation from ideality. Consequences of such small angular errors are not restricted locally, resulting in geometric and electrostatic misfit of interior residues throughout the fold, potentially detectable by the CPs. These errors may arise from (i) misfitting of side-chain torsions/ wrong rotamer assignments (especially relevant for low-resolution structures), (ii) incorrect tracing of the main-chain trajectories during refinement (resulting in low-intensity errors diffused over the entire polypeptide chain). CP can also detect packing anomalies, and, in particular, can potentially signal unbalanced partial charges within protein interiors. It is useful in homology modeling and protein design. A version of the plot (CPint)[15] has also been built and made available to probe similar errors in protein-protein interfaces.


CPdock edit

 
CPdock

In contrast to the residue-wise plots, there is also a variant available for the Complementarity Plot, namely CPdock [16] for plotting single Sc, EC values for the protein-protein interface and adjudging thereby the quality of the complex atomic structure (either experimentally solved or computationally built) therein. Sc,[17] EC[18] are shape and electrostatic complementarities computed for 'interacting protein-protein surfaces' originally proposed by Peter Colman and co-workers in the 1990s. CPdock was primarily developed as a scoring function to serve as an initial filter in protein-protein docking and can be a very helpful tool in protein design - as has lately been demonstrated in COVID-research both in scoring as well as in the evaluation of docked complexes to eliminate the effect of co-substrate binding in a targeted inhibitor binding.[19][20]

Software edit

CP@SINP: http://www.saha.ac.in/biop/www/sarama.html
CP: https://github.com/nemo8130/SARAMA-updated
CPint: https://github.com/nemo8130/SARAMAint-updated
CPdock: https://github.com/nemo8130/CPdock

EnCPdock (web-server): https://scinetmol.in/EnCPDock/

References edit

  1. ^ a b Basu S, Bhattacharyya D, Banerjee R (2012) Self-Complementarity within Proteins: Bridging the Gap between Binding and Folding. Biophys J 102:2605–2614 . doi: 10.1016/j.bpj.2012.04.029 http://www.saha.ac.in/biop/www/sarama.html
  2. ^ a b Basu S, Bhattacharyya D, Banerjee R (2014) Applications of complementarity plot in error detection and structure validation of proteins. Indian J Biochem Biophys 51:188–200
  3. ^ Basu S, Bhattacharyya D, Wallner B (2014) SARAMAint: The Complementarity Plot for Protein–Protein Interface. J Bioinforma Intell Control 3:309–314 . doi: 10.1166/jbic.2014.1103
  4. ^ basu, sankar (26 June 2017). "SARAMA-updated: Complementarity Plot : A structure validation tool for globular proteins". GitHub.
  5. ^ basu, sankar (26 June 2017). "SARAMAint-updated: The Complementarity Plot for Protein-Protein Interface (CPint)". GitHub.
  6. ^ "SARAMA: a bioinformatics tool for Structure validation | Protein structure analysis". omictools.
  7. ^ Group, Professor Emil Alexov. "Computational Biophysics & Bioinformatics". compbio.clemson.edu.
  8. ^ Ramachandran, G.N., Ramakrishnan, C., Sasisekharan, V., Stereochemistry of polypeptide chain configurations. J.Mol. Biol., 1963, 7, 95-99.
  9. ^ Kleywegt, G.J., Jones, T.A., Phi/Psi-chology: Ramachandran revisited. Structure., 1996, 4, 1395–1400.
  10. ^ Touw, W.G., and Vriend, G., On the complexity of Engh and Huber refinement restraints: the angle tau as example. Acta Cryst D, 2010, 66, 1341–1350.
  11. ^ Laskowski, R.A., MacArthur, M.W., Moss, D.S., Thornton, J.M., PROCHECK: a program to check the stereochemical quality of protein structures. J. Appl. Crystallogr., 1993, 26, 283-291.
  12. ^ Davis,I.W., Leaver-Fay, A., Chen, V.B., Block, J.N., Kapral, G.J., Wang,X., Murray, L.W., Arendall, W.B., III, Snoeyink, J., Richardson, J.S., Richardson, D.C., MolProbity: all-atom contacts and structure validation for proteins and nucleic acids. Nucl. Acids. Res.,35, W375–W383.
  13. ^ Shapovalov, M.S., and Dunbrack, R.L., Jr. A smoothed backbone-dependent rotamer library for proteins derived from adaptive kernel density estimates and regressions. Structure., 2001, 19, 844-858.
  14. ^ Hooft, R.W.W., Sander, C., and Vriend, G., Positioning hydrogen atoms by optimizing hydrogen-bond networks in protein structures. Proteins., 1996, 26, 363-376.
  15. ^ Basu, Sankar; Bhattacharyya, Dhananjay; Wallner, Björn (1 December 2014). "SARAMAint: The Complementarity Plot for Protein–Protein Interface". Journal of Bioinformatics and Intelligent Control. 3 (4): 309–314. doi:10.1166/jbic.2014.1103.
  16. ^ Basu, Sankar (2017-12-07). "CPdock: the complementarity plot for docking of proteins: implementing multi-dielectric continuum electrostatics". Journal of Molecular Modeling. 24 (1): 8. doi:10.1007/s00894-017-3546-y. ISSN 0948-5023. PMID 29218430. S2CID 23174519.
  17. ^ Lawrence, M. C.; Colman, P. M. (1993-12-20). "Shape complementarity at protein/protein interfaces". Journal of Molecular Biology. 234 (4): 946–950. doi:10.1006/jmbi.1993.1648. ISSN 0022-2836. PMID 8263940.
  18. ^ McCoy, A. J.; Chandana Epa, V.; Colman, P. M. (1997-05-02). "Electrostatic complementarity at protein/protein interfaces". Journal of Molecular Biology. 268 (2): 570–584. doi:10.1006/jmbi.1997.0987. ISSN 0022-2836. PMID 9159491.
  19. ^ Basu, Sankar; Chakravarty, Devlina; Bhattacharyya, Dhananjay; Saha, Pampa; Patra, Hirak K. (2021-05-31). "Plausible blockers of Spike RBD in SARS-CoV2-molecular design and underlying interaction dynamics from high-level structural descriptors". Journal of Molecular Modeling. 27 (6): 191. doi:10.1007/s00894-021-04779-0. ISSN 0948-5023. PMC 8165686. PMID 34057647.
  20. ^ Basu, Sankar; Assaf, Simon S.; Teheux, Fabian; Rooman, Marianne; Pucci, Fabrizio (2021). "BRANEart: Identify Stability Strength and Weakness Regions in Membrane Proteins". Frontiers in Bioinformatics. 1. doi:10.3389/fbinf.2021.742843. ISSN 2673-7647. PMC 9581023. PMID 36303753.