Non-canonical base pairing edit

File:Hoogsteen and Watson–Crick base pairing.png
Figure 1: Comparison of canonical Watson-Crick base pairing and non-canonical Hoogsteen base pairing.

Non-canonical base pairing occurs when nucleobases hydrogen bond, or base pair, to one another in schemes other than the standard Watson-Crick base pairs (which are adenine (A) -- thymine (T) in DNA, adenine (A) -- uracil (U) in RNA, and guanine (G) -- cytosine (C) in both DNA and RNA). The first discovered non-canonical base pairs are Hoogsteen base pairs, which were first described by American biochemist Karst Hoogsteen.

Non-canonical base pairings commonly occur in the secondary structure of RNA (e.g. pairing of G with U), and in tRNA recognition. These non-Watson-Crick base pairs allow for many unique RNA structures that allow RNA to participate in many diverse functions throughout the cell. Non-canonical base pairs are typically less stable than standard base pairings.[1] The presence of non-canonical base pairs in double stranded DNA results in a disrupted double helix.[2]

History edit

James Watson and Francis Crick published the double helical structure of DNA and proposed the canonical Watson-Crick base pairs in 1953.[3] Ten years later, in 1963, Karst Hoogsteen reported that he had used single crystal X-ray diffraction to investigate alternative base pair structures, and he found an alternative structure for the nucelobase pair adenine-thymine in which the purine (A) takes on an alternative conformation with respect to the pyrimidine (T).[4] Five years after Hoogsteen proposed the A-T Hoogsteen base pair, optical rotary dispersion spectra which provided evidence for a G-C Hoogsteen base pair were reported.[5] The G-C Hoogsteen base pair was first observed via X-ray crystallography years later, in 1986, by co-crystallizing DNA with triostin A (an antibiotic).[6] Ultimately, after years of studying both Watson-Crick and Hoogsteen base pairs, it has been determined that both occur naturally in DNA, and that they exist in equilibrium with one another; the conditions in which the DNA exists ultimately determine which form will be favored.[7]

Since the structures of the canonical Watson-Crick and non-canonical Hoogsteen base pairs were determined, many other types of non-canonical base pairs have been presented and described. Types of non-canonical base pairs can be defined by which faces of the nucleobases are interacting in a given base pair. When classified this way, nearly 40 types of non-canonical base pairs can be identified. These can be sorted into the larger categories of cis and trans.[1]

Structure edit

Base pairing edit

 
Figure 2: Base pairing edge of nucleobases. Top figure is an example of a pyrimidine (Adenine) where the edges are known as Watson-Crick, Hoogsteen, and Sugar Edges. Bottom figure is an example of a Purine (Cytosine) with the Watson-Crick, C-H, and Sugar Edges

60% of the paired bases in RNA structures are the canonical Watson-Crick base pairs with the remaining being non-canonical base pairs.[8] Base pairing occurs when two bases form hydrogen bonds with each other. These hydrogen bonds can be either polar or non-polar interactions. The polar hydrogen bonds are formed by N-H...O/N and/or O-H...O/N interactions. Non-polar hydrogen bonds are formed between C-H...O/N. [9]

Edge Interactions edit

Each base has three potential edges where it can interact with another base. The Pyrimidine bases have 3 edges which are able to hydrogen bond. Those are known as the Watson-Crick edge(WC), the Hoogsteen edge(H), and the Sugar edge(S). Purine bases also have three hydrogen-bonding edges.[8] Like the Pyrimidine there is the Watson-Crick edge(WC) and the Sugar edge(S) but the third edge is refereed to as the "C-H" edge(H). This C-H edge is sometimes also referred to as the Hoogsteen edge for simplicity. There various edges for the Purine and Pyrimidine bases are shown in Figure 2.[9]

 
Cis and Trans Orientations of the glycosidic bond in RNA base pairs.

Besides the three edges of interaction Base pairs also vary in various cis/trans forms. The cis and trans structures depend on the orientation of the ribose sugar as compared to the hydrogen bond interaction. These various orientations are shown in Figure 3. With the cis/trans forms and the 3 edges of hydrogen bonding there are 12 basic types of base pairing geometries which can be found in RNA structures. Those 12 types are WC:WC (cis/trans), W:HC (cis/trans), WC:S (cis/trans), H:S (cis/trans), H:H (cis/trans), and S:S (cis/trans).[9]

Classification of Base Pairs edit

These 12 types can be further divided into more subgroups which are dependent on the directionality of the glycosidic bonds and steric extensions.[10] With all of the various base pair combinations there are 169 theoretically possible base pair combinations. The number of actual base pair combinations is however much lower since some of the combinations result in non-favorable interactions. This number of possible non-canonical base pairs is still being determined since it is very dependent on the base pairing criteria. [11]Understanding the base pair configuration is difficult since the pairing is very dependent on the bases surroundings. These surroundings consist of adjacent base pairs, adjacent loops, or third interaction such as a base triple. [12]

 
Figure 4: 6 rigid-body parameters

Since the various bases are rigid and planar, the bonding between them bases are well defined. The spatial interactions between the two bases can be classified in 6 rigid-body parameters or intra-base pair parameters (3 translational, 3 rotational) as shown in Figure 4. [13]These parameters describe the base pairs three dimensional confirmation. The three translational arrangements are known as shear, stretch, and stagger. These three parameters are directly related to the proximity and direction of the pairs hydrogen bonding. The rotational arrangements are buckle, propeller, and opening. Rotational arrangements relate to the non-planar confirmation as compared to the ideal coplanar geometry. [9] Intra-base pair parameters are used to determine the structure and stabilities of non-canonical base pairs. These parameters were originally created for the base pairings in DNA but can also fit the non-canonical base models.[13]

Types of Base Pairs edit

The most common non-canonical base pairs are trans A:G Hoogsteen/sugar edge, A:U Hoogsteen/WC, and G:U Wobble pairs.[14]

Hoogsteen Base Pairs edit

 
Figure 5: An example of non-canonical base pairing. Shown is a Hoogsteen AU base pair.

Hoogsteen base pairs occur between adenine (A) and thymine(T), and guanine (G) and cytosine(C), similarly to Watson-Crick base pairs; however, the purine takes on an alternative conformation with respect to the pyrimidine. In the A-U Hoogsteen base pair, the adenine is rotated 180° about the glycosidic bond, resulting in an alternative hydrogen bonding scheme which has one hydrogen bond in common with the Watson-Crick base pair (adenine N6 and thymine N4), while the other, instead of occurring between adenine N1 and thymine N3 as in the Watson-Crick base pair, occurs between adenine N7 and thymine N3.[7] The A-U base pair is shown in Figure 5. In the G-C Watson-Crick base pair, similarly to the A-T Hoogsteen base pair, the purine (guanine) is rotated 180° about the glycosidic bond while the pyrimidine (cytosine) remains in place. One hydrogen bond from the Watson-Crick base pair is maintained (guanine O6 and cytosine N4) and the other occurs between guanine N7 and a protonated cytosine N3 (note that the Hoogsteen G-C base pair has two hydrogen bonds, while the Watson-Crick G-C base pair has three).[7]

 
Figure 6: Four examples of wobble base pairs.

Wobble Base Pairs edit

Wobble base pairing occur between two nucleotides that are not Watson-Crick base pairs. The 4 main examples are guanine-uracil (G-U), hypoxanthine-uracil (I-U), hypoxanthine-adenine (I-A), and hypoxanthine-cytosine (I-C). These wobble base pairs are very important in tRNA. Most organisms have less than 45 tRNA molecules but 61 tRNA molecules would be necessary to canonically pair to the codon. Wobble base pairing was proposed by Watson in 1966. Wobble base pairing allows for the 5' anticodon to non-standard base pair.

Two and Three-Dimensional Structures edit

Secondary and three-dimensional structures of of RNA are formed and stabilized through non-canonical base pairs. Base pairs make up many secondary structural blocks which aid the folding of RNA complexes and three dimensional structures. The overall folded RNA is stabilized by the tertiary and secondary structures canonically base pairing together. [9] Due to the many non-canonical base pairs there are an unlimited amount of structures which allow for the diverse functions of RNA.[8] The arrangement of the non-canonical bases allow long-range RNA interactions, recognition of proteins and other molecules, and structural stabilizing elements.[13] Many of the common non-canonical base pairs can be added to a stacked RNA stem without disturbing its helical character.[15]

Secondary Structure edit

 
Figure 7: This depicts a hairpin structure found in pre m-RNA

Basic secondary structural elements of RNA include bulges, double helices, hairpin loops and internal loops. An example of a hairpin loop of RNA is given in Figure 7. As shown in the figure hairpin loops and internal loops require a sudden change in backbone direction. Non-canonical base pairing allows for increased flexibility at junctions or turns in the secondary structure. [9]

Three Dimensional Structures edit

 
Figure 8: An xxample of a Pseudoknot three-dimensional structure.

Three-dimensional structures are formed through the long-range intra-molecular interactions between the secondary structures. This leads to the formation of pseudoknots, ribose zippers, kissing hairpin loops, or co-axial pseudocontinuous helices. [9] The three-dimensional structures of RNA are primarily determined through molecular simulations or computationally guided measurements. [13]

Experimental Methods edit

Watson-Crick canonical base pairing is not the only edge-to-edge conformation possible for the nucleotide since non-canonical pairing can take place as well. Sugar-phosphate backbone has an ionic character, which makes the bases sensitive to their environment, leading to conformational changes, such as non-canonical pairing.[16][1] There are various methods of prediction for these conformations, such as NMR structure determination and X-ray crystallography.[16] Most recently, however, a new algorithm has been created in order to analyze, reconstruct, and visualize three-dimensional nucleic acid structures, called 3DNA software.[16] 3DNA uses the data collected from the DNA sequence of interest, and using its algorithm, can detect that two bases will pair if one or more hydrogen bonds are identified between them. The program models the exact position of the hydrogen atoms in the structure, in a three-dimensional representation.[16][1] Other software has been developed, such as CURVES+, which allows researchers to submit a nucleic acid structure and alienate the individual nucleotide of interest.[17] This allows for a more detailed study of the backbone, the grooves parameters of the nucleic acid, and the base pair bonding and geometry.[17] There is also NUPARM-Plus, which is a free-access program that also allows the researcher to upload their nucleic acid structure and alienate the base pair of interest.[18] However, NUPARM-Plus focuses on indicating the planarity of the axis between the bases and the quality of the hydrogen bond as well.[18] In general, these different methods help identify and/or predeict the non-canonical base pairing that leads to different structural conformations.

Biological Applications edit

RNA has many purposes throughout the cell including many important steps in gene expression. Various conformations of the non-Watson-Crick base pairs allow for a multitude of biological functions such as mRNA splicing, siRNA, transport, protein recognition, protein binding, and translation.[19][20]

One example of a biological application of non-canonical base pairs in in the kink turn. A kink-turn is found throughout many functional RNA species. It is comprised of a three-nucleotide bilge which is due to 3 Hoogsteen base pairs. This kink-turn acts as a marker where various proteins can bind such as the human 15-5k protein or proteins in the L7Ae family.[21] A similar scenario is described in the binding of the HIV-1 Rev-response element (RRE) RNA. The RNA has an extra wide deep groove that is caused by cis Watson-Crick G:A pair followed by a trans Watson-Crick G:G. The HIV-1 Rev-response element is then able to bind due to the deepened groove.[15]


See also edit

References edit

  1. ^ a b c d Lemieux, Sébastien; Major, François (1 October 2002). "RNA canonical and non-canonical base pairing types: a recognition method and complete repertoire". Nucleic Acids Res. 30 (19): 4250–4263. doi:10.1093/nar/gkf540. PMC 140540. PMID 12364604.
  2. ^ Das, Jhuma; et al. (2006). "Non-Canonical Base Pairs and Higher Order Structures in Nucleic Acids: Crystal Structure Database Analysis". Journal of Biomolecular Structure and Dynamics. 24 (2): 149–161. doi:10.1080/07391102.2006.10507108. PMID 16928138.
  3. ^ Watson, J. D.; Crick, F. H. C. (1953-04). "Molecular Structure of Nucleic Acids: A Structure for Deoxyribose Nucleic Acid". Nature. 171 (4356): 737–738. doi:10.1038/171737a0. ISSN 1476-4687. {{cite journal}}: Check date values in: |date= (help)
  4. ^ Hoogsteen, K. (1963-09-10). "The crystal and molecular structure of a hydrogen-bonded complex between 1-methylthymine and 9-methyladenine". Acta Crystallographica. 16 (9): 907–916. doi:10.1107/S0365110X63002437. ISSN 0365-110X.
  5. ^ Courtois, Y.; Fromageot, P.; Guschlbauer, W. (1968). "Protonated Polynucleotide Structures". European Journal of Biochemistry. 6 (4): 493–501. doi:10.1111/j.1432-1033.1968.tb00472.x. ISSN 1432-1033.
  6. ^ Quigley, G. J.; Ughetto, G.; Marel, GA van der; Boom, JH van; Wang, A. H.; Rich, A. (1986-06-06). "Non-Watson-Crick G.C and A.T base pairs in a DNA-antibiotic complex". Science. 232 (4755): 1255–1258. doi:10.1126/science.3704650. ISSN 0036-8075. PMID 3704650.
  7. ^ a b c Nikolova, Evgenia N.; Zhou, Huiqing; Gottardo, Federico L.; Alvey, Heidi S.; Kimsey, Isaac J.; Al-Hashimi, Hashim M. (2013-12). "A Historical Account of Hoogsteen Base-pairs in Duplex DNA". Biopolymers. 99 (12). doi:10.1002/bip.22334. ISSN 0006-3525. PMC 3844552. PMID 23818176. {{cite journal}}: Check date values in: |date= (help)
  8. ^ a b c Leontis, Neocles B. (2001). "Geometric Nomenclature and Classifications of RNA base pairs". RNA. 7: 499–512 – via WebofKnowledge.
  9. ^ a b c d e f g Halder, Sukanya; Bhattacharyya, Dhananjay (2013-11-01). "RNA structure and dynamics: A base pairing perspective". Progress in Biophysics and Molecular Biology. 113 (2): 264–283. doi:10.1016/j.pbiomolbio.2013.07.003. ISSN 0079-6107.
  10. ^ Šponer, Judit E.; Leszczynski, Jerzy; Sychrovský, Vladimír; Šponer, Jiří (2005-10). "Sugar Edge/Sugar Edge Base Pairs in RNA: Stabilities and Structures from Quantum Chemical Calculations". The Journal of Physical Chemistry B. 109 (39): 18680–18689. doi:10.1021/jp053379q. ISSN 1520-6106. {{cite journal}}: Check date values in: |date= (help)
  11. ^ Sharma, Purshotam; Šponer, Judit E.; Šponer, Jiří; Sharma, Sitansh; Bhattacharyya, Dhananjay; Mitra, Abhijit (2010-03-11). "On the Role of the cis Hoogsteen:Sugar-Edge Family of Base Pairs in Platforms and Triplets—Quantum Chemical Insights into RNA Structural Biology". The Journal of Physical Chemistry B. 114 (9): 3307–3320. doi:10.1021/jp910226e. ISSN 1520-6106.
  12. ^ Heus, Hans A.; Hilbers, Cornelis W. (2003-10). "Structures of Non-canonical Tandem Base Pairs in RNA Helices: Review". Nucleosides, Nucleotides and Nucleic Acids. 22 (5–8): 559–571. doi:10.1081/NCN-120021955. ISSN 1525-7770. {{cite journal}}: Check date values in: |date= (help)
  13. ^ a b c d Olson, Wilma K.; Li, Shuxiang; Kaukonen, Thomas; Colasanti, Andrew V.; Xin, Yurong; Lu, Xiang-Jun (2019-05-08). "Effects of Noncanonical Base Pairing on RNA Folding: Structural Context and Spatial Arrangements of G·A Pairs". Biochemistry: acs.biochem.9b00122. doi:10.1021/acs.biochem.9b00122. ISSN 0006-2960. PMC 6729125. PMID 31008589.{{cite journal}}: CS1 maint: PMC format (link)
  14. ^ Roy, Ashim; Panigrahi, Swati; Bhattacharyya, Malyasri; Bhattacharyya, Dhananjay (2008-3). "Structure, Stability, and Dynamics of Canonical and Noncanonical Base Pairs: Quantum Chemical Studies". The Journal of Physical Chemistry B. 112 (12): 3786–3796. doi:10.1021/jp076921e. ISSN 1520-6106. {{cite journal}}: Check date values in: |date= (help)
  15. ^ a b Hermann, Thomas; Westhof, Eric (1999-12). "Non-Watson-Crick base pairs in RNA-protein recognition". Chemistry & Biology. 6 (12): R335–R343. doi:10.1016/S1074-5521(00)80003-4. {{cite journal}}: Check date values in: |date= (help)
  16. ^ a b c d Lu, Xiang-Jun; Olson, Wilma K. (2003-09-01). "3DNA: a software package for the analysis, rebuilding and visualization of three‐dimensional nucleic acid structures". Nucleic Acids Research. 31 (17): 5108–5121. doi:10.1093/nar/gkg680. ISSN 0305-1048.
  17. ^ a b Blanchet, Christophe; Pasi, Marco; Zakrzewska, Krystyna; Lavery, Richard (2011-07-01). "CURVES+ web server for analyzing and visualizing the helical, backbone and groove parameters of nucleic acid structures". Nucleic Acids Research. 39 (suppl_2): W68–W73. doi:10.1093/nar/gkr316. ISSN 0305-1048. PMC 3125750. PMID 21558323.{{cite journal}}: CS1 maint: PMC format (link)
  18. ^ a b "NUPARM-Plus-A Program for analyzing sequence dependent variations in nucleic acids". nucleix.mbu.iisc.ernet.in. Retrieved 2019-10-08.
  19. ^ Fernandes, Cláudia L.; Escouto, Gabriela B.; Verli, Hugo (2013-06-28). "Structural glycobiology of heparinase II fromPedobacter heparinus". Journal of Biomolecular Structure and Dynamics. 32 (7): 1092–1102. doi:10.1080/07391102.2013.809604. ISSN 0739-1102.
  20. ^ Storz, Gisela; Altuvia, Shoshy; Wassarman, Karen M. (2005-06-01). "An abundance of rna regulators". Annual Review of Biochemistry. 74 (1): 199–217. doi:10.1146/annurev.biochem.74.082803.133136. ISSN 0066-4154.
  21. ^ Huang, Lin; Lilley, David M. J. (2018). "The kink-turn in the structural biology of RNA". Quarterly Reviews of Biophysics. 51: e5. doi:10.1017/S0033583518000033. ISSN 0033-5835.