The central role of RNA in cellular function has been well established through numerous studies. Not only does RNA translate the genetic code decoding it into protein, but it also has various catalytic and regulatory functions yet to be completely elucidated. RNA versatility and the complex roles which it plays in cellular life and function are possible through its extensive variation in length from microRNAs and transfer RNAs (tRNAs) of less than 100 nucleosides in length, to long non-coding RNAs (lncRNAs), messenger RNAs (mRNAs) and ribosomal RNAs (rRNAs) of 1000s of nucleosides in length. Its six backbone torsion angles and glycosidic bond between base and ribose provide conformational dynamics not afforded by the peptide bond of proteins and more flexibility than the DNA backbone. RNA has over 100 different post-transcriptional modifications. The post-transcriptional modifications are enzymatically inserted at site-specific locations, for instance the methylation of uridine to 5-methyluridine or ribothymidine is almost invariant at position 54 in all tRNAs. Some of the modifications are quite common, such as 2’-O-methylation and pseudouridylation. Others are found uniquely in one RNA species such as the complex tricyclic modification of G to wyosine (9H-imidazo[1,2-a]purin-9-one,3,4-dihydro-4,6-dimethyl-3-b-d-ribofuranosyl-) found at position 37 in tRNAPhe of eucaryotes. The chemical, dynamic and structural properties of the modified nucleosides far surpass those of the individual characteristics provided by the four major nucleosides: the purine (Pu) ribonucleosides, adenosine (A) and guanosine (G), and the pyrimidine (Py) ribonucleosides, cytidine (C) and uridine (U). The chemical diversity of modifications is similar to that of amino acid side chains and can be as simple as methylations or single atom substitutions of sulfur for oxygen to as complex as the addition of amino acids and formation of a third ring to the purine nucleobase. Even modifications as simple as methyl groups can exhibit distinguishable dynamic motions that depend on their location within the RNA structure, for instance the methyl groups of yeast tRNAPhe. The prodigious chemical variation provided by the numerous modifications hitherto discovered allow RNA to perform numerous cellular functions.
Although the existence of modified nucleosides has been known for over six decades, much of how modified nucleosides alter the structure, function, and properties of RNAs is still unknown. More than a hundred modified nucleosides can be found in all domains of life in mRNA, rRNA, and other RNAs such as small nucleolar RNA (snoRNA) and small nuclear RNA (snRNA). Still, the greatest number and variety of modifications can be found in tRNAs (Figure 1).
Modifications of the nucleobases and of the 2’-OH of the ribose are crucial for various biological processes (Figure 2). While the biologically important role of RNA modifications has been extensively studied, the physicochemical nature of the RNA modifications is also of interest because these properties enable or facilitate RNA function. Reviews specifying the chemical moieties found at various positions of tRNA can be found elsewhere. Here, we will focus on the impact of specific modified nucleoside chemistries which add to the chemistry of tRNAs and alter their structure and stability.
In general, modified nucleosides impact the overall structure and local chemistry of tRNAs, the ability of macromolecules to recognize tRNAs, or the decoding ability of tRNAs. Specific modifications commonly found at the junction of the stems of the cloverleaf structure (Figure 1), referred to as the core, elbow or hinge in the three-dimensional structure of most tRNAs, are often associated with functional folding, allowing tRNAs to fold into the common L-shape form critical to ribosomal protein synthesis. Similarly, common features found in several tRNAs are often used by proteins and other macromolecules as recognition determinants to interact with specific tRNA species, or tRNAs in general. The properties and modified nucleosides of the tRNA core are often used as chemical and structural determinants, such as in the case of tRNA-dependent gene regulation in Gram-positive bacteria, the T-box mechanism.
Certain modifications are only found at very specific locations, whereas others can be found in a number of locations common to many tRNA species (Figure 1). For example, the purine at position 37 of a number of tRNAs is modified with a bulky moiety not found at other positions. Similarly, tRNAs with a uridine at wobble position-34 read A-ending codons. The wobble position uridines are often modified with a methylene carbon directly bonded to the C5 atom of the uracil nucleobase (xm5U) with a variety of further derivations. The xm5U modifications are indicative of a specificity required of tRNA for the accurate decoding of mRNA. In contrast, pseudouridine (Ψ) is found at multiple positions such as 38, 39, and 55 among others. The overall structure of tRNA is folded correctly in part due to certain modifications, particularly methylations, found at the core. The three-dimensional L-shape is maintained with the help of modified nucleosides that drive the structure into forming strong, favorable interactions between the D- and T-arms. Furthermore, the local chemical environment can be either stabilized or destabilized by a single modified nucleoside to allow for greater rigidity or flexibility, respectively, through stacking and hydrogen bonding capabilities. Complex modifications at the wobble position and position 37 have both structural as well as functional roles imperative to decoding. Here, we discuss and cite examples of how the chemical and dynamic nature of the modified nucleosides affects their ability to elicit functionally important properties of tRNA, cloverleaf secondary structure, L-shaped three-dimensional structure, anticodon conformation and dynamics.
2.1.1. Position 9 and 10 (A9, G9 and G10)
Nucleoside positions 9 and 10 of the tRNA cloverleaf two-dimensional structure are important to the RNA’s functional folding. Especially evident are their contributions to the folding of mitochondrial (mt) tRNAs with truncated DSLs and/or TSLs. The methylations resulting in N1-methylguanosine, m1G9, and N1-methyladenosine, m1A9, have been implicated in the correct folding of mitochondrial tRNA. Lack of the m1G9 and m1A9 modifications through mutation of one of the cytoplasmic orthologs of the methyltransferase (TRMT10A) has been associated with microcephaly, intellectual disability, and short stature. Modification of A9 to m1A9 in the human mitochondrial tRNALys (mt tRNALys) disrupts formation of an intra-stem Watson–Crick base pair. The methylation to m1A9 is sufficient to shift the equilibrium from a stable alternative, non-canonical extended hairpin structure to the more favored, and functional cloverleaf secondary structure. The extended hairpin structure although predominating in a dynamic equilibrium with the cloverleaf structure, was not a substrate for aminoacylation by lysyl-tRNA synthetase, nor functional on the ribosome. The m1A9 was thought to be an early maturation event occurring after maturation of the 5’ terminus of mt tRNALys by RNAse P. The presence of the methyl group of m1A9 in mt tRNALys stabilizes the cloverleaf structure by 3–4 kJ/mol and is dependent on the Mg2+ concentration. Such structural plasticity is a common characteristic in animal mitochondrial tRNAs including mt tRNALeuUUU and mt tRNAAsp where the cloverleaf structure is attained often after certain bases have been modified, and with Mg2+ coordination.
By example, the mitochondrial tRNAs of the nematode Ascaris suum lack either the TSL or the DSL found in the tRNAs of most organisms and are characterized by the presence of m1A9. tRNAs without this modification had reduced aminoacylation and lower EF-Tu (elongation factor thermos unstable) binding activities compared to native tRNAs. Aminoacylation and tertiary structure were nearly restored to normal by the presence of a single methyl group, indicating the importance m1A9 in the absence of the TSL in mitochondrial tRNAs in nematodes, and other organisms. Comparable to m1A9 in mt tRNA, N1-methylguanosine (m1G) disrupts canonical base pairing in mt and cytoplasmic tRNAs by virtue of the methyl-group blocking the Watson–Crick face of the nucleoside, thus disturbing secondary structures formation.
2.1.2. G10: 2-Methylguanosines
During tRNA synthesis, processing and modification, it is important that the secondary and tertiary base-pairings which yield the functional folding of tRNA occur and be maintained afterwards. For functional folding to be achieved, the four major stems terminate at a junction within the cloverleaf secondary structure leaving an open core (Figure 1 and Figure 3a). Convergence of the stems produce an internal loop of the molecule. Methylations of nucleosides at the junctions of the accepting stem and DSL, and the DSL and ASL are excellent examples of stem interruptions leading to the internal loop which is the core of tRNAs. The stem interruptions by 2-methylated guanosines at position 10 at the junction of the acceptor stem and the stem of the DSL, and at position 26 between the stem of the DSL and that of the ASL facilitate the secondary and tertiary folding of tRNAs (Figure 1 and Figure 3). This family of structurally related nucleosides, N2-methylguanosine (m2G), N2,N2-dimethylguanosine (m22G), and N2,N2,2′-O-trimethylguanosine (m22Gm), are conserved at positions 10 and 26 and control the L-fold in the tertiary tRNA structure in all three domains of life. The methyl groups, located on the Watson–Crick face of the nucleobase, negate canonical base pairing. The duplexes terminate at the modifications in bacteria, eukaryotes, and archaea tRNAs. For example, in the Stetteria hydrogenophila archaeal tRNAs, the modifications at the terminations of the duplexes play a crucial role in stabilizing tRNA conformation of the archaeal thermophiles—organisms thriving at inhospitably high temperatures. However, experimental substitution of the methylated guanosines has demonstrated that the N2-methylguanosines play an additional role to that of terminating tRNA stems at the junction of the cloverleaf structure.
Studies using base substitutions have shown the importance of interactions of certain bases with proteins. Since an inosine–cytosine (I●C) base pair resembles a G●C base pair structurally, substitutions of G with inosine have been used to assess the role of guanosines in maintaining the helical structure. (The filled circle, ●, designates a canonical or similar base pair.) Early studies using inosine-substituted tRNA by recombinant RNA technology, identified the 2-amino group in guanosines at positions 2, 3 and 10 as important structural determinants of tRNA for protein recognition. Inosine substitutions at positions 2, 3 and 10 decreased in vitro aminoacylation of E. coli tRNAGln by glutaminyl-tRNA synthetase (GlnRS), with a 300-fold decrease in the specificity constant (kcat/Km). These studies reveal G10 as a key structural identity element for GlnRS recognition and demonstrated the importance of hydrogen bonding between the 2-amino groups of the guanosine with the synthetase amino acids. The highly conserved G10 in E. coli tRNAs and the G10 interaction in the tRNAAsp–AspRS complex further suggest the importance of the methyl group and its effect on structure and dynamics as features recognized by the cognate aminoacyl-tRNA synthetases.
2.1.3. Position 26 (G26)
Without going into detail, folding of the tRNA cloverleaf secondary structure into the L-shaped tertiary structure is a general architectural feature of tRNA necessary for aminoacylation and translation on the ribosome. These general structural features of tRNAs have also been shown to be a necessary global recognition element for modifying enzymes, such as the methyltransferases producing m22G26 in yeast tRNAAsp and tRNAPhe. Although m22G26 in between the D-stem and the ASL is present in more than 80% eukaryotic tRNA containing a guanosine at position 26, many eukaryotic tRNAs have unmodified G26, e.g., yeast tRNAAsp. Studies using an in vitro modified dimethylguanosine in Xenopus laevis oocytes have identified the two consensus base pairs preceding G26 in the D-stem as a prerequisite for the N2,N2-dimethyl modification of G26, forming N2,N2-dimethylguanosine. Furthermore, the global structure of the tRNA was important for protein recognition of nuclear encoded tRNA including the tRNA m22G26 methyltransferase.
A decrease in tRNA synthesis affected by rapamycin correlates with an increase in m22G26 modification; the correlation is conserved from yeast to humans. As with many methyltransferases, the enzymatic synthesis of m22G26 is sometimes outperformed by the rate of transcription. The m22G26 modification is important for tRNA to translate the genetic code. The m22G26 and m2G, unlike other methylations in eukaryotes and archaeal organisms catalyzed by the S. cerevisiae N2,N2-dimethylguanosine methyltransferase Trm1, are thought to prevent Watson–Crick base pairing and to stabilize the three-dimensional core (reviewed in).
2.1.4. Position 48 and 49 (m5C48 and m5C49)
The 5-methylcytosine (m5C) modification, although present in both tRNAs and ribosomal RNA in eukaryotes and archaea, is absent in E. coli tRNAs but is found in the ribosomal RNA of E. coli. Plant mitochondria and chloroplast tRNAs are devoid of m5C, suggesting an independent evolution of organelle methylation in animals and plants. The most frequently occurring methylcytidines are found at positions 48 and 49 at the junction of the VL and TSL. Unlike the m1A9, m1G9, m2G10 and m22G26 methylations at positions 9, 10, and 26, respectively, on the 5’-side of tRNAs core at the junction of the accepting stem and D-stem, and the DSL and ASL stems, the methyl group of m5C48 and m5C49 located at the 3’-side of the tRNA’s core are not on the Watson–Crick H-bonding face of the nucleobase, and therefore do not physically interfere with canonical base pairing. RNA modification to m5C is important in regulating RNA metabolism and promotes tRNA stability and protein synthesis in eukaryotes and prokaryotes. The m5C modification is catalyzed by RNA-specific m5C-methyltransferases using the methyl donor, S-adenosyl-l-methionine (Ado-Met). In yeast Saccharomyces cerevisiae, a newly identified multisite-specific yeast tRNA methyltransferase (Trm4) was found to catalyze m5C modification at four different positions in tRNA. NSUN2 (cytosine-5 RNA methyltransferase) catalyzes the methylation of cytosines at C34, C48, C49 and C50. Although m5C is present in at least 34 different yeast tRNA species, in tRNALeuCAA it is uniquely found at the wobble position as well as at position 48 in the VL. Upon cellular oxidative stress, however, the distribution of m5C in the tRNA is dynamically changed, with a significant increase at the wobble position and a concomitant decrease at position 48, indicating a specific response to the stress.
In protozoan parasites, most tRNAs display at least one m5C at the junction between the VL and the TSL. The m5C modification leads to stabilization of the tRNA core in eukaryotes and archaeal tRNAs, but not in eubacteria. In Trypanosoma brucei, methylation of tRNAAspGUC, tRNAGlyGCC, tRNAValAAC and tRNATyrGUA was observed in residues at the TSL/VL junction including C48 and C49. In tRNATyrGUA both un-spliced and spliced molecules contained m5C48, indicating cytosine methylation can precede tRNA splicing, as had been reported for core modifications of bacterial precursor tRNAs.
In all tRNAs, nucleoside 48 in the VL forms a non-canonical base pair with the nucleoside 15 in the D-loop known as the ‘Levitt pair’ (Figure 3a,b). Instead of a canonical Watson–Crick base pairing, nucleosides 15 and 48 assume a reverse Watson–Crick (RWC) geometry or a trans arrangement. The RWC trans configuration brings together the DSL and VL, joining the two helical domains of tRNA (Figure 3a). In some tRNAs, one of the nucleosides in the Levitt pair, G15○C48, is modified. (The open circle, ○, denotes a non-canonical base pair.) The G15○m5C48 Levitt pair occurs in 26% of tRNA sequences found in databases. However, the physicochemical contribution of the methyl group has not been understood. The methyl of m5C48 increases the hydrophobic character of the nucleoside pair and may contribute to base stacking. In archaeal tRNAs, G15 is found to be modified to 7-formamidino-7-deazaguanosine (fa7d7G or ADG), known as archaeosine, G+ (Figure 2c). The imidino side chain of archaeosine has a distributed positive charge among the resonance of its conjugated double bonds. Positive charges at the junction of the amino acid accepting stem and the DSL are important to the transcription and subsequent proper tertiary folding of tRNAs. Both metal (Mg2+ ion) binding and chemical modifications have been suggested to be factors stabilizing the RWC geometry of the Levitt base pair. The Levitt pair interaction forms the innermost base-pair in the augmented D-stem and is usually between a purine nucleoside at position 15 and a pyrimidine at position 48 which form a non-Watson–Crick hydrogen bonding due to the trans orientation of the glycosidic bonds. However, a small number of species including E. coli tRNACys contain the unusual G15○G48 pairing (Figure 3c). Crystal studies have shown the G15○G48 pairing to have the same trans orientation as the conventional Pu15○Py48 Levitt pair.
The E. coli tRNACys core containing the unusual G15○G48 base pair joins the D- and V-loops and is very sensitive to alterations. This unusual Levitt pair may be important for aminoacylation by cysteinyl-tRNA synthetase, CysRS. Substitution to the more conventional G15○C48 pairing in tRNACys destabilized the tRNA’s core and decreased the catalytic efficiency of aminoacylation (kcat/Km) by nearly 100-fold. In contrast, the same substitution in E. coli tRNAAla had minimal effect on aminoacylation. The sensitivity of tRNACys to the Levitt pair substitution may be due to the unusual H-bonding between the O6 of G15 and N1 and N2 of G48 (Figure 3 b and c). The interaction of G15 O6 with the G48 N1 and N2 is different from the G15○C48 base pairing of other tRNAs. In other tRNAs the N1 and N2 of G15 are paired with O2 and N3 of G48, respectively. The triple base stacking of A46○(U8●A14) below the Levitt pair appears to be unique to this tRNA and along with several layers of base pairs above and below G15○G48, provides additional stacking interactions to stabilize G15○G48 in tRNACys. In addition, tRNACys contains the A9○(A13●A22) base triple. The 9○(13●22) pairing is rare and only found in Thermus thermophilus tRNASer as a G9○(G13●A22) base triple. Variations in the VL, alone and not the loops of the DSL or TSL rescued the kinetics of aminoacylation in the G15○C48 mutant of tRNACys.
2.2. Methylations of the Ribose 2’-OH and Nucleobase
Methylated nucleosides contribute to the stability of tRNA. Methylation can either occur at the 2’-hydroxyl of the ribose or on the nucleobase itself. The 2’-O-methylation (Figure 2e) induces the C3’-endo ribose conformation for both purine and pyrimidine nucleosides, stabilizing A-type helical regions within tRNA. In observations of 2’-O-methyluridine 3’-monophosphate (Ump), the C3’-endo conformation was found to be 0.67 kcal/mol more stable than the C2’-endo conformation; conversely, the C2’-endo conformation was slightly favored in the uridine 3’-monophosphate (Up) (Figure 2). Again, steric interactions are responsible for preference of Ump for the C3’-endo conformation. Repulsion between the 3’-phosphate and the 2’-O-methyl group forces the 2’-O-methyl group to orient itself toward the uracil base, in turn increasing steric repulsion between the 2’-O-methyl group and the 2-carbonyl group. Similar to the effect of the 2-thio modification on tRNA thermal stability, the 2’-O-methyl modification is favorable to thermophilic organisms whose tRNA structure must withstand high temperatures. Alternatively, the 2’-O-methyl modification has been proposed to protect tRNA from degradation at high temperatures; the tRNA would otherwise be increasingly susceptible to ribonuclease attack.
The incorporation of the 2’-O-methyl modification into thermophilic tRNA in vivo is temperature dependent. Bacillus stearothermophilus cultured at 70 °C experienced a three-fold increase in 2’-O-methylribose moieties in tRNA compared to the cultures grown at 50 °C. Similar results were obtained for Thermus thermophilus HB27 cultured at 80 °C and 50 °C in which the 2’-O-methylguanosine (Gm) content of tRNA in particular was compared. However, T. thermophilus tRNA (guanosine-2’) methyltransferase activity rather than tRNA content of Gm was affected by the temperature difference. Thus, the thermophilic enzyme may have evolved to be activated by high temperatures. Otherwise, at lower temperatures, T. thermophilus tRNA is sufficiently structured without an abundance of the 2’-O-methyl modification and methyltransferase activity is redundant.
The presence of methylated nucleobases, including N1-methyladenosine (m1A) and N7-methylguanosine (m7G) within tRNA proves to be beneficial to T. thermophilus as well. T. thermophilus HB27 lacking tRNA (m1A58) methyltransferase activity displayed a thermosensitive phenotype at 80 °C. T. thermophilus HB8 strain subjected to disruption of the tRNA (m7G46) methyltransferase (TrmB) gene also experienced a growth defect at temperatures above 70 °C. Moreover, tRNA from the ΔtrmB mutant strain was determined to increase the rate at which Gm18, m1G37, and m1A58 are formed by their respective modification enzymes. By promoting the formation of other stabilizing modifications, m7G46 is implicated in an intricate network of tRNA modifications.
The aforementioned methylated nucleobases are not exclusive to prokaryotic, thermophilic tRNA. The methyltransferase encoded by the WDR4 gene, for example, is responsible for the m7G46 modification within human tRNA. Mutation of arginine-170 to leucine on the WDR4 protein results in decreased levels of m7G incorporated into tRNA, and is correlated with microcephalic primordial dwarfism. The mechanism by which the diseased state of microcephalic dwarfism appears remains under investigation. The m1A58 modification is also common to tRNA from all domains of life. Within S. cerevisiae Δgcd10 mutants, the m1A tRNA modification was undetectable and the amount of mature initiator methionyl-tRNA (tRNAiMet) was reduced. However, neither syntheses of pre-tRNAiMet nor elongator methionine tRNA (tRNAeMet), which also contains the m1A58 modification, were affected. Thus, the m1A58 modification has been proposed to play a role in the maturation of pre-initiator tRNAiMet, preserving its unique tertiary structure in comparison to the structure of tRNAeMet. The introduction of a positive charge on A58, such that the A58○U54 reverse Hoogsteen base pair remains undisturbed, is thought to be the stabilizing force. In fact, both the positively charged m7G46 and m1A58 stabilize their respective base triplet and pairing interactions. Positively charged m7G46 and m1A58 are conserved across a substantial number of cytosolic tRNAs. In the gas phase, C13●G22○m7G46 is stabilized by 17.4 kcal/mol while T54○m1A58 and U54○m1A58 are stabilized by 6.8 kcal/mol and 6.2 kcal/mol, respectively, as compared to their unmodified analogs.
The reduction of the double bond linking C5 and C6 of uridine yields the dihydrouridine modification (Figure 2b). The dihydrouridine loop (D-loop) is a conserved region found in tRNAs in virtually all living systems. The presence of D within the D-loop plays an instrumental role in giving tRNA both its secondary and tertiary structure. The crystal structure of the nucleoside D was elucidated during the early 1970s. Within the crystalline network, D does not participate in base stacking due to its loss of aromaticity. The ribose of D was determined to favor the C2’-endo conformation, whereas that of the parent U favors neither the 2’- or 3’-endo conformation. The change in energy between the C3’-endo and C2’-endo conformations of pyrimidines is nominally only 0.5 kcal/mol. The small difference in energy between the C3’-endo and C2’-endo conformations allows for flexibility of the ribose but does not appear to be the case for modified nucleosides. In fact, the saturation of the C5=C6 double bond of the nucleobase gives rise to the C2’-endo conformation of the ribose; whereas methylation of the 2’-OH stabilizes the C3’-endo pucker. The interaction between a lone pair of O4’ and the pi antibonding orbital of the C5=C6 double bond of unmodified uridine is absent in D. The resulting increase in dihedral angle χ subsequently allows the ribose to assume the C2’-endo conformation. However, in solution, D was observed to induce a hairpin loop in the DSL reportedly due to its ability to engage in base stacking on only one side. Furthermore, a DSL lacking D at position 16 was unable to achieve a stable hairpin, instead interconverting between various conformations.
In a 2C9N2O6H14⦁H2O unit cell of a D crystal, one D molecule adopts a gauche–trans conformation around its C4’–C5’ bond, while the other preferentially adopts a trans–gauche conformation over a gauche–gauche conformation. Conversely, unmodified uridine strongly favors the gauche–gauche conformation over the trans–gauche and gauche–trans conformations. Although the D residue is asymmetric, both the gauche–trans and the trans–gauche conformations allow for the formation of loop structures within tRNA, while the gauche–gauche conformation of uridine is typically found within double helical structures. D has the potential to destabilize double-stranded, helical RNA due to its ribose conformation. When substituted into 21-nucleoside RNA duplexes, D disrupted the A-type helical structure and decreased the melting temperature of the duplexes.
In yeast tRNAPhe, among its 76 nucleosides there are twelve that assume the C2’-endo conformation. The 12 nucleosides with C2’-endo conformations occupy single-stranded loops and stretches. The single stranded regions of tRNA, though not involved in secondary structure motifs, are involved in tertiary interactions. The majority of nucleosides within both yeast tRNAPhe and tRNAAsp that adopt the C2’-endo conformation (seven out of twelve and seven out of ten, respectively) are located within or just outside of the D-loop. The minority of nucleosides adopting the C2’-endo conformation and located other than in the D-loop in tRNAAsp are afforded a greater degree of flexibility, resulting in a region in which base intercalation occurs. Evidently, the inclusion of D within the primary sequence of tRNA directly affects the arrangement of characteristic secondary and tertiary interactions.
The prevalence of D within tRNA is especially advantageous for psychrophiles, organisms that optimally grow at temperatures lower than 15 °C and cannot grow at temperatures above 20 °C. At these low temperatures, psychrophiles require modifications, such as dihydrouridine, that enhance tRNA flexibility. The tRNA digests of three psychrophilic bacteria were analyzed by liquid chromatography-mass spectrometry (LC-MS): ANT-300 (an unnamed species of Moritella) and Vibrio sp. strains 5710 and 29-6. The tRNA digests of the three psychrophilic bacteria contained between 1.33 and 1.46 times the D content of mesophilic E.
coli tRNA digests. Interestingly, the Tm of the psychrophilic tRNA (~77 °C) did not differ significantly from the Tm of E. coli tRNA (75 °C), suggesting that D contributes to local rather than global flexibility. Compared with psychrophiles, psychrotolerant organisms undergo optimal growth at temperatures above 15 °C; however, these organisms are viable at lower temperatures as well. The first study of tRNA modifications in psychrotolerant Archaea centered on Methanococcoides burtonii. Notably, the presence of D was confirmed within M. burtonii tRNA, while D was not detected within the archaeal hyperthermophile Stetteria hydrogenophila.
Conversely, thermophiles must contain modifications that confer stability onto their tRNA, effectively the opposite to the destabilizing function of D. Thermophilic tRNA containing 2-thioribothymidine at position 54 (s2T54) is resistant to denaturation (Figure 2b). The C=S bond of s2T is longer than the C=O bond of unmodified T, and the van der Waals radius of sulfur is larger than that of oxygen. The increased steric repulsion between the thiocarbonyl of the nucleobase and the 2’-hydroxyl of the ribose promotes the C3’-endo conformation. Additionally, the conformation around the C4’–C5’ bond is almost exclusively gauche–gauche. As previously stated, the gauche–gauche conformation is characteristic of helical regions of RNA. Two species of tRNAIle1 have been identified in Thermus thermophilus HB8 grown at 65 °C. The species, tRNAIle1a and tRNAIle1b, have identical sequences with the exception of the nucleoside at position 54: tRNAIle1a contains the modified nucleoside s2T54 while tRNAIle1b contains the unmodified T54. The Tm of tRNAIle1a (86.2 °C) was 2.9 °C higher than that of tRNAIle1b (83.3°C). Here is an instance in which a single modification has a profound impact on the thermostability of the entire tRNA molecule.
Pseudouridine (Figure 2b) is the most frequently occurring modified nucleoside in RNA. It is found in all three domains of life as well as in the organelles. Furthermore, Ψ is found in various RNAs including tRNA, rRNA, mRNA, and some small RNAs such as snoRNA. Existing uridines are modified into Ψ by pseudouridine synthases via a glycal intermediate. Ψ is an isomer of uridine, where instead of N1 of the base being linked to the sugar, the C5 of the base is connected to the sugar. Pseudouridine synthetase refashions U by cleaving the N1–C1’ glycosidic bond and produces a C5–C1’ bond without releasing the uracil nucleobase in forming Ψ residues in tRNA transcripts. The modified glycosidic linkage resulting in C5–C1’ bond slightly alters the chemical properties of the nucleoside that can cause drastic differences in structure and function. In tRNA, Ψ is found at various positions and three-dimensional environments, depending on both the isoacceptor as well as the organism (or organelle) itself. Nuckear magnetic resonance (NMR) studies of Ψ compared to U have shown that in the isomer the imino N1-H proton is in slow exchange with water, a property commonly associated with specific structural environments. The slow exchange of the imino N1 with water is an intrinsic property of pseudouridine rather than an effect caused by base pairing since the slow exchange of the imino proton has been shown to occur in a variety of contexts. While Ψ has been found at many positions of the anticodon stem and loop domain in a functional role, its occurrence in tRNA is most commonly found in a more structural role, especially in the TSL and certain positions in the ASL. Ψ is historically and most commonly associated with position 55 of the TSL in the sequence TΨCG.
Perhaps the most striking effect of Ψ appears to be its ability to enhance RNA stabilization through sugar puckering, where the RNA helix is stabilized by favoring the C3’-endo pucker. The altered puckering (Figure 2f), has been linked with enhanced stacking ability, explaining the increased stability pseudouridine modifications provide. In recent computational studies, Ψ did not strongly favor either C2’-endo or C3’-endo puckering, but a clear energy minimum was observed for the anti-conformation of the C5–C1’ between the base and ribose. The same effect has been observed in NMR studies of Ψ alone, whereas in an oligonucleoside the C3’-endo puckering is apparent. The energy barriers between C2’-endo or C3’-endo puckering and between the syn and anti-glycosidic bond angles for uridine and Ψ must be low for all combinations have been observed and may depend on the experimental conditions, sequence context and the biophysical approach taken. In particular, the context in which these nucleosides appear in an RNA sequence and its secondary and tertiary structures are most likely to influence the ribose backbone and the glycosidic bond conformations taken at any particular site. Computational as well as NMR studies indicate that Ψ induces C3’-endo puckering in its neighbors, therefore enhancing the rigidity of the local structure, and hence stabilizing the A-form RNA. The inherent ability of Ψ to stabilize RNA structures through enhanced C3’-endo sugar puckering and enhanced base stacking compared to uridine has been extensively demonstrated experimentally through circular dichroism (CD), NMR, and UV-melting studies.
Studies on pseudouridine synthetase (Pus1) from S. cerevisiae indicate the importance of tertiary base pairing in Pus1 tRNA recognition. The presence of a DSL and the non-canonical G26○A44 mismatch pair near the target uridine 27 is important for Pus1 tRNA high affinity recognition, resulting in an increase in association rate constant (Ka) by a factor of 100 and commensurate decrease in the equilibrium constant for the reaction. In addition to enhancing the local stability of its neighboring nucleosides, Ψ contributes to the stability of tRNA’s three-dimensional structure. The D-loop and T-loop interact in the tertiary structure through several base pairs, including G15●C48 (Figure 3) and G19●C56 and G18○Ψ55 (not shown in Figure 3). The m1A58○T54 base pair is sandwiched between the two aforementioned base pairs where the modified nucleosides are crucial for enhanced structural stability. The stability is further increased if T54 is replaced by s2T54.
3.1. Modifications at Position 34
The most numerous, and most chemically varied, of the post-transcriptional modifications to tRNA occur in the ASL domain, particularly at positions 34, the first position of the anticodon and position 37, 3’-adjacent to the anticodon. Modifications of positions 34 and 37 have been implicated in a wide range of functional roles, including alteration of tRNA conformation and thermal stability, enhancement of ribosome binding, promotion of mRNA translocation during translation, augmentations of mRNA decoding specificity, maintenance of the mRNA reading frame and the pre-structuring of the ASL into the canonical U-turn structural motif for ribosomal A-site entry.
Position 34 in the tRNA ASL is sometimes referred to as the “wobble” position in reference to Francis Crick’s 1966 Wobble Hypothesis. The Hypothesis states that a guanosine, uridine or inosine occupying position 34 has the ability to base pair to two or three different nucleosides. This ability of nucleosides at position 34 of the anticodon to bind to non-canonical bases or “wobbling” addresses the degeneracy of the genetic code. The concomitant existence of fewer tRNAs than possible mRNA codons, means a single tRNA species decodes more than one synonymous codon differing in the identity of the residue at the third position. Twenty-five years later, the Modified Wobble Hypothesis expanded Crick’s original postulate. The Modified Wobble Hypothesis took into account evidence of codon recognition being restricted or expanded by the presence of a chemical modification to the wobble nucleoside, allowing the anticodon–codon interaction to be fine-tuned.
The presence or absence of modifications to position 34 uridines displays a particularly clear effect on wobble decoding. In certain cases, an unmodified position 34 uridine permits even broader codon recognition than the Wobble Hypothesis predicts; many mitochondrial and plastid tRNAs with a wobble uridine can decode all four codons in a single codon box. In general, however, cytosolic tRNAs across species require modification to a wobble position uridine in order to recognize any nucleoside in the third position of the mRNA codon (other than its exact cognate A). Eighty-four percent of the sequenced cytosolic tRNAs carry modifications at position 34. Modifications at position 34 have been shown to restrict or expand the decoding ability of the tRNA.
Unexpectedly, most post-transcriptional modifications to wobble nucleosides occur at position 5 of a pyrimidine, on the opposite site of the nucleobase from the Watson–Crick face. Modification to the Watson–Crick face is predicted by the original Wobble Hypothesis to facilitate wobble pairings. Modifications other than those to the Watson–Crick face must, therefore, affect the chemistry at the anticodon–codon interface indirectly, by influencing the overall conformation and structure of the nucleoside. One structural effect of 5-position modifications is the biasing of the ribose sugar moiety of the nucleoside toward either a C3’- or C2’-endo conformation; the C3’- or C2’-endo sugar pucker have been shown to restrict or expand, respectively, the overall decoding capacity of the tRNA.
The restriction of decoding is frequently coupled to substitution of the 2-position keto group with a thio group along the Watson–Crick face. Modification of the Watson–Crick face with a thio group may affect the hydrogen bonding chemistry of uridine as a proton donor in addition to stabilizing the C3′-endo conformer. For example, modifications in the 2-thio-5-methyluridine (xm5s2U) family at position 34 bias the nucleoside toward its C3′-endo form due to the large van der Waals radius of the sulfur atom. The larger van der Waals radius of the sulfur atom sterically repels the ribose 2′-oxygen atom. The 2-thiolated modified uridines typically recognize only adenosine as the third letter of the codon. By contrast, modified nucleosides in the 5-hydroxyuridine (xo5U) family can adopt the C2′-endo as well as the C3′-endo form, and they expand wobble recognition to include adenosine, guanosine, and uridine.
Structural studies also indicate that the larger modifications to position 34 uridines also operate to directly influence the stability of base pairs at the anticodon–codon interface. The 5-taurinomethyluridine (τ5mU34) modification is a member of the xm5U family of modifications. The τ5mU34 modification is found in several mitochondrial tRNAs (Figure 2b) and its absence from tRNALeu(UUR) and tRNALys is implicated in MELAS (mitochondrial encephalopathy, lactic acidosis and stroke-like episodes) and MERRF (myoclonic epilepsy with ragged red fibers) syndromes. Together with its orthologous E. coli modification 5-carboxymethylaminomethyluridine (cmnm5U34), the τ5mU34 modification has been shown to play a role in codon decoding by stabilizing U34●G3 wobble base pairs. Both crystallographic and computational studies of the τ5mU34-modified ASL of mitochondrial tRNALeuUAA have revealed stabilized hydrogen bonding interactions between the sulfonic acid and secondary amine moieties of the modification and nucleosides U33, A35 and A36, as well as adoption of a non-Watson–Crick U34●G3 geometry thought to enhance stacking. Likewise, 2-thiouridine geranylation (Figure 2b) in bacteria facilitates tRNAGluUUC and tRNALysUUU formation of stable Watson–Crick U34●G3 wobble base pairs at the anticodon–codon interface. The large hydrophobic terpene geranyl group covalently modifies the position 2 thio group of 2-thiouridine inducing a loss of protonation at the N3 atom of U34.
Adenosine in position 34 of the anticodon is nearly universally modified to inosine an analog of guanosine. Inosine binds cytosine in the third position of the codon, as well as uridine and adenosine according to the predictions of the Wobble Hypothesis. The prevalence of this A34-to-I34 modification is so complete that only four exceptions are noted: tRNAThrAGU of Mycoplasma capricolum and Mycoplasma mycoides; and mt tRNAArgACG of S. cerevisiae and A. suum. In eukaryotes, all four-fold degenerate codon boxes are decoded by at least one tRNA isoacceptor that contains a wobble-position inosine.
The increased spatial extent of an inosine–adenosine (I34○A3) pair in the wobble position of the codon on the ribosome led to significant early speculation on the preferred geometry adopted by this purine○purine pair. Crick originally proposed traditional Watson–Crick base pairing geometry for the I34○A3 pair without addressing the additional 1.6 A of space required of an inosine–adenosine base pair. An alternative hypothesis suggested that the inosine adopts a syn
N-glycosidic bond conformation, exposing its Hoogsteen face for binding to A3 rather than its canonical Watson–Crick chemistry and better allowing the base pair to occupy a space intended for a smaller purine●pyrimidine interaction. However, an x-ray crystal structure of inosine-modified E. coli tRNAArgICG bound to its wobble CGA codon in the A-site of the 30S ribosome unambiguously demonstrated that I34○A3 adopts traditional Watson–Crick geometry. The broader spatial extent of an I34○A3 pair is accommodated by a compression of the I34 ribose and phosphate moieties between C1′ and P of that residue, where the β (P-O5′-C5′-C4′) torsion angle, in particular, changes from about 180° to −37.8°.
Apart from its role in tRNA decoding, A-to-I deamination also plays an important role in mRNA editing, with hundreds of millions of examples in eukaryotic transcriptomes. In the context of tRNA decoding of mRNA codons, A-to-I editing of mRNA coding regions alters the coding potential by tRNAs. Where inosine resides in the codon, wobble base pairing occurs in a non-traditional reverse direction; accuracy and efficiency of decoding could potentially be affected by tRNA modifications of the anticodon domain. The A-to-I deamination reaction in mRNA is catalyzed by a specialized family of adenosine-deaminases-acting-on-RNA proteins (ADARs), whose A-to-I editing function acts on double-stranded coding regions of mRNA as well as non-coding introns, the 3′-untranslated regions of mRNAs and in microRNAs. The A-to-I editing can alter the splicing profile and miRNA-mediated gene silencing of altered transcripts, providing another level of control of gene expression. Mutations in ADARs have been associated with human pathology, including cancer, neurological disorders and defects in innate immunity.
Modifications at position 34 have also been shown to enable biologically relevant mismatch base pairing, particularly at the wobble position, by permitting tautomerism of the modified nucleobase (Figure 4). In human and E. coli tRNALysUUU, a 5-methoxycarbonylmethyl-2-thiouridine or 5-methylaminomethyl-2-thiouridine modification at position 34 (mcm5s2U34 or mnm5s2U34), respectively, allows the tRNA to recognize the non-cognate wobble AAG codon. The modification has been shown in crystal structures to shift the uridine keto–enol equilibrium to the enol form due to an alteration of the electronic structure of the uridine ring. The enol tautomer permits the G●U base pair to maintain Watson–Crick geometry in the ribosomal A-site and avoid steric clashes (Figure 4a;). A similar keto–enol tautomerization is observed for E. coli tRNAVal3UAC with cmo5U34. The modification also permits the wobble uridine to adopt its enol form for Watson–Crick mismatch base pairing with a guanosine. Tautomerism enables a G●U mismatch in the wobble position of the codon–anticodon duplex to adopt the geometry of a Watson–Crick pair. The modification-facilitated geometry of a Watson–Crick base pair is thought to allow such a G●U mismatch to escape discrimination by the translational fidelity mechanism. In crystallographic studies of human mitochondrial tRNAMetCAU bound to the non-cognate AUA codon on the ribosome, a 5-formylcytidine modification at position 34 (f5C34) also appears as the rare imino-oxo, (rather than the common amino-oxo) tautomer of cytidine. An NMR study of rare, transient mismatch base pairs involving unmodified A, G, C and U has shown that anionic, as well as tautomeric forms, can occur and be important to the chemistry and structure of RNA. Observation of a possible imino-oxo tautomerization suggests again a role in vivo for the modification to shift the tautomeric equilibrium toward the imino form. In the imino form, the wobble f5C34○A3 mismatch base pair is stabilized in Watson–Crick geometry (Figure 4b). Thus, in order for tRNA to effectively fulfill its role in an expended wobble recognition, certain position 34 modifications facilitate in vivo the Watson–Crick geometry of anticodon–codon mismatches through tautomerization and possibly anionic forms.
3.2. Modifications at Position 37
Modifications at position-37 contribute both stereochemistry and favorable energy to formation of the Watson–Crick base pair. Accuracy and efficiency in vivo is dependent on a number of factors: codon bias and usage; energetics of base pairing (e.g., A●U, U●A vs. G●C and C●G); codon context or nearest neighbor nucleosides and codons; tRNA species concentration and thus competition for codon. Of considerable importance to accuracy and efficiency are the post-transcriptionally modified nucleosides at position-37, 3′-adjacent to the anticodon, as well as at the anticodon wobble position-34. There are a number of significant advantages to modification of the invariant purine nucleoside at position-37. Modification of purine-37 negate cross-loop base pairing. Early evidence points to the closing of the unmodified anticodon loop by H-bonding. An open anticodon loop facilitates and frees the last two anticodon bases for canonical base pairing, and the first base for canonical and wobble pairings. Hydrophobic modifications at position-37 are supportive of the 3′-stack of the anticodon facilitating anticodon presentation to the codon. The modifications of A37 produce a hydrophobic platform. The hydrophobic platform is particularly noteworthy for it envelops the first anticodon–codon base pair of those tRNAs responding to codons starting with A1. Position-37 purine modifications help pre-structure, order, the anticodon domain for the recycling of the tRNA without investment of energy and chemistry to regain the conformation and dynamics for the integrity of codon recognition. The physicochemical contributions of modified purine nucleosides-37 collectively maintain the translational reading frame in all kingdoms of life by supporting correct formation of the critical first base pair, A36 or U36 hydrogen bonding to U1 or A1 respectively, of the mRNA codon. There are human disease associations to mutations in the enzymes responsible for the position 37 modifications. For instance, mutations of the human protein CDK5 regulatory subunit associated protein 1-like 1 (Cdkal1), the enzyme responsible for the ms2 modification of N6-threonylcarbamoyladenosine, t6A37 (Figure 2a), are associated with lack of processing of proinsulin and decreased secretion of the mature insulin. Interestingly, a new reactive species, cysteine hydropersulfide (CysSSH) has been reported as substrate and thio-donor for this enzyme with suppression of CysSSH production associated with a decrease in insulin secretion. At least one of the purine-37 modification enzymes, human tRNA isopentenyltransferase, TRIT1, has a direct consequence on human health for it is a tumor suppressor, whereas modification of G37, 3′-adjacent to A36 in phenylalanine tRNAs, may actually promote frameshifting in some codon contexts.
When the first codon base is U1 or A1, the responding tRNA anticodon with an A36 or U36, respectively, has a 3′-adjacent modified A37. The modification can be of extraordinary chemistry and structure, such as N6-isopentenyladenosine, i6A and its derivatives (e.g., 2-methylthio-N6-isopentenyladenosine, ms2i6A); N6-threonylcarbamoyladenosine, t6A and its derivatives (e.g., 2-methylthio-N6-threonylcarbamoyladenosine, ms2t6A). Phenylalanine tRNAs, which respond to codons beginning with U, have G as the 3′-adjacent purine-37 instead of an A37. Still, tRNAPhe G37 is extensively modified. The G37 of tRNAPhe from Archaea, Eukarya and even mitochondria is modified to a tricyclic wyosine/wybutosine, imG/yW, (3H-Imidazo[1,2-α]purine-7-butanoic acid, 4,9-dihydro-α-[(methoxycarbonyl)amino]-4,6-dimethyl-9-oxo-3-β-d-ribofuranosyl-, methyl ester, (αS)-) or a derivative, the biosynthesis of which begins with a simple N1 methylation.
The elaborate N6 modification of A37 with i6 or t6, and of G37 as imG37 or yW37 deny stable cross-loop hydrogen bonding by position 37 nucleosides. All tRNAs responding to codons beginning with U1 or A1 have the invariant U33 at the U-turn otherwise capable of cross-loop H-bonding between U33 and A/G37. The rotational freedom of t6A about the N11–C12 bond results in steric hindrance, preventing the incorporation of the nucleoside into the helix and therefore, abrogating hydrogen bonding with U33. The open loop afforded by the modification is necessitated by the geometry of Watson–Crick base pairing of the anticodon to the codon. Both t6A37 and imG/yW37 have either non-covalently or covalently formed tricyclic bases. The tricyclic imG/yW37 are seen as platforms above the U36-A1 or A36-U1 anticodon–codon base pairs. The cyclized ester of t6A37 (ct6A37) in the oxazolone ring form has been found in a number of species and is a stable derivative enzymatically closed. However, a hydantoin isoform was determined by x-ray crystallography. The hydantoin form may be of considerable consequence to the decoding of mRNA codons beginning with A. It is found in E. coli tRNA and may not be able to lend support to the first anticodon–codon base pair due to possible twisted conformations that are not co-planar with the adenine nucleobase. Position 37 modifications contribute considerable energetic stability to the first base pair through van der Waals forces and hydrophobic interactions. In contrast to t6A37, yW37 in some eukaryotes may, in the context of particular mRNAs, contribute to frameshifting, whereas the unmodified, hypomodified, nucleoside will maintain the translational reading frame.
tRNAs with anticodons ending in G36 or C36 respond to codons starting with C1 or G1 and may or not have a modified a A37 or G37. The G36●C1 and C36●G1 first base pairs with additional hydrogen bonding are energetically more favorable in comparison to U36●A1 and A36●U1. In addition, the purine-37 stacks well with G36●C1 and C36●G1 supporting the Watson–Crick base pairing. However, methylation of the N1 of G37 suppresses +1 shifts in the translation reading frame. tRNA species for leucine, proline and arginine that respond to codons beginning with C1 are modified to m1G37. tRNAProUGG with the anticodon UGG is particularly susceptible to +1 frame shifts, and the tRNAProGGG species requires m1G37 and the aid of the translational factor EF-P (elongation factor P) to suppress such frameshifts. The modification has been found in human mitochondrial tRNAPro. The m1G37 modification occurs in all tRNAs responding to codons beginning with C1. Mutations in the N1-G37 human methyltransferase, TRMT5, are associated with cancers. TRMT5 methylates G37 whether there is a G36 or C36; the bacterial counterpart methylates only G37 with an adjacent G36. It appears that the modification of the mitochondrial G37 is also associated with an oxidative phosphorylation disorder.
In summary, the post-transcriptional modification of the position-37 invariant purine takes two forms. The modifications of A37 in tRNAs responding to codons beginning with A1 and to some extent also those beginning with U1 improve base stacking for the 3′-stack of the anticodon into the 3′-side of the anticodon loop and into the 3′-side of the anticodon stem. Van der Waals forces, hydrophobic interactions and their solvent-mediated effects of t6A37 and i6A37 contribute favorable energetics to stabilizing the base stacking and supporting a conformation advantageous to the Watson–Crick base pairing of the decoding of the first two base pairs. The G37 modification to imG37 and yW37 of tRNAPhe responding to UUU and UUG contributes likewise. In contrast, the less extensive methylation of G37 to m1G37 of tRNAs responding to codons beginning with C1 would not be expected to greatly enhance base stacking. However, purine-37 modifications tend to negate intraloop hydrogen bonding and thus, promote the open loop conformation requisite to the canonical base pairing of the anticodon with the codon. There is little free energy lost in the difference between base hydrogen bonding to one another base or to water. Stacking interactions and solvent displacement drives RNA tertiary structure. Purine-37 modifications maximize anticodon loop stacking and contribute greatly to the desired conformation of the loop structure for mRNA decoding.
3.3. Metal Ions and the Modification of the tRNA Anticodon Domain
There is little in the literature expounding upon the importance of metal ions to the contributions of modified nucleoside chemistry and structure to tRNA, certainly in comparison to the extensive literature on the global effect of Mg2+ on RNA folding. In some of the earliest papers on modifications and metal ions, the inherent fluorescence of the modification wyosine in yeast tRNAPhe is used to detect the effects of metals on the anticodon structure. The fluorescence of the lanthanide metals, such as europium (III), was used to monitor metal binding to specific species of E. coli tRNA. These papers confirmed the presence of modest strength, metal binding sites, particularly in the anticodon loop, as seen in crystal structures of tRNAs. In studies of the ASL of yeast tRNAPhe with only the naturally-occurring methylated nucleosides, and m1G37, the precursor to yW37, the presence of m5C and even the deoxynucleoside dm5C40, modulated cooperative Mg2+ binding and enhanced the stability of the ASL. Base modifications were found to stabilize or possibly strengthen the binding of Mg2+ to E. coli tRNAVal. Studies of unmodified yeast tRNAPhe implicated modified nucleoside contributions of chemistry and structure to Mg2+ binding and to the correct folding of the fully modified tRNA, thus highlighting the limitations of studying unmodified tRNA. The truncated sequences of mitochondrial tRNAs require modifications in their core regions to facilitate folding into a canonical L-shaped tRNA structure. At least in the folding of human mitochondrial tRNAMet there appears to be an unpredicted Mg2+-dependent unfolded state of the 5′-side of the molecule followed by a modification assisted and correct interaction with the 3′-side. The data supports an in vivo model of mitochondrial tRNA transcription and folding in which the truncated DSL remains unfolded until the 5′-side of the molecule is transcribed, and perhaps modified. Modified nucleosides with free amino or acid moieties would be expected to bind metal ions. Studies have shown that metal coordinates to the free acid of the threonine of t6A in the anti-conformation, and that Mn2+ binds the free acid of uridine 5-oxyacetic acetic. As would be expected, the binding of Mn2+ and Co(NH3)63+ to the ASL of E. coli tRNAPhe neutralizes charge, as would Mg2+. Charge neutralization facilitates formation of the ubiquitous U-turn even with an unmodified ASL but when accomplished in the presence of the modified nucleoside, i6A37, competing ASL conformers are destabilized. It is evident from the study of the modification archaeosine (position 15:,7-formamidino-7-deazaguanosine) that some modifications may, in effect, replace the need for Mg2+ to achieve the functional folding of tRNA, particularly with regard to the Levitt base pair, nucleosides at positions 15 and 48 (Figure 3).
3.4. Modifications at Position 32
While modifications at position 32 in the tRNA ASL are less commonly considered than those arising at positions 34 and 37, more than 30% of known tRNA species carry a position-32 modification. The nucleoside occupying position 32 is nearly always (>99% of instances) a pyrimidine. Position 32 is located just below the stem region of the anticodon stem and loop and across from the residue at position 38, which is usually unmodified and less strictly conserved. A non-canonical interaction between the position-32 and position-38 nucleosides forms in ASL regions adopting the U-turn structural motif for ribosomal A-site binding, and exhibits a classic bifurcated hydrogen bond between the O2 of the pyrimidine at position 32 (Y32) and a hydrogen bond acceptor, typically an amine, in the nucleoside at position 38 (N38) in tRNA. This cross-loop base pair interaction significantly affects the ability of the tRNA to engage in codon discrimination, and is also suggested to affect intron splicing and aminoacylation and to modulate affinity of the tRNA for the ribosomal A-site. The cross-loop base pair is stabilized by stacking interactions with nucleosides in the ASL stem, requiring the nucleosides at positions 32 and 38 to adopt the C3′-endo sugar pucker found in A-form RNA helices. The modifications found at position 32, preponderantly 2-thiocytidine or 2′-O-methylated pyrimidines, have been shown to stabilize the C3′-endo sugar conformation to facilitate base stacking with the ASL stem.
The 2-thiocytidine (s2C) modification at position 32 in E. coli tRNAArg1ICG has also been shown to be a direct determinant of wobble decoding efficiency. tRNAArg1ICG and tRNAArg2ICG differ only in their position-32 modification status; unmodified tRNAArg2ICG binds CGC, CGU and CGA arginine codons in agreement with the predictions of the Wobble Hypothesis, but s2C32-modified tRNAArg1ICG is incapable of wobble binding to CGA. The structural basis of the restrictive effect of 2-thiocytidine at position 32 stems from a tendency of the thio modification to contribute to a global, destabilizing dehydration of the ASL under the structural conditions where its conformation has already been perturbed to adapt to a spatially broad adenosine–inosine purine–purine base pair in the wobble position of the anticodon–codon interface.
In many cases, tRNA species modified at position 34 also contain modifications at position 32 (67%), position 37 (82%), or both (56%). Position 32 modifications with position 34 modifications may function in tandem to affect tRNA anticodon loop architecture and function. For instance, it is possible that the thionyl group of the wobble position 34 of 2-thiouridine in E. coli tRNALysSUU may be protected from oxidation through direct or indirect interaction with an N6-threonylcarbamoyladenosine at position 37, t6A37. In tRNAArg1,2ICG, the 2-thiocytidine and 2-methyladenosine modifications to positions 32 and 37 act to modulate wobble recognition by I34 of adenosine in the third position of the anticodon. The pre-structuring of the anticodon domain architecture toward the U-turn structural motif for energetically favorable insertion into the ribosomal A-site, mRNA binding and translocation often requires the simultaneous contributions of several post-transcriptional modifications in the ASL. The U-turn is stabilized by a hydrogen bond between the 2′-OH of U33 and, when the residue at position 35 is a purine, its N7 atom; modifications are important in allowing tRNAs with position 35 pyrimidines, such as tRNALysUUU with mnm5s2U34 and t6A37, to also adopt U-turns. This motif of interactive contributions of a suite of modifications to positions 32, 34 and 37 of the anticodon stem and loop region seems likely to be in evidence across an array of tRNAs and species.
Modified nucleosides contribute to tRNA folding, structure, and function in numerous ways through their abilities to induce and stabilize the tRNA core, create and pre-structure the most efficient ASL conformation for an accurate response to mRNA codons on the ribosome and act as determinants for recognition by macromolecules due to the unique chemical environment they can provide. Modifications of the ASL in or around the anticodon affect translation. For instance, yeast strains lacking I34 at anticodon wobble position 34 are inviable, while those lacking m1G37 and t6A37, 3′-adjacent to the anticodon, grow poorly. Modifications in the core of tRNA affect tRNA stability and folding. Thus, the lack of 1-methyladenosine at position 9 (m1A9) in human mitochondrial tRNALys leads to an alternate elongated structure, 2′-O-methylation being supportive of the 3′-endo nucleoside conformation its absence decreases the probability of the 3′-endo conformer, and the lack of pseudouridines destabilize helices of the core. Some modifications at certain positions affect tRNA identity in protein recognition. In yeast, 2′-O-ribosyladenosine (phosphate), Ar(p), at position 64 in the stem of the TSL is an initiator tRNAMet identity element. The modified nucleosides at positions within the tRNA’s core at the junction of the stems forming the cloverleaf secondary structure tend to negate Watson–Crick, canonical base pairing and have such an important role in the structure and function of the tRNA molecule that mutations at these locations impact human health.
Modified nucleosides with a wide array of chemical moieties enhance the properties of tRNA where both local and global environments can be changed; they pre-structure the molecule both globally and locally for accurate and efficient codon recognition and ribosome binding. The traditional L-shape of tRNA is adopted through highly conserved modifications to the tRNA core enabling a correct global fold to take place. Modified residues at both the D- and T-arm facilitate the three-dimensional interactions through conserved interactions. The D-loop is formed largely due to the increased structural flexibility provided by D, whereas the T-loop often contains highly stabilizing residues such as Ψ and T.
The heightened use of stabilizing nucleosides, such as Ψ and even s2T, or on the other hand destabilizing ones such as D, allows organisms to adapt tRNA dynamics to various environments in which life may not be feasible with the limited chemistry provided by only the four major nucleosides. The enhanced C3′-endo sugar puckering facilitated by both Ψ and s2T stabilize the A-form helix commonly adopted by RNA at 37 °C and enhance the viability of thermophiles at higher temperatures. In contrast, the presence of D induces C2′-endo sugar puckering destabilizing the helix, hence increasing structural flexibility due to reduced base stacking. The increased conformational dynamics of the tRNA has enhanced the adaptation of psychrophiles to their inhospitably cold environments.
The variety and complexity of modifications found in the ASL and especially at positions 34 and 37 is astounding. The rich variation in chemistry allows for a large number of mechanisms to enable decoding of even thermodynamically weak codons, as well to expand or restrict the recognition of codons. The wobble position is frequently modified regardless of the major base found in the primary sequence, with adenosine and uridine being modified in the majority of cases. With very few exceptions, adenosine is modified to inosine at the wobble position 34 of tRNAs, allowing wider codon recognition through traditional I○A, purine–purine, and I●U/C purine–pyrimidine, Watson–Crick base pairing.
Uridine modifications at position 34 exhibit a rich variety of chemical moieties that can significantly alter the thermodynamics and conformation of the nucleoside. Long carbon chains containing heteroatoms account for properties such as keto–enol and amine–imine tautomerization, ionization, steric hindrance, and increased hydrophobicity. These properties of the long chain modifications often enhance either hydrogen bonding with the cognate codon, enhance base stacking, or restrict binding to non-cognate codons. Additional modifications such as 2′-thiolation further enhance properties such as C3′-endo sugar puckering, therefore increasing stacking favorability. The modifications at position-34 often work concomitantly with modifications at position 37.
In the decoding of mRNA, modifications of tRNA’s position-37, 3′-adjacent to the anticodon, stabilize and facilitate formation of the important canonical first base pair in anticodon–codon interaction. The conformation of the ribosome is altered upon tRNA entering the A-site, restricting the first two nucleosides of the codon to forming Watson–Crick base pairs with the tRNA anticodon. Decoding of the first codon base becomes a decisive factor to the accuracy and efficiency of reading the second and third codon bases. tRNA binding the first and second codon bases in a canonical fashion is energetically favored and non-canonical base-paired tRNAs that are less energetically supported are released. Thus, cognate Watson–Crick reading of the first two codon bases leads to efficient and effective binding of the correct tRNAs with few near-cognate mismatches. An accurate first base pair interaction of tRNA with one of the 64 genetic codes would be expected to reduce the potential for a decoding mistake to 1 in 4. Followed by an accurate selection at the second base-pair, there would be a reduction in error to 1 in 16. Formation of the first two canonical base pairs is favored by at least the sum of the hydrogen bond energies of the two base pairs (~−3 kcal/H-bond), ~−12 to −18 kcal. However, an error rate of 1 in 16 is such a large number of mistakes in amino acid incorporation that a protein of median length, 400 amino acids and average molecular size of 44,000 daltons, would have 25 incorrect amino acids. Life as we know it would not be supportable with that error rate. Sustained viability on Earth requires a more energetically favored, structured cycle of tRNA recognition and binding to mRNA codons. Thus, a simplistic purview of the 64 codons of the Universal Genetic Codes and the energy contributions of hydrogen bonds is quite deceiving because many factors greatly improve the mathematical odds of a correct anticodon–codon interaction. In vivo analyses of codon recognition are 10,000-fold more accurate, probably <4 errors in 104 amino acid incorporations.
Efficient and exact decoding of mRNA is made possible by the rich chemical variation provided by modified nucleosides, especially at the anticodon loop positions 32, 34, and 37. Modifications at positions 32, 34, and 37 facilitate the efficient decoding of pyrimidine rich anticodons by both enabling the formation of the U-turn and enhancing the stability of the anticodon–codon interaction. The half-life of tRNA has been estimated to be from hours to days and the post-transcriptional modifications only add to that time span. Thus, an individual tRNA species responds to its cognate and wobble codons innumerable times during this period. The minimum free energy structure of an unmodified tRNA may not be globally, and specifically at the anticodon loop, structurally fashioned to meet the requirements of the ribosome entry site or the restrictions to translocation. As discussed, the modifications of the tRNA core contribute to the tertiary folding of tRNA into the canonical and functioning three-dimensional shape. For the most part, modifications in the core, methylations in particular, negate Watson–Crick base pairing resulting in a three-dimensional structure with non-canonical base pairs and base triples of varying stability. The modifications of the anticodon stem and loop enable some 40 tRNAs to read the 61 amino acid codons, and yet retain a common conformation and dynamic acceptable to translation. Thus, with few exceptions, every time an unmodified or under modified tRNA responds to its codon, it may have to undergo an investment of energy and time in restructuring to meet ribosome requirements. Modifications engineer and pre-structure the tRNA architecture and alter its dynamics in the direction of ribosome acceptance. Therefore, the modifications discussed in this review enable the correct folding, and accurate and efficient functioning of tRNA in its central role in biology.