Copyright 1999 by the Genetics Society of America
Archaeal Aminoacyl-tRNA Synthesis: Diversity Replaces Dogma Debra Tumbula,* Ute C. Vothknecht,* Hyun-soo Kim,* Michael Ibba,‡ Bokkee Min,* Tong Li,* Joanne Pelaschier,* Constantinos Stathopoulos, Hubert Becker* and Dieter So¨ll† *Department of Molecular Biophysics and Biochemistry and †Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520-8114 and ‡Center for Biomolecular Recognition, IMBG, The Panum Institute, DK-2200 Copenhagen N, Denmark Manuscript recieved March 25, 1999 Accepted for publication May 5, 1999 ABSTRACT Accurate aminoacyl-tRNA synthesis is essential for faithful translation of the genetic code and consequently has been intensively studied for over three decades. Until recently, the study of aminoacyl-tRNA synthesis in archaea had received little attention. However, as in so many areas of molecular biology, the advent of archaeal genome sequencing has now drawn researchers to this field. Investigations with archaea have already led to the discovery of novel pathways and enzymes for the synthesis of numerous aminoacyltRNAs. The most surprising of these findings has been a transamidation pathway for the synthesis of asparaginyl-tRNA and a novel lysyl-tRNA synthetase. In addition, seryl- and phenylalanyl-tRNA synthetases that are only marginally related to known examples outside the archaea have been characterized, and the mechanism of cysteinyl-tRNA formation in Methanococcus jannaschii and Methanobacterium thermoautotrophicum is still unknown. These results have revealed completely unexpected levels of complexity and diversity, questioning the notion that aminoacyl-tRNA synthesis is one of the most conserved functions in gene expression. It has now become clear that the distribution of the various mechanisms of aminoacyltRNA synthesis in extant organisms has been determined by numerous gene transfer events, indicating that, while the process of protein biosynthesis is orthologous, its constituents are not.
HE synthesis of aminoacyl-tRNAs is one of the core reactions of gene expression. By matching the anticodon trinucleotide of a tRNA with a particular amino acid, the aminoacyl-tRNAs allow the translation of a series of mRNA codons into a corresponding polypeptide sequence. These small RNAs, which have an amino acid attached to the adenosine at their 39-end, are the sole substrates for ribosomal translation of mRNA. Consequently, the fidelity of translation is highly dependent on the accuracy of aminoacyl-tRNA synthesis. The majority of aminoacyl-tRNAs are directly synthesized by the aminoacyl-tRNA synthetases, a family of 20 enzymes (1 discrete enzyme per amino acid-tRNA pair) that attach the cognate amino acid to the tRNA. However, in a number of organisms and organelles, glutaminyl- or asparaginyl-tRNAs are formed in an indirect pathway (a tRNA-dependent amino acid transformation) that requires the modification of an incorrectly charged amino acid while bound to tRNA (see below). The aminoacyl-tRNA synthetases have been studied in considerable detail, resulting in a clear picture of the mechanisms that ensure that only a particular amino acid is attached to certain tRNAs. With only a few notable exceptions, the essential nature of the aminoacyl-tRNA synthetases had long been assumed to limit their vari-
Corresponding author: Dieter So¨ll, Department of Molecular Biophysics and Biochemistry, Yale University, P.O. Box 208114, 266 Whitney Ave., New Haven, CT 06520-8114. E-mail: [email protected]
Genetics 152: 1269–1276 (August 1999)
ability in living organisms. This assumption was initially borne out when no fundamental difference was seen between the aminoacyl-tRNA synthetases of numerous organisms as diverse as Escherichia coli, yeast, and human. Not surprisingly, until recently most studies of the aminoacyl-tRNA synthetases had been mainly confined to these model organisms. In particular, very little information had accumulated on the mechanisms of aminoacyl-tRNA synthesis in archaea—and none of the data had suggested that studying this process in such notoriously intractable organisms would be a particularly worthwhile endeavor. Any such preconceptions were destroyed after the publication in 1996 of the first genomic sequence from an archaeon, that of Methanococcus jannaschii (Bult et al. 1996). The sequence of the M. jannaschii genome provided many surprises, among them the observation that the genome encoded only 16 known aminoacyl-tRNA synthetase homologs. The “missing” synthetases were the asparaginyl- (AsnRS), cysteinyl- (CysRS), glutaminyl(GlnRS), and lysyl-tRNA synthetases (LysRS). In addition, the assignment of an open reading frame (ORF) encoding seryl-tRNA synthetases (SerRS) was based on rather low homology, while there appeared to be three reasonable candidates to encode the two subunits of phenylalanyl-tRNA synthetase (PheRS). Furthermore, key components of the selenocysteinyl-tRNA synthesis pathway could not be identified. Archaeal protein synthesis utilizes the 20 canonical amino acids (as well as
D. Tumbula et al.
Figure 1.—(Top) The direct aminoacylation pathway. (Bottom) The transamidation pathway of Gln-tRNAGln formation (from Curnow et al. 1997). A similar pathway exists for AsntRNAAsn formation: in the first step, tRNAAsn is aspartylated by a nondiscriminating aspartyl-tRNA synthetase, and the second reaction is catalyzed by an Asp-tRNAAsn amidotransferase activity. Glutamine can serve as the amide donor for both amidotransferase activities.
selenocysteine in many of these organisms), so pathways to synthesize the corresponding aminoacyl-tRNAs must exist in archaea. Biochemical and genetic approaches, combined with the vast amount of archaeal sequence data released since 1996, have now largely explained how aminoacyl-tRNAs are made in these organisms. While these studies were initiated in archaea, they have had far-reaching implications for our understanding of aminoacyl-tRNA synthesis in general, as discussed below. tRNA-dependent amino acid transformations: About 30 years ago it was reported that Bacillus subtilis (Wilcox and Nirenberg 1968; Wilcox 1969) and possibly Halobacterium cutirubrum (White and Bayley 1972) possess an indirect way of making Gln-tRNA. This observation was later expanded to include many more organisms and organelles (Scho¨n et al. 1988a), yet the elusive nature of the enzymes involved in this pathway may have thwarted a more universal recognition of this fact. However, in the last few years the biochemical and genetic description of some of these enzymes and the analysis of many microbial genome sequences established that Gln-tRNAGln and Asn-tRNAAsn formation can occur in the absence of their respective aminoacyl-tRNA synthetases. In the first step of these indirect pathways (Figure 1), a nondiscriminating glutamyl-(GluRS) or aspartyl-tRNA synthetase (AspRS) forms the misaminoacylated substrate for the second enzyme, Glu-tRNAGln or Asp-tRNAAsn, respectively. Transamidation of the tRNA-bound amino acid in the second step then yields the correctly acylated Gln-tRNAGln or Asn-tRNAAsn. The enzymes responsible have been named Glu-tRNAGln amidotransferase (Glu-AdT) and Asp-tRNAAsn amidotransferase (Asp-AdT). The preferred amide donor in these reactions is glutamine (and also asparagine), which is transformed to glutamate (or aspartate). This enzymatic reaction is characteristic of the glutamine amidotrans-
ferases, enzymes ubiquitous in the synthesis of amino acids, amino sugars, purine and pyrimidine bases, and coenzymes (Zalkin and Smith 1998). While Glu-AdT was first characterized and cloned from B. subtilis (Curnow et al. 1997), Asp-AdT activity was first demonstrated in extracts of the archaeon Haloferax volcanii (Curnow et al. 1996). Our current knowledge of these enzymes is still sketchy. B. subtilis Glu-AdT is a heterotrimeric enzyme; the genes for its three subunits are arranged in an operon (gatCAB). The Gat proteins have no similarity to the known glutamine amidotransferases (Curnow et al. 1997). The A subunit has sequence similarity to certain bacterial amidases, but the known characterized enzymes (Kobayashi et al. 1993) do not use glutamine or asparagine as substrates. However, GatA (coexpressed with GatC) does convert glutamine to glutamate (A. W. Curnow, unpublished results). On the basis of this information it was proposed that the A subunit functions as amide donor by hydrolyzing glutamine, while the B subunit may be a tRNA-binding protein. The role of the C subunit may be in the stabilization or correct folding of the A subunit (Curnow et al. 1997). How many subunits are needed for activity? Genomic analysis suggests that the yeast mitochondrial Glu-AdT has only an A and a B subunit. However, the lack of a sequence with similarity to gatC does not rule out the presence of a GatC-like protein structure encoded by a dissimilar sequence. Yet a Thermus thermophilus Asp-AdT enzyme was highly active, but contained only the two subunits corresponding to gatA and gatB (Becker and Kern 1998). Thus, GatA/GatB heterodimers may also be active, a fact to be kept in mind when searching in genomes for ORFs responsible for the tRNA-dependent transamidation pathway. A look at genomic sequences reveals that bacteria carry two forms of gat gene arrangements: some of them have operon structures like B. subtilis (e.g., Chlamydia trachomatis), while in others the genes for the subunits are dispersed throughout the chromosome (e.g., Deinococcus radiodurans). The latter gene arrangement is the one found in archaea; currently most known sequenced archaea have unlinked gatA, gatB, and gatC genes. Additionally, a gatB ortholog and a gatB paralog are in each of the genomic sequences of M. jannaschii, Methanobacterium thermoautotrophicum DH, and Archaeoglobus fulgidus (Bult et al. 1996; Klenk et al. 1997; Smith et al. 1997). In comparison, the only discernible gat gene in the genome of Pyrococcus horikoshii, which lacks GlnRS, is gatB (Kawarabaysi et al. 1998); thus the pathways of Gln-tRNA formation in this organism remain to be established. However, an AsnRS gene is present in the Pyrococcus genome. From these genome examinations it is clear that all archaea lack GlnRS and instead use the transamidation pathway (Table 1). However, none of the archaeal transamidation enzymes have been purified and characterized, which is necessary to reveal the
Archaeal Aminoacyl-tRNA Synthesis
TABLE 1 Biochemical evidence for aminoacyl-tRNA formation by transamidation Aminoacyl-tRNA
Organisms and references
Archaeaa,b Haloferax volcanii (Curnow et al. 1996) Halobacterium cutirubrum (White and Bayley 1972) Methanococcus maripaludisc Methanobacterium thermoautotrophicumd Bacteria Gram-positivea Bacillus subtilis (Wilcox and Nirenberg 1968) Bacillus megaterium (Wilcox and Nirenberg 1968) Lactobacillus acidophilus (Wilcox and Nirenberg 1968) Lactobacillus bulgaricus (Scho¨n et al. 1988b) Streptococcus faecalis (Wilcox 1969) Streptomyces antibioticus (Wilcox 1969) Gram-negative Rhizobium meliloti (Gagnon et al. 1996) Helicobacter pylorie Exceptions: Escherichia coli f (Lazzarini and Mehler 1964), Azotobacter vinelandii, and Proteus vulgaris (Wilcox 1969) Synechocystis sp. strain PCC6803g (Scho¨n et al. 1988a) Eukaryal organelles Chloroplastsa Barley (Scho¨n et al. 1988a) Spinach (Scho¨n et al. 1988a) Chlamydomonas reinhardtii ( Jahn et al. 1990) Mitochondria Yeast (Martin and Rabinowitz 1984) Mouse (Scho¨n et al. 1988a) Mung bean (Scho¨n et al. 1988a) Exceptions: Leishmania tarentolae and Trypanosoma brucei (Nabholz et al. 1997)
Archaeaa,b Haloferax volcanii (Curnow et al. 1996) Methanococcus maripaludisc Methanobacterium thermoautotrophicumd Bacteriah Deinococcus radiodurans (Curnow et al. 1998) Thermus thermophilus (Becker and Kern 1998) Helicobacter pylorie
Within this group, exceptions to this transamidation route have yet to be demonstrated biochemically. Except for Pyrobaculum aerophilum and Pyrococcus horikoshii, archaeal genomic sequences lack GlnRS and AsnRS genes and contain homologs of gatC, gatA, and gatB (Bult et al. 1996; Klenk et al. 1997; Smith et al. 1997). c A. W. Curnow, M. Ibba and D. So¨ll (unpublished results). d D. L. Tumbula, H. D. Becker and D. So¨ll (unpublished results). e D. L. Tumbula, A. Buhmann and D. So¨ll (unpublished results). The genomic sequence of Helicobacter pylori lacks GlnRS and AsnRS genes and contains homologs of gatC, gatA, and gatB (Tomb et al. 1997). f The genomic sequence of E. coli has a GlnRS gene and lacks homologs of the gat genes (Blattner et al. 1997). g The Synechocystis genome sequence lacks a GlnRS gene and contains homologs of gatC, gatA, and gatB (Kaneko et al. 1996). h Most bacterial genomic sequences contain an AsnRS gene. b
exact details of how these organisms make Gln-tRNA and Asn-tRNA. Two transamidases of differing specificities (Glu-AdT and Asp-AdT) are suggested in some archaea by sequence data. According to this the two enzymes would
share the C and A subunits, but would differ in the B subunit; an ortholog (to B. subtilis) for Glu-AdT and a paralog for Asp-AdT. Both transamidation pathways have been demonstrated by tRNA-dependent amino acid transformations of labeled substrates in cell extracts
D. Tumbula et al.
of H. volcanii, Methanococcus maripaludis, and M. thermoautotrophicum Marburg (Curnow et al. 1996; H. D. Becker, A. W. Curnow, M. Ibba and D. Tumbula, unpublished results). However, this clear distinction has been muddied recently by the demonstration that, in addition to Gln-tRNA synthesis, the gatCAB-encoded Glu-AdT enzymes from B. subtilis and D. radiodurans can also carry out the transamidation of Asp-tRNAAsn to AsntRNAAsn (Curnow et al. 1998). Thus, there is no necessity for two separate enzymes; as shown in these organisms, the tRNA specificity is context dependent and depends on the presence or absence of misacylated tRNAGln or tRNAAsn. Therefore the presence in the organism of a nondiscriminating GluRS or AspRS, supplying the required misacylated tRNA substrate, is the key factor for “endowing” the amidotransferase with the needed tRNA specificity. Does the presence in an organism of one aminoacyltRNA synthesis pathway a priori exclude another route to the same aminoacyl-tRNA? The initial assumption was “yes,” and, in line with the occurrence of the indirect pathways, direct glutaminylation and asparaginylation of tRNA from H. volcanii and H. cutirubrum could not be demonstrated (Griffiths and Bayley 1969; Gupta 1984; Curnow et al. 1996). However, contrary to initial expectations, bacteria have now been found that contain both direct and indirect pathways of Gln-tRNA and Asn-tRNA formation (Becker and Kern 1998; Curnow et al. 1998). Without further biochemical and metabolic investigations no conclusions can be reached on the respective roles of these pathways. It is clear that studies in the archaea will clarify the issue of number and nature of tRNA-dependent amidotransferases. A corollary to the transamidation pathways is the required presence of nondiscriminating GluRS or AspRS enzymes. Fortunately, the first crystal structure of an archaeal aminoacyl-tRNA synthetase was that of AspRS from Pyrococcus kodakaraensis KOD1 (Schmitt et al. 1998). Knowledge of the structures of the native enzyme and its complexes with ATP and aspartic acid suggested a basis for the degenerate specificity of a naturally occurring misaminoacylating enzyme. A loop in yeast AspRS, a discriminating synthetase, which specifically interacts with the tRNA anticodon (positions 36 and 37) is shortened in the corresponding region of the pyrococcal AspRS. Modeling the pyrococcal structure with tRNA suggests that the loop and anticodon do not interact in the archaeal AspRS. In analogy to its closely related ortholog in T. thermophilus the pyrococcal AspRS is a nondiscriminating enzyme (Becker and Kern 1998). The pyrococcal enzyme also displays the unusual characteristic that ATP can be substituted by GTP or UTP in the aspartylation reaction (Fujiwara et al. 1996). No information is yet available for the archaeal, or indeed any other, nondiscriminating GluRS. Lysyl-tRNA synthetase: The absence of AsnRS and GlnRS from archaeal genomes was easily explained by
their replacement with tRNA-dependent transamidation pathways (see above). The lack of an obvious ortholog for LysRS was initially more problematic (Ibba et al. 1997a). The possibility that LysRS was absent, with Lys-tRNA instead made via a mischarging pathway, seemed unlikely because there exist no obvious routes to easily transform another amino acid into lysine (particularly at the 39-end of a tRNA). These ideas, which suggest that LysRS must be present in some form, were confirmed by the observation that cell-free extracts of both H. volcanii and M. maripaludis are able to directly attach lysine to tRNA (M. Ibba and A. W. Curnow, unpublished results). This assay was then used as the basis for the purification of M. maripaludis LysRS, the N-terminal sequencing of which allowed the cloning of the corresponding gene (lysS). Heterologous expression of this gene in E. coli gave rise to a protein with canonical LysRS activity. The sequence of the M. maripaludis lysS gene showed no similarity to known LysRSencoding genes, but was highly homologous to unassigned open reading frames in all of the archaeal genomes that lacked an identifiable LysRS (Ibba et al. 1997b). Interestingly, this group of archaeal lysS genes all encoded proteins containing sequence motifs characteristic of class I aminoacyl-tRNA synthetases, in contrast to all other then-known LysRS proteins that were members of class II. The division of the aminoacyl-tRNA synthetases into two classes is based upon distinct and unrelated active site topologies, with membership of a particular class being conserved throughout the living kingdom (Eriani et al. 1990; the so-called “class rule”). The violation of the class rule (of which this remains the only known example to date) by the existence of both class I and class II LysRS proteins has subsequently been confirmed by various biochemical approaches (Ibba et al. 1999). The initial finding that many, but not all, archaea contain a novel LysRS led to the suggestion that this protein might be confined to the archaeal kingdom. However, subsequent studies arising from bacterial genomics have shown that the class I LysRS is also present in several bacteria to the exclusion of the more common class II protein (Ibba et al. 1997c). Comparisons of the predicted amino acid sequences for the 15 known class I LysRS proteins show that the bacterial examples do not cluster together but instead form subgroups most closely related to particular groups of archaeal proteins (Figure 2). The class I LysRS is found in both Euryarchaeota (two species of Methanococcus and Pyrococcus, M. thermoautotrophicum, Thermoplasma acidophilum, Methanosarcina mazei, and A. fulgidus) and in Crenarchaeota (a species of Cenarchaeum and Aeropyrum pernix). The known bacterial class I lysS genes are found in disparate taxa, the spirochetes (Borrelia burgdorferi and Treponema pallidum), a G 1 C rich gram-positive bacterium (Streptomyces coelicolor) and the a-proteobacteria (Rickettsia prowazekii and Rhodobacter capsulatus). The euryarchaeal
Archaeal Aminoacyl-tRNA Synthesis
Figure 2.—Unrooted phylogeny of class I LysRS proteins. This was assembled with the maximum likelihood method (5000 puzzling steps) implemented in the program PUZZLE 4.0 (Strimmer and von Haeseler 1996). Sequences were aligned with the CLUSTAL X program (Thompson et al. 1997). Sequences not found in the GenBank or WIT databases are Thermoplasma acidophilum (courtesy of A. Ruepp and W. Baumeister, MaxPlanck-Institut fu¨r Biochemie, Manrtinsried, Germany) and Streptomyces coelicolor (data taken from the Sanger Centre web site).
versions of the enzyme are a coherent phylogenetic grouping in agreement with the 16S rRNA phylogeny (Olsen et al. 1994; Barns et al. 1996) from which the crenarchaeal examples are distant. In contrast, the bacterial LysRS proteins do not form a single grouping. The relationships between the various class I LysRS proteins suggest that the bacterial examples have been acquired by at least two horizontal gene transfer events from archaea, with the existing class II LysRS subsequently displaced (Ibba et al. 1999). Thus, the LysRS in a-proteobacteria may have arisen from a crenarchaeal LysRS and that in the spirochetes and Streptomyces from the pyrococcal LysRS. As mentioned above, not all archaea possess a class I-type LysRS. For example, the crenarchaeotes Pyrobaculum aerophilum (S. Fitz-Gibbon, unpublished data) and Sulfolobus solfataricus (Sensen et al. 1996; C. W. Sensen, personal communication) both contain only class IItype LysRS proteins. Closer examination of these crenarchaeal sequences shows that they are more similar to each other than to any other LysRS and that they have reasonably high homology to bacterial proteins (e.g., both show over 43% identity at the amino acid level to Thermotoga maritima LysRS). This last observation in particular suggests that these crenarchaeal class II LysRS might have been acquired from bacteria. The apparent two-way transfer of the different LysRS encoding genes, both to and from the archaeal kingdom, has provided clear evidence of the dynamic nature
of aminoacyl-tRNA synthetase evolution. This is in stark contrast to previous predictions that suggested that the obvious link between the genetic code and the aminoacyl-tRNA synthetases would limit the variation of this enzyme family. What instead has emerged from studies of the class I LysRS is clear evidence that the tRNAs themselves predate the aminoacyl-tRNA synthetases (Ibba et al. 1999; Schimmel and Ribas de Pouplana 1999) and by implication must be the true determinants of the genetic code. This is borne out by the observations that the class I and class II LysRS proteins can recognize the same tRNALys in vitro and in vivo (Ibba et al. 1999). Phenylalanyl-tRNA synthetase: Phenylalanyl-tRNA synthetase (PheRS) is a class II synthetase; in most organisms it consists of two subunits in a heterotetrameric a2b2 quaternary structure, although monomeric a-forms are also found (Sanni et al. 1991). PheRS from the methanogen Methanosarcina barkerii was the first archaeal aminoacyl-tRNA synthetase to be purified (Rauhut et al. 1984). Its a- and b- subunits are 60 and 70 kD in size, but no sequence information is available. The genomic sequences of P. horikoshii and Pyrococcus furiosus exhibit one gene each for the two PheRS subunits (Kawarabayasi et al. 1998; WIT1 ORFs RPF01697 and RPF01699). However, each of the genome se-
1 As these sequences are not yet in the GenBank database, the WIT ORF number is given; see http://wit.mcs.anl.gov/WIT2/.
D. Tumbula et al.
quences of M. jannaschii, M. thermoautotrophicum DH, and A. fulgidus contains three ORFs with similarity to PheRS subunits (Bult et al. 1996; Klenk et al. 1997; Smith et al. 1997). One ORF (MTH0770) displayed high sequence similarity to bacterial and eukaryal PheRS b-subunits. In addition, there are two ORFs (e.g., MTH0742 and MTH1501) with similarity to a-subunits from nonarchaea. Their sizes (in kilodaltons) are 58.5 and 61, respectively. Which of these potential a-subunits is found in the active PheRS? We addressed this question by purifying PheRS from M. thermoautotrophicum Marburg. SDS-polyacrylamide gel electrophoresis of the purified enzyme revealed two proteins in about equimolar ratio (U. C. Vothknecht, unpublished results). The aminoterminal sequences were obtained for both proteins. The sequence of the 70-kD protein matched that of ORF MTH0770 (which has sequence similarity with known PheRS b-subunits), while the N-terminal sequence of the 60-kD protein corresponded to that of ORF MTH0742. Thus, this ORF encodes the PheRS a-subunits used in this organism. It is likely that in M. jannaschii and A. fulgidus the PheRS a-subunit is encoded by the ORFs with the highest homology to the M. thermoautotrophicum protein (i.e., MJ0487 and AF1955). What is the function of the protein encoded by ORF MTH1501 in M. thermoautotrophicum and the other euryarchaeotes? It may be a pseudogene, although the highly conserved sequence suggests a functional protein. Alternatively, this ORF may encode a synthetase-like protein with unknown function. There are several examples of the involvement of synthetase-like proteins in reactions other than tRNA aminoacylation (see, e.g., Giritch et al. 1997). However, not much is known to date about the function of these proteins. Seryl-tRNA synthetase: SerRS is a class II synthetase characterized by the three signature motifs characteristic for all class II enzymes (Cusack et al. 1996). Surprisingly, three kinds of SerRS are found in the archaea (Doolittle and Handy 1998; Kim et al. 1998). The enzyme from the halophilic archaeon Haloarcula marismortui was the first archaeal SerRS to be purified and its gene sequence identified (Taupin et al. 1997). It possesses high sequence similarity to bacterial SerRS proteins and clusters within the bacterial branch in maximum likelihood analyses. A different clade comprising the SerRS enzymes from several other archaea (including P. horikoshii, P. furiosus, P. aerophilum, and A. fulgidus) shares high sequence similarity to its eukaryotic and bacterial counterparts (Klenk et al. 1997; WIT database; S. Fitz-Gibbon, unpublished data; Kawarabayasi et al. 1998). In contrast, the three methanogenic archaea M. jannaschii, M. maripaludis, and M. thermoautotrophicum contain SerRS-encoding genes (serS) that display only low sequence similarity to the known serS genes from all other organisms and encode a structurally uncommon enzyme with an altered motif II (Kim et al. 1998). Bio-
chemical analysis of purified and heterologously expressed SerRS from M. thermoautotrophicum and M. maripaludis showed that these proteins in fact have canonical SerRS activity (Kim et al. 1998). While it has been demonstrated that the M. thermoautotrophicum and M. maripaludis SerRS enzymes aminoacylate E. coli tRNA, detailed biochemical, mutational, and structural studies are warranted to elucidate the structure-function relationships in this unusual SerRS enzyme. Cysteinyl-tRNA synthetase: Biochemical, genetic, and genomic studies have revealed that many archaea contain a relatively recent bacterial version of CysRS and that lateral transfer of this gene between bacteria and archaea was widespread (Li et al. 1999). However, as discussed above, there are two organisms (M. jannaschii and M. thermoautotrophicum) with known complete genomic sequences that do not contain an ORF for a recognizable CysRS. Barring the possibility that the published genomes miss some genes, one is led to believe that these organisms must possess a currently unrecognizable enzyme responsible for Cys-tRNA formation. Because domain searches have not revealed unidentified ORFs with clear class I and class II signatures (except the unassigned PheRS-like ORF; see above), such an enzyme would not have similarity with the known synthetases. On the other hand, an indirect pathway of Cys-tRNACys formation by thiolation of Ser-tRNACys analogous to the biosynthetic route for selenocysteinyl-tRNA was still a possibility (Baron and Bo¨ck 1995). For this reason it was intriguing to realize that the structually uncommon SerRS (see above) is found in organisms whose genomic sequence lacks a recognizable gene encoding CysRS (M. jannaschii and M. thermoautotrophicum). However, in vitro experiments with pure M. thermoautotrophicum SerRS did not provide support for mischarging of tRNACys with serine (Kim et al. 1998). Nevertheless, M. maripaludis, a relative of M. jannaschii for which a genomic sequence is not yet available, contains a canonical CysRS with similarity to the pyrococcal enzyme (C. Stathopoulos, D. Graham, P. Haney and A. W. Curnow, unpublished results). However, this does not rule out the existence of another CysRS in this organism or the presence of a tRNA-dependent amino acid transformation pathway of Cys-tRNA formation in some archaea. Defining the pathway of Cys-tRNA synthesis in M. jannaschii and M. thermoautotrophicum is the last great challenge to our knowledge of aminoacyl-tRNA synthesis; it appears to be strictly an archaeal mystery. Selenocysteine formation: Selenocysteine, a UGAdirected, cotranslationally inserted, essential amino acid, is also found in many archaea (see, e.g., Wilting et al. 1997). The mechanism of selenocysteine formation in archaea is currently unknown. While orthologs to the E. coli selD (selenophosphate synthase) and selC (selenocysteine tRNA) genes have been found, archaea appear to employ nonorthologous proteins in the function of
Archaeal Aminoacyl-tRNA Synthesis
selA (selenocysteine synthase) and selB (SELB translation factor). Future prospects: The study of aminoacyl-tRNA synthesis in archaea has questioned, and in some cases overturned, several dogmatic views. The question now arises, since nearly all of the puzzles initially presented by the genomic sequence of M. jannaschii have been solved, should we return to our trusted model organisms such as E. coli? The answer is most certainly not, for the simple reason that the investigation of archaea has made it abundantly clear that the concept of a model organism for the study of synthetases is now redundant. For example, while it was first shown in archaea that a transamidation pathway can synthesize Asn-tRNA and that a class I LysRS exists, both have subsequently proved to be common in bacteria. Similarly, the apparently universal absence of GlnRS from archaea, together with its scarcity in bacteria, has confirmed the prediction that this is essentially a eukaryotic enzyme (Siatecka et al. 1998). These examples clearly illustrate how the study of aminoacyl-tRNA synthesis in archaea has profoundly altered our preconceptions about this essential process. To date, the utilization of alternative pathways and enzymes to synthesize particular aminoacyl-tRNAs shows no predictable distribution in the living kingdom (Table 1), with some bacteria even employing two pathways to make the same Asn-tRNA (Becker and Kern 1998; Curnow et al. 1998). This lack of a consistent evolutionary pattern is seen among virtually all the aminoacyltRNA synthetase families (Diaz-Lazcoz et al. 1998; C. Woese, G. Olsen, M. Ibba and D. So¨ll, unpublished results). To better understand these unexpected levels of both biochemical and evolutionary complexity in aminoacyl-tRNA synthesis, the sampling of a wider range of organisms is now essential: the expansion of experimentation on archaeal aminoacyl-tRNA synthesis forms an integral part of such studies. In light of the numerous unexpected results that have already come from working with archaea, it seems reasonable to assume that more surprises are in store for the aminoacyl-tRNA synthesis field. We thank Rajdeep Das for help with PheRS purification. The work described in this article was supported by grants from the National Institute of General Medical Sciences and Bristol-Myers Squibb (D.S.). D.L.T. holds a postdoctoral fellowship from the National Institute of General Medical Sciences, and H.B. is a European Molecular Biology Organization postdoctoral fellow. M.I. gratefully acknowledges the support of the Alfred Benzon Foundation.
LITERATURE CITED Barns, S. M., C. F. Delwiche, J. D. Palmer and N. R. Pace, 1996 Perspectives on archaeal diversity, thermophily and monophyly from environmental rRNA sequences. Proc. Natl. Acad. Sci. USA 93: 9188–9193. Baron, C., and A. Bo¨ck, 1995 The selenocysteine-inserting tRNA species: structure and function, pp. 529–544 in tRNA: Structure, Biosynthesis and Function, edited by D. So¨ll and U. RajBhandary. ASM Press, Washington, DC.
Becker, H. D., and D. Kern, 1998 Thermus thermophilus: a link in evolution of the tRNA-dependent amino acid amidation pathways. Proc. Natl. Acad. Sci. USA 95: 12832–12837. Blattner, F. R., G. Plunkett, C. A. Bloch, N. T. Perna, V. Burland et al., 1997 The complete genome sequence of Escherichia coli K-12. Science 277: 1453–1474. Bult, C. J., O. White, G. J. Olsen, L. Zhou, R. D. Fleischmann et al., 1996 Complete genome sequence of the methanogenic archaeon, Methanococcus jannaschii. Science 273: 1058–1073. Curnow, A. W., M. Ibba and D. So¨ll, 1996 tRNA-dependent asparagine formation. Nature 382: 589–590. Curnow, A. W., K.-W. Hong, R. Yuan, S.-I. Kim, O. Martins et al., 1997 Glu-tRNAGln amidotransferase: a novel heterotrimeric enzyme required for correct decoding of glutamine codons during translation. Proc. Natl. Acad. Sci. USA 94: 11819–11826. Curnow, A. W., D. L. Tumbula, J. T. Pelaschier, B. Min and D. So¨ll, 1998 Glutamyl-tRNAGln amidotransferase in Deinococcus radiodurans may be confined to asparagine biosynthesis. Proc. Natl. Acad. Sci. USA 95: 12838–12843. Cusack, S., A. Yaremchuk and M. Tukalo, 1996 The crystal structure of the ternary complex of T. thermophilus seryl-tRNA synthetase with tRNASer and a seryl-adenylate analogue reveals a conformational switch in the active site. EMBO J. 15: 2834–2842. Diaz-Lazcoz, Y., J. C. Aude, P. Nitschke, H. Chiapello, C. LandesDevauchelle et al., 1998 Evolution of genes, evolution of species: the case of aminoacyl-tRNA synthetases. Mol. Biol. Evol. 15: 1548–1561. Doolittle, R. F., and J. Handy, 1998 Evolutionary anomalies among the aminoacyl-tRNA synthetases. Curr. Opin. Genet. Dev. 8: 630–636. Eriani, G., M. Delarue, O. Poch, J. Gangloff and D. Moras, 1990 Partition of tRNA synthetases into two classes based on mutually exclusive sets of sequence motifs. Nature 347: 203–206. Fujiwara, S., S. Lee, M. Haruki, S. Kanaya, M. Takagi et al., 1996 Unusual enzyme characteristics of aspartyl-tRNA synthetase from hyperthermophilic archaeon Pyrococcus sp. KOD1. FEBS Lett. 394: 66–70. Gagnon, Y., L. Lacoste, N. Champagne and J. Lapointe, 1996 Widespread use of the Glu-tRNAGln transamidation pathway among bacteria. J. Biol. Chem. 271: 14856–14863. Giritch, A., A. Herbik, H.-J. Balzer, M. Ganal, U. W. Stephan et al., 1997 A root-specific iron-regulated gene of tomato encodes a lysyl-tRNA-synthetase-like protein. Eur. J. Biochem. 244: 310– 317. Griffiths, E., and S. T. Bayley, 1969 Properties of transfer ribonucleic acid and animoacyl transfer ribonucleic acid synthetases from an extremely halophilic bacterium. Biochemistry 8: 541– 551. Gupta, R., 1984 Halobacterium volcanii tRNAs. J. Biol. Chem. 259: 9461–9471. Ibba, M., A. W. Curnow and D. So¨ll, 1997a Aminoacyl-tRNA synthesis: divergent routes to a common goal. Trends Biochem. Sci. 22: 39–42. Ibba, M., S. Morgan, A. W. Curnow, D. R. Pridmore, U. C. Vothknecht et al., 1997b A euryarchaeal lysyl-tRNA synthetase: resemblance to class I synthetases. Science 278: 1119–1122. Ibba, M., J. L. Bono, P. A. Rosa and D. So¨ll, 1997c Archaeal-type lysyl-tRNA synthetase in the Lyme disease spirochete, Borrelia burgdorferi. Proc. Natl. Acad. Sci. USA 94: 14383–14388. Ibba, M., C. H. Losey, Y. Kawarabayasi, H. Kikuchi, S. Bunjun et al., 1999 Substrate recognition by class I lysyl-tRNA synthetases: a molecular basis for gene displacement. Proc. Natl. Acad. Sci. USA 96: 418–423. Jahn, D., Y.-C. Kim, Y. Ishino, M.-W. Chen and D. So¨ll, 1990 Purification and functional characterization of the Glu-tRNAGln amidotransferase from Chlamydomonas reinhardtii. J. Biol. Chem. 265: 8059–8064. Kaneko, T., S. Sato, H. Kotani, A. Tanaka, E. Asamizu et al., 1996 Sequence analysis of the genome of the unicellular cyanobacterium Synechocystis sp. strain PCC6803. II. Sequence determination of the entire genome and assignment of potential protein-coding regions. DNA Res. 30: 109–136. Kawarabayasi, Y. M. Sawada, H. Horikawa, Y. Haikawa, Y. Hino et al., 1998 Complete sequence and gene organization of the genome of a hyper-thermophilic archaebacterium, Pyrococcus horikoshii OT3. DNA Res. 5: 55–76.
D. Tumbula et al.
Kim, H.-S., U. C. Vothknecht, R. Hedderich, I. Celic and D. So¨ll, 1998 Sequence divergence of seryl-tRNA synthetases in archaea. J. Bacteriol. 180: 6446–6449. Klenk, H.-P., R. A. Clayton, J.-F. Tomb, O. White, K. E. Nelson et al., 1997 The complete genome sequence of the hyperthermophilic, sulphate-reducing archaeon Archaeoglobus fulgidus. Nature 390: 364–370. Kobayashi, M., H. Komeda, T. Nagasawa, M. Nishiyama, S. Horinouchi et al., 1993 Amidase coupled with low-molecular-mass nitril hydratase from Rhodoccous rhodochrous J1. Sequencing and expression of the gene and purification and characterization of the gene product. Eur. J. Biochem. 217: 327–336. Lazzarini, R. A., and A. H. Mehler, 1964 Separation of specific glutamate- and glutamine-activating enzymes from Escherichia coli. Biochem. 3: 1445–1449. Martin, N. C., and M. Rabinowitz, 1984 Glu-tRNAGln: an intermediate in yeast mitochondrial protein synthesis. Methods Enzymol. 106: 152–157. Nabholz, C. E., R. Hauser and A. Schneider, 1997 Leishmania tarentolae contains distinct cytosolic and mitochondrial glutaminyl-tRNA synthetase activities. Proc. Natl. Acad. Sci. USA 94: 7903–7908. Olsen, G. J., C. R. Woese and R. Overbeek, 1994 The winds of (evolutionary) change: breathing new life into microbiology. J. Bacteriol. 176: 1–6. Rauhut, R., H.-J. Gabius, W. Ku¨hn and F. Cramer, 1984 Phenylalanyl-tRNA synthetase from the archaebacterium Methanosarcina barkeri. J. Biol. Chem. 259: 6340–6345. Sanni, A., P. Walter, Y. Boulanger, J. P. Ebel and F. Fasiolo, 1991 Evolution of aminoacyl-tRNA synthetase quaternary structure and activity: Saccharomyces cerevisiae mitochondrial phenylalanyl-tRNA synthetase. Proc. Natl. Acad. Sci. USA 88: 8387–8391. Schimmel, P., and L. Ribas de Pouplana, 1999 Genetic code origins: experiments confirm phylogenetic predictions and may explain a puzzle. Proc. Natl. Acad. Sci. USA 96: 327–328. Schmitt, E., L. Moulinier, S. Fujiwara, T. Imanaka, J.-C. Thierry et al., 1998 Crystal structure of aspartyl-tRNA synthetase from Pyrococcus kodakaraensis KOD: archaeon specificity and catalytic mechanism of adenylate formation. EMBO J. 17: 5227–5237. Scho¨n, A., C. G. Kannagara, S. Gough and D. So¨ll, 1988a Protein biosynthesis in organelles requires misaminoacylation of tRNA. Nature 331: 187–190. Scho¨n, A., H. Hottinger and D. So¨ll, 1988b Misaminoacylation and transamidation are required for protein biosynthesis in Lactobacillus bulgaricus. Biochimie 70: 391–394.
Sensen, C. W., H. P. Klenk, R. K. Singh, G. Allard, C. C. Chan et al., 1996 Organizational characteristics and information content of an archaeal genome: 156 kb of sequence from Sulfolobus solfataricus P2. Mol. Microbiol. 22: 175–191. Siatecka, M., M. Rozek, J. Barciszewski and M. Mirande, 1998 Modular evolution of the Glx-tRNA synthetase family—rooting of the evolutionary tree between the bacteria and archaea/eukarya branches. Eur. J. Biochem. 256: 80–87. Smith, D. R., L. A. Doucette-Stamm, C. Deloughery, H. Lee, J. Dubois et al., 1997 Complete genome sequence of Methanobacterium thermoautotrophicum DH: functional anaylsis and comparative genomics. J. Bacteriol. 179: 7135–7155. Strimmer, K., and A. von Haeseler, 1996 Quarter puzzling: A quarter maximum-likelihood method for reconstructing tree topologies. Mol. Biol. Evol. 13: 964–969. Taupin, C. M.-S., M. Ha¨rtlein and R. Leberman, 1997 Seryl-tRNA synthetase from the extreme halophile Haloarcula marismortui. Isolation, characterization and sequencing of the gene and its expression in Escherichia coli. Eur. J. Biochem. 243: 141–150. Thompson, J. D., T. J. Gibson, F. Plewniak, F. Jeanmougin and D. G. Higgins, 1997 The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 25: 4876–4882. Tomb, J. F., O. White, A. R. Kerlavage, R. A. Clayton, G. G. Sutton et al., 1997 The complete genome sequence of the gastric pathogen Helicobacter pylori. Nature 388: 539–547. White, B. N., and S. T. Bayley, 1972 Further codon assignments in an extremely halophilic bacterium using a cell-free proteinsynthesizing system and a ribosomal binding assay. Can. J. Biochem. 50: 600–609. Wilcox, M., 1969 g-phosphoryl ester of Glu-tRNAGln as an intermediate in Bacillus subtilis glutaminyl-tRNA synthesis. Cold Spring Harbor Symp. Quant. Biol. 34: 521–528. Wilcox, M., and M. Nirenberg, 1968 Transfer RNA as a cofactor coupling amino acid synthesis with that of protein. Proc. Natl. Acad. Sci. USA 61: 229–236. Wilting, R., S. Schorling, B. C. Persson and A. Bo¨ck, 1997 Selenoprotein synthesis in archaea: identification of an mRNA element of Methanococcus jannaschii probably directing selenocysteine insertion. J. Mol. Biol. 266: 637–641. Zalkin, H., and J. L. Smith, 1998 Enzymes utilizing glutamine as an amide donor. Adv. Enzymol. Relat. Areas Mol. Biol. 72: 87–144. Communicating editor: W. B. Whitman