![]() | ![]() |
Formats:
|
||||||||||||||||
Copyright © 2006 Heron Publishing—Victoria, Canada Lineage-specific partitions in archaeal transcription 1 Microarray Group, The European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, U.K. 2 ALCIMED, 84 boulevard Vivier Merle, 69 485 Lyon Cedex 03, France 3 Computational Genomics Unit, Institute of Agrobiotechnology, Center for Research and Technology Hellas, Thessalonica, 57001 Greece * Corresponding author (Email: coulson/at/ebi.ac.uk) Received September 13, 2006; Accepted October 23, 2006. Abstract
The phylogenetic distribution of the components comprising the
transcriptional machinery in the crenarchaeal and euryarchaeal
lineages of the Archaea was analyzed in a systematic manner by
genome-wide profiling of transcription complements in fifteen complete
archaeal genome sequences. Initially, a reference set of
transcription-associated proteins (TAPs) consisting of sequences
functioning in all aspects of the transcriptional process, and
originating from the three domains of life, was used to query the
genomes. TAP-families were detected by sequence clustering of the TAPs
and their archaeal homologues, and through extensive database
searching, these families were assigned a function. The phylogenetic
origins of archaeal genes matching hidden Markov model profiles of
protein domains associated with transcription, and those encoding the
TAP-homologues, showed there is extensive lineage-specificity of
proteins that function as regulators of transcription: most of these
sequences are present solely in the Euryarchaeota, with nearly all of
them homologous to bacterial DNA-binding proteins. Strikingly, the
hidden Markov model profile searches revealed that archaeal chromatin
and histone-modifying enzymes also display extensive
taxon-restrictedness, both across and within the two phyla.
Keywords: genome profiling, protein families, sequence clustering, transcription-associated proteins Introduction
Transcription, a core gene expression process, involves different agents
participating in initiation, elongation, termination and regulation. The
basic principles of transcriptional regulation in the Bacteria and
Eukaryota have been outlined (Ptashne
and Gann 1997), contrasting with the Archaea, where such
mechanisms are less well-understood. The study of the archaeal
transcriptional machinery is important to an understanding of both the
molecular mechanisms, and the evolutionary history, of transcriptional
regulation in all three domains of life. Furthermore, these analyses may
reveal how archaea respond to environmental challenges, in particular,
given their possible association with various aspects of human disease
(Eckburg et al. 2003). Several
previous studies have shown that the RNA polymerase core enzyme exhibits
structural similarity between the Archaea and the Eukaryota
(Puhler et al. 1989). Moreover,
the minimal set of factors required for in vitro transcription
initiation in archaea consists of TATA-box binding protein (TBP), TFIIB
and RNA polymerase II (Werner and
Weinzierl 2002). In bacteria, however, the process appears to be
fundamentally different (Struhl
1999), with regulation accomplished by an entirely different set
of proteins (Gralla 1996).
Evidence from sequence similarity studies between RNA polymerases
suggests that archaeal transcription shared certain components with that
of the Eukarya (Puhler et al.
1989), a conclusion further supported by the discovery of TFIIB
in Pyrococcus furiosus
(Ouzounis and Sander 1992) and
TBP in P. woesei
(Rowlands et al. 1994). The
strong similarity of archaeal transcription initiation factors with
their eukaryotic counterparts has reinforced the concept that these
domains shared an ancestral transcriptional apparatus. However, a
rigorous comparison with bacteria was not performed until entire
archaeal genomes became available. It was then shown that archaea
contain a significant proportion of bacterial transcripional regulators,
in addition to their eukaryotic-like transcription initiation factors
(Kyrpides and Ouzounis 1999), for
example Lrp (Kyrpides and Ouzounis
1995) and sigma-70 (Kyrpides and
Ouzounis 1997). Notably, the only transcriptional components
common to all three domains of life are the two largest RNA polymerase
subunits, SIR2, NifL and TenA (Coulson
et al. 2001), the latter found fused with the metabolic enzyme
ThiD in eukaryotes (Ouzounis and
Kyrpides 1997). With 15 archaeal genomes, encompassing a wider
range of genera from the two main archaeal lineages, a systematic
analysis of the archaeal transcriptional machinery would test the
robustness of the assumption that bacterial-type regulators and
eukaryotic initiation factors co-exist in the Archaea
(Bell et al.
2001a).
Methods
The reference sequence dataset of transcription-associated proteins
(TAPs) was extracted from SwissProt and TrEMBL from entries containing
the word “transcription” in their description and keyword
records, as previously described (Coulson
and Ouzounis 2003, Coulson et al.
2004). Searching and linking database records was facilitated by
SRS (Etzold et al. 1996). From a
total of 9874 TAP sequences, 4654 are bacterial, 4907 are eukaryotic and
313 are archaeal. Archaeal genomes were obtained from the COGENT
database (Janssen et al. 2003).
The following fifteen fully sequenced and published archaeal genome
sequences were used (strain names after comma, COGENT identifiers in
parentheses): Archaeoglobus fulgidus, DSM4304
(AFUL-DSM-01), Aeropyrum pernix, K1 (APER-XK1-01),
Halobacterium sp., NRC-1 (HALO-NRC- 01),
Methanosarcina acetivorans, C2A (MACE-C2A-01),
Methanoccocus jannaschii, DSM 2661 (MJAN-DSM-01),
Methanopyrus kandleri, AV19 (MKAN-AV1-01),
Methanosarcina mazei, Go1 (MMAZ-GO1-01),
Methanobacterium thermoautotrophicum, deltaH
(MTHE-DEL-01), Pyrococcus abyssi, GE5 (PABY-GE5-01),
Pyrobaculum aerophilum, IM2 (PAER-IM2-01),
Pyrococcus horikoshii, OT3 (PHOR-OT3- 01),
Sulfolobus solfataricus, P2 (SSOL-XP2-01),
Sulfolobus tokodaii, Str. 7 (STOK-XX7-01),
Thermoplasma acidophilum, DSM1728 (TACI-DSM-01) and
Thermoplasma volcanium, GSS1 (TVOL-GSS-01). There is
only one complete nanoarchaeotal genome available, and its uniqueness
precluded it from the intra-lineage analyses performed in these studies.
The TAP reference dataset was compared against the fifteen archaeal
genomes (36,194 sequences) using BLASTp with an E-value
cut-off of 10–6
(Altschul et al. 1997), after
being filtered for composition bias with CAST
(Promponas et al. 2000). For each
archaeal genome, the numbers of both the TAP-homologues and
matching-reference TAPs were recorded (see
Table 1). The 1938 archaeal
proteins and 2634 TAPs thus obtained (4572 in total) were clustered
using the TRIBE-MCL algorithm (Enright
et al. 2002), i.e., only the TAPs matching archaeal proteins
(2634 in number) and these archaeal homologues (1938 in number) were
clustered by sequence similarity. To assess the effect of granularity,
two runs were performed with inflation values of 2.0 and 5.0, as
previously described (Coulson and
Ouzounis 2003). The distribution of both singletons and families
across the Archaea and the reference dataset was examined (see
Table 2).
To further establish the validity of this approach, a parallel analysis
was conducted with hidden Markov model (HMM) profiles from the Pfam
database (version 7.8) (Bateman et al.
2004). The Pfam accession numbers associated with the 9874
reference TAPs were used to link directly to 508 unique Pfam entries.
The fifteen archaeal genomes were queried with HMM profiles, using
“hmmsearch” from the software package HMMER
(Eddy 1998). Hits were considered
significant if their score against the corresponding HMM profile was
above a trusted cut-off value, equivalent to the lowest-scoring known
member of the Pfam alignment. Of the 508 profiles, 173 have detected
homologues in archaea, of which 115 are not involved in transcription.
The remaining 58 profiles, defined as TA-HMMs, were validated by their
InterPro records to confirm that they were transcription-related.
All results are available at:
http://www.ebi.ac.uk/research/cgg/projects/transcription/archaea.
Results
The 9874 reference TAP sequences extracted from SWISS-PROT and TrEMBL
identified 1938 proteins in the fifteen archaeal genomes
(Table 1). On average, about 5%
of the genomes matched the reference set, with some slight variation;
the TAP matches identified by BLASTp represent about 4% of the gene
content of the Crenarchaeota, and about 6% of the gene content of the
Euryarchaeota. This observation suggests that the TAP complement of the
Crenarchaeota is more divergent from the reference set than that of the
Euryarchaeota (Table 1). However,
in both archaeal divisions, the abundance of TAP hits appears to
increase with genome size.
The extent of sequence divergence between the detected archaeal proteins
and the sequences in the reference set was examined by sequence
clustering analyses performed on the BLASTp similarity search results.
The clustering operations were implemented at inflation values of 2.0
and 5.0, corresponding to different granularities; the lower value
produces larger clusters, containing more distantly related sequences.
With an inflation value of 2.0, clustering results in 281 families of
which 179 (64%) contain at least one TAP and one archaeal sequence
(Table 2). With an inflation
value of 5.0, clustering results in 632 families of which 179 (28%) also
contain at least one TAP and one archaeal sequence. In this second case,
243 singletons were generated (38% of all clusters,
Table 2). Hence, attention was
focused on clusters generated with an inflation value of 2.0 because
these are both more accurate and broader than clusters generated with a
larger inflation value (Coulson and
Ouzounis 2003). Similar parameter settings have been shown to
result in a precision of >90% for InterPro, and >80% for SCOP
families (Enright et al. 2002).
Lineage specificity and functional classification
Clusters generated by TRIBE-MCL correspond to protein families whose
members are related by common function. Thus, the 179 TAP-containing
clusters generated at an inflation value of 2.0
(Table 2), and containing at
least one TAP and one archaeal homologue, were annotated manually
based on the description records of SwissProt obtained via SRS
(Etzold et al. 1996). After
this step, 119 clusters were found to be directly related to the
transcriptional process, corresponding to 906 archaeal sequences.
These transcription-associated (TA-) clusters were examined in terms
of the role played in the transcriptional process by their TAP members
and the phylogenetic origins of both the TAP and archaeal sequences
(Table 3). The term
“phylogenetic origin” refers to the phylogenetic domain to
which a TAP belongs, and in which lineage the archaeal homologue is
observed.
Analysis of the phylogenetic origins of the TAPs in the TA-clusters
showed that in just over 80% of the families (97 clusters) the TAPs
are found only in prokaryotic genomes
(Table 3). In ~44% (i.e., 43
out of 97 families) and ~37% (36 families) of these TA-clusters, the
TAP members of an individual cluster originate solely from either the
Archaea (A) or the Bacteria (B), respectively. TAP members in the
remaining ~19% of these TA-clusters originate from both the archaeal
and bacterial domains (AB); so, each of these 18 families contains at
least one archaeal and one bacterial TAP. Strikingly, in 89 out of the
97 families (~92%) whose TAPs are present only in prokaryotes (A, B
and AB families), the TAPs function as transcriptional regulators
(listed as ‘Regulators’ in
Table 3): in the B and AB
families, the role of every TAP member is to regulate transcription,
and in ~82% of the A families, the TAP members perform the same
regulatory role. This finding highlights the prevalence of regulators
of prokaryotic origin in the Archaea.
The above pattern contrasts with the other 22 TA-clusters, which
contain TAPs present in the Eukaryota (codes with E in
Table 3). In 10 of these
families (~46%) the TAPs comprise RNA polymerase subunits, and in six
families (~27%) they function as basal transcription factors,
corresponding to a total of 73% of families involved in transcription
initiation and synthesis. This compares with just 8% (8 out of 97) of
families of prokaryotic origin that contain TAPs of identical
function. Hence, few archaeal sequences show homology to eukaryotic
regulators of transcription and bacterial proteins associated with RNA
synthesis. However, even though a large number of archaeal sequences
exhibit homology to bacterial transcriptional regulators, there
appears to be a substantial proportion of archaeal-specific regulatory
factors represented by the archaeal families (A families in
Table 3).
The 119 TA-clusters exhibit a high degree of lineage-specificity; ~63%
(27 cases) and ~81% (29 cases) of the A and B families,
respectively—the majority of which are transcriptional
regulators—detect archaeal homologues that are unique to
Euryarchaeotal genomes (Table
3). This contrasts with the TA-clusters containing TAPs
originating from the Eukaryota, where the opposite pattern is
observed: ~83% and ~60% of the AE and ABE families, respectively,
contain archaeal homologues present in both the Crenarchaeota and
Euryarchaeota. Furthermore, in over four fifths of these phylogenetic
classes of TA-clusters (14 out of 17 families), the TAP members
function as either subunits of RNA polymerase or basal transcription
factors.
However, among the prokaryotic (A, B and AB) families, two-thirds (64
out of 97) contain archaeal homologues present only in the
Euryarchaeota, whereas in 25 (about one-quarter) of such clusters the
TAP homologues originate from both archaeal phyla. Very few of these
families (~8%, 8 out of 97) have as members archaeal TAP homologues
that are present uniquely in the Crenarchaeota. The vast majority of
the A, B and AB families (>90%, 89 families) contain TAPs that
function as regulators and are mostly present in Euryarchaeota only.
In particular, all the TAPs in the B families are DNA-binding
proteins, and >80% of their families possess only Euryarchaeota
sequences (Table 3). These
functional and taxonomic distribution patterns suggest that there are
either fewer (known) Crenarchaeota-specific sequences associated with
the control of transcription, or that such sequences are more diverged
from the bacterial-type regulators in the Euryarchaeota.
Protein domain representation in archaeal phyla
The TRIBE-MCL clustering analyses were complemented by querying the
archaeal genomes with 58 HMM profiles that had been validated to be
associated with the transcriptional process (TA-HMMs). A large
proportion of sequences were identified by both procedures,
corresponding to 600 proteins and 81 clusters.
(Table 4 displays for each
genome the number of TAP-homolgues detected by each method
individually, and the number detected by both methods.) Additionally,
the two methods of TRIBE-MCL clustering and HMM profile searches
uniquely detect 306 and 239 proteins, respectively
(Table 4). This degree of
overlap indicates a consistent identification of the
transcription-associated gene complement in the archaeal genomes under
consideration (Table 4). Fewer
proteins associated with the transcriptional process were found in the
Crenarchaeota than in the Euryarchaeota (~2.40% versus ~3.39%;
Table 4). The presence of
nearly 1.5-fold more TAP-homologues in the Euryarchaeota is highly
significant (P < 2×10–7,
Fisher’s exact test). Furthermore, a slight,
monotonically-increasing relationship of TAP-homologue abundance with
genome size is observed (Tables
1 and 4 and
Figure 1
The twenty-one TA-HMMs representing eleven of the twelve polypeptides
that constitute RNA polymerase, match approximately equivalent
abundances of genes in the Crenarchaeota and Euryarchaeota genomes
(~0.44% versus ~0.53% respectively: 49 out of 11,120 and 132 out of
25,074 sequences). Genes encoding subunits A′, A²,
B′, B², D, E, H and N are detected in all fifteen of the
archaeal species by the profile-based method (Supplementary
Table 1). Only matches to
subunits F, K and L were not observed in all genomes; F is not
detected in two of the crenarchaeal (P. aerophilum
and S. tokodaii) and just one of the euryarchaeal
genomes (M. kandleri), whereas L exhibits the
opposite pattern and is undetected in P. horikoshii,
T. volcanium (both euryarchaeotes) and P.
aerophilum (a crenarchaeote), with a homologue of K absent
only from P. horikoshii. This apparent absence of
only a small number of RNA polymerase subunits probably arises from
their diverged nature as genes encoding these proteins have been
observed in these species (Brochier et
al. 2004). Furthermore, nearly identical phylogenetic
distributions are observed with the RNA polymerase families detected
by the TRIBE-MCL clustering (Supplementary
Table 2), suggesting little
sequence divergence between the RNA polymerase subunits of the
euryarchaeota and crenarchaeota.
Significantly more sequences in the Euryarchaeota match TA-HMMs of
proteins associated with the initiation complex, and transcription
elongation and termination, than in the Crenarchaeota
(Figure 2
The lack of taxon-restrictedness exhibited by RNA polymerase subunits
and basal transcription factors contrasts sharply with proteins that
bind DNA nonspecifically and modulate chromatin structure
(Figure 2
A high degree of phylum-specificity is observed with sequences
matching the 24 transcriptional regulator TA-HMMs
(Figure 2
The pattern of elevated abundance of sequences homologous to bacterial
regulators in the Euryarchaeota (Figure
2
The only eukaryotic DNA-binding domain identified in the archaeal
genomes was a variant of the C2H2-zinc finger (PF00096), that detected
a total of 17 sequences across all Crenarchaeota and six Euryarchaeota
species. However, Halobacterium—the only
non-thermophile analyzed—appears most
“bacteria-like” in its complement of transcriptional
regulators; it is the only species with homologues of cold-shock
protein (PF00313), Crp (PF00325) and region 2 of s70
(PF04542). Therefore, although nearly all the archaeal transcriptional
regulators identified are related to bacterial repressors, there is
considerable heterogeneity across, and within the two phyla in the
types of regulator families present within a particular species.
Discussion
The genome-wide profiling of archaeal transcription that was performed
indicates that the Euryarchaeota have more proteins associated with the
transcriptional process than do the Crenarchaeota. Furthermore, in both
phyla, there is a trend showing that TAP-homologue abundance increases
with genome size, i.e., smaller genomes appear to contain
proportionately fewer transcription factors than larger ones. Evidence
of this type of correlation has also been observed in bacteria
(Cases et al. 2003). Although a
number of functional classes associated with RNA synthesis and
transcription initiation are shared both between and within the known
archaeal phyla, there are a number of surprising lineage-specific
patterns that have not previously been systematically characterized:
both the proteins that constitute archaeal chromatin and histone
modifying enzymes, as well as transcriptional regulators—which are
major contributors to the lineage-specificity—display extensive
sequence diversity across the archaeal domain.
Genes encoding eleven out of the twelve RNA polymerase subunits are
observed in all fifteen archaeal genomes, with a few exceptions
(Slesarev et al. 2002). Subunit F
(identified only with the profile-based method) is not observed in three
genomes; however, its eukaryotic functional and structural equivalent,
Rpb4 (Werner et al. 2000), is not
essential for cell viability in yeast
(Woychik and Young 1989). The
other two subunits that were not found in all of the analysed genomes
are K and L, which appear to be absent only from P.
horikoshii as homologues of L are detected by the TRIBE-MCL
clustering in P. aerophilum and T.
volcanium. These data imply that RNA polymerase is very highly
conserved in the Archaea.
Transcription initiation in archaea resembles the eukaryotic process in
its requirement for TBP, TFB (the archaeal version of eukaryotic TFIIB)
(Soppa 1999,
Reeve 2003) and TFE (an archaeal
protein homologous to the N-terminal region of the a subunit of
eukaryotic TFIIE) (Bell et al.
2001b, Hanzelka
et al. 2001). Genes encoding these three basal transcription
factors are present in all of the genomes, with the only notable absence
being that of TFE from T. acidophilum and T.
volcanium. However, no archaeal genes encoding homologues of
TAFs (Qureshi et al. 1997) or
TFIIE (subunit b), TFIIF and TFIIH
(Ouhammouch 2004) were
identified. TBP is usually encoded by a single copy gene, though
Halobacterium sp. contains eleven copies
(Ng et al. 2000), with three
genes in M. acetivorans and M. mazei
coding for this canonical transcription factor. The
Halobacterium genome also contains multiple copies of
TFB-encoding genes (seven), as do half of the other genomes that contain
either two or three copies. Hence, the identification of multiple copies
of TBP and TFB genes in a range of archaeal species supports the
hypothesis that different types of transcriptional regulation occur in
response to environmental challenges
(Goo et al. 2004), although the
role of multiple TBPs and their relationship with multiple TFBs is not
yet clear (Galagan et al. 2002).
The archaeal nucleosome appears to display extensive taxon variablity:
with the exception of the Thermoplasma (whose genomes
encode bacterial HU-like proteins), the Euryarchaeota use homologues of
histones H3 and H4 to assemble chromatin
(Geiduschek and Ouhammouch 2005),
contrasting with the Crenarchaeota where completely different sets of
proteins compact the genome (e.g., Alba in S.
solfataricus (Bell et al.
2002)). It is interesting to note, given the apparent lack of
histones in Thermoplasma, that no homologues of members
of the HDAC family, Sir2 or TFE are present in the genomes of T.
acidophilum and T. volcanium. Sir2 homologues
are present in all the Crenarchaeotal genomes, but only in the same four
Euryarchaeotal genomes as NusB. This may suggest some association
between RNA chain elongation and histone deacetylation, as both TFIIE
and NusB function in the elongation process, and in eukaryotes many
factors can influence transcript elongation on chromatin templates
(Sims et al. 2004). This
heterogeneity in the constituents of archaeal chromatin is unexpected as
trypanosomatids contain the four core histones, and possess a range of
chromatin-remodeling activities, also present in mammals even though
there is considerable evolutionary distance between the two eukaryotic
taxa (Ivens et al. 2005).
The elevated abundance in euryarchaeal genomes of proteins associated
with the transcriptional process arises, predominately, from the
increased number (when genome size is accounted for) of sequences
homologous to bacterial transcriptional regulators within these genomes
(Figure 2
Nearly all the Crenarchaeota studied have shown a high degree of niche
specialization (most are hyperthermophiles) and, like parasitic
protozoa, which occupy only a limited range of environments, appear to
contain far fewer proteins that regulate transcription than do organisms
adapted to a wider range of environmental conditions
(Coulson et al. 2004,
Ivens et al. 2005). At present, a
lack of experimental data makes it impossible to determine whether the
Crenarchaeota possess significant numbers of lineage-specific
transcriptional regulators or whether they just contain fewer of the
bacterial-type compared with the Euryarchaeota. Nonetheless, these
results are consistent with previous genome profiling studies of
transcriptional complements, which show that the greater the
evolutionary separation between taxa, the greater the
taxon-restrictedness of the TAP-families
(Coulson and Ouzounis 2003).
Table S1. Number of archaeal genes matching domains present in RNA polymerase subunits.
Click here for additional data file.(189K, pdf)
Table S2. RNA polymerase subunit gene copy number in archaeal
genomes.
Click here for additional data file.(182K, pdf) Acknowledgments
R.M.R.C. and C.A.O. thank the Medical Research Council for supporting
this work through a Special Training Fellowship in Bioinformatics to
R.M.R.C.
References R1. Altschul S.F., Madden T.L., Schaffer A.A., Zhang J., Zhang Z., Miller W., Lipman D.J. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–3402. [PubMed] R2. Baliga N.S., Goo Y.A., Ng W.V., Hood L., Daniels C.J., Dassarma S. Is gene expression in Halobacterium NRC-1 regulated by multiple TBP and TFB transcription factors? Mol. Microbiol. 2000;36:1184–1185. [PubMed] R3. Bateman A., Coin L., Durbin R., et al. The Pfam protein families database. Nucleic Acids Res. 2004;32 Database issue:D:138–141. R4. Bell S.D., Magill C.P., Jackson S.P. Basal and regulated transcription in Archaea. Biochem. Soc. Trans. 2001;29:392–395. [PubMed] R5. Bell S.D., Brinkman A.B., Van der oost J., Jackson S.P. The archaeal TFIIEalpha homologue facilitates transcription initiation by enhancing TATA-box recognition. EMBO Rep. 2001;2:133–138. [PubMed] R6. Bell S.D., Botting C.H., Wardleworth B.N., Jackson S.P., White M.F. The interaction of Alba, a conserved archaeal chromatin protein, with Sir2 and its regulation by acetylation. Science. 2002;296:148–151. [PubMed] R7. Brochier C., Forterre P., Gribaldo S. Archaeal phylogeny based on proteins of the transcription and translation machineries: tackling the Methanopyrus kandleri paradox. Genome Biol. 2004;5:R17.01–12. [PubMed] R8. Cases I., De lorenzo V., Ouzounis C.A. Transcription regulation and environmental adaptation in bacteria. Trends Microbiol. 2003;11:248–253. [PubMed] R9. Coulson R.M., Ouzounis C.A. The phylogenetic diversity of eukaryotic transcription. Nucleic Acids Res. 2003;31:653–660. [PubMed] R10. Coulson R.M., Enright A.J., Ouzounis C.A. Transcription-associated protein families are primarily taxon-specific. Bioinformatics. 2001;17:95–97. [PubMed] R11. Coulson R.M., Hall N., Ouzounis C.A. Comparative genomics of transcriptional control in the human malaria parasite Plasmodium falciparum
. Genome Res. 2004;14:1548–1554. [PubMed] R12. DeLong E.F. Extreme genomes. Genome Biol. 2000;1:r10291–10293. R13. Eckburg P.B., Lepp P.W., Relman D.A. Archaea and their potential role in human disease. Infect. Immun. 2003;71:591–596. [PubMed] R14. Eddy S.R. Profile hidden Markov models. Bioinformatics. 1998;14:755–763. [PubMed] R15. Enright A.J., Van dongen S., Ouzounis C.A. An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res. 2002;30:1575–1584. [PubMed] R16. Etzold T., Ulyanov A., Argos P. SRS: information retrieval system for molecular biology data banks. Methods Enzymol. 1996;266:114–128. [PubMed] R17. Galagan J.E., Nusbaum C., Roy A., et al. The genome of M. acetivorans reveals extensive metabolic and physiological diversity. Genome Res. 2002;12:532–542. [PubMed] R18. Geiduschek E.P., Ouhammouch M. Archaeal transcription and its regulators. Mol. Microbiol. 2005;56:1397–1407. [PubMed] R19. Goo Y.A., Roach J., Glusman G., et al. Low-pass sequencing for microbial comparative genomics. BMC Genomics. 2004;5:3. [PubMed] R20. Gralla J.D. Activation and repression of E. coli promoters. Curr. Opin. Genet. Dev. 1996;6:526–530. [PubMed] R21. Hanzelka B.L., Darcy T.J., Reeve J.N. TFE, an archaeal transcription factor in Methanobacterium thermoautotrophicum related to eucaryal transcription factor TFIIEalpha. J. Bacteriol. 2001;183:1813–1818. [PubMed] R22. Ivens A.C., Peacock C.S., Worthey E.A., et al. The genome of the kinetoplastid parasite, Leishmania major
. Science. 2005;309:436–442. [PubMed] R23. Janssen P., Enright A.J., Audit B., Cases I., Goldovsky L., Harte N., Kunin V., Ouzounis C.A. COmplete GENome Tracking (COGENT): a flexible data environment for computational genomics. Bioinformatics. 2003;19:1451–1452. [PubMed] R24. Kyrpides N.C., Ouzounis C.A. The eubacterial transcriptional activator Lrp is present in the archaeon Pyrococcus furiosus
. Trends Biochem. Sci. 1995;20:140–141. [PubMed] R25. Kyrpides N.C., Ouzounis C.A. Bacterial sigma 70 transcription factor DNA-binding domains in the archaeon Methanococcus jannaschii
. J. Mol. Evol. 1997;45:706–707. [PubMed] R26. Kyrpides N.C., Ouzounis C.A. Transcription in archaea. Proc. Natl. Acad. Sci. USA. 1999;96:8545–8550. [PubMed] R27. Ng W.V., Kennedy S.P., Mahairas G.G., et al. Genome sequence of Halobacterium species NRC-1. Proc. Natl. Acad. Sci. USA. 2000;97:12176–12181. [PubMed] R28. Ouhammouch M. Transcriptional regulation in Archaea. Curr. Opin. Genet. Dev. 2004;14:133–138. [PubMed] R29. Ouzounis C.A., Kyrpides N.C. ThiD-TenA: a gene pair fusion in eukaryotes. J. Mol. Evol. 1997;45:708–711. [PubMed] R30. Ouzounis C., Sander C. TFIIB, an evolutionary link between the transcription machineries of archaebacteria and eukaryotes. Cell. 1992;71:189–190. [PubMed] R31. Promponas V.J., Enright A.J., Tsoka S., Kreil D.P., Leroy C., Hamodrakas S., Sander C., Ouzounis C.A. CAST: an iterative algorithm for the complexity analysis of sequence tracts. Bioinformatics. 2000;16:915–922. [PubMed] R32. Ptashne M., Gann A. Transcriptional activation by recruitment. Nature. 1997;386:569–577. [PubMed] R33. Puhler G., Leffers H., Gropp F., Palm P., Klenk H.P., Lottspeich F., Garrett R.A., Zillig W. Archaebacterial DNA-dependent RNA polymerases testify to the evolution of the eukaryotic nuclear genome. Proc. Natl. Acad. Sci. USA. 1989;86:4569–4573. [PubMed] R34. Qureshi S.A., Bell S.D., Jackson S.P. Factor requirements for transcription in the archaeon Sulfolobus shibatae
. EMBO J. 1997;16:2927–2936. [PubMed] R35. Reeve J.N. Archaeal chromatin and transcription. Mol. Microbiol. 2003;48:587–598. [PubMed] R36. Rowlands T., Baumann P., Jackson S.P. The TATA-binding protein: a general transcription factor in eukaryotes and archaebacteria. Science. 1994;264:1326–1329. [PubMed] R37. Sagi D., Friedman N., Vorgias C., Oppenheim A.B., Stavans J. Modulation of DNA conformations through the formation of alternative high-order HU–DNA complexes. J. Mol. Biol. 2004;341:419–428. [PubMed] R38. Sims R.J., Belotserkovskaya R., Reinberg D. Elongation by RNA polymerase II: the short and long of it. Genes Dev. 2004;18:2437–2468. [PubMed] R39. Slesarev A.I., Mezhevaya K.V., Makarova K.S., et al. The complete genome of hyperthermophile Methanopyrus kandleri AV19 and monophyly of archaeal methanogens. Proc. Natl. Acad. Sci. USA. 2002;99:4644–4649. [PubMed] R40. Soppa J. Transcription initiation in Archaea: facts, factors and future aspects. Mol. Microbiol. 1999;31:1295–1305. [PubMed] R41. Struhl K. Fundamentally different logic of gene regulation in eukaryotes and prokaryotes. Cell. 1999;98:1–4. [PubMed] R42. Werner F., Weinzierl R.O. A recombinant RNA polymerase II-like enzyme capable of promoter-specific transcription. Mol. Cell. 2002;10:635–646. [PubMed] R43. Werner F., Eloranta J.J., Weinzierl R.O. Archaeal RNA polymerase subunits F and P are bona fide homologs of eukaryotic RPB4 and RPB12. Nucleic Acids Res. 2000;28:4299–4305. [PubMed] R44. Woychik N.A., Young R.A. RNA polymerase II subunit RPB4 is essential for high- and low-temperature yeast cell growth. Mol. Cell. Biol. 1989;9:2854–2859. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||
Nature. 1997 Apr 10; 386(6625):569-77.
[Nature. 1997]Infect Immun. 2003 Feb; 71(2):591-6.
[Infect Immun. 2003]Proc Natl Acad Sci U S A. 1989 Jun; 86(12):4569-73.
[Proc Natl Acad Sci U S A. 1989]Mol Cell. 2002 Sep; 10(3):635-46.
[Mol Cell. 2002]Cell. 1999 Jul 9; 98(1):1-4.
[Cell. 1999]Proc Natl Acad Sci U S A. 1989 Jun; 86(12):4569-73.
[Proc Natl Acad Sci U S A. 1989]Cell. 1992 Oct 16; 71(2):189-90.
[Cell. 1992]Science. 1994 May 27; 264(5163):1326-9.
[Science. 1994]Proc Natl Acad Sci U S A. 1999 Jul 20; 96(15):8545-50.
[Proc Natl Acad Sci U S A. 1999]Trends Biochem Sci. 1995 Apr; 20(4):140-1.
[Trends Biochem Sci. 1995]Nucleic Acids Res. 2003 Jan 15; 31(2):653-60.
[Nucleic Acids Res. 2003]Genome Res. 2004 Aug; 14(8):1548-54.
[Genome Res. 2004]Methods Enzymol. 1996; 266():114-28.
[Methods Enzymol. 1996]Bioinformatics. 2003 Jul 22; 19(11):1451-2.
[Bioinformatics. 2003]Nucleic Acids Res. 1997 Sep 1; 25(17):3389-402.
[Nucleic Acids Res. 1997]Bioinformatics. 2000 Oct; 16(10):915-22.
[Bioinformatics. 2000]Nucleic Acids Res. 2002 Apr 1; 30(7):1575-84.
[Nucleic Acids Res. 2002]Nucleic Acids Res. 2003 Jan 15; 31(2):653-60.
[Nucleic Acids Res. 2003]Bioinformatics. 1998; 14(9):755-63.
[Bioinformatics. 1998]Nucleic Acids Res. 2003 Jan 15; 31(2):653-60.
[Nucleic Acids Res. 2003]Nucleic Acids Res. 2002 Apr 1; 30(7):1575-84.
[Nucleic Acids Res. 2002]Methods Enzymol. 1996; 266():114-28.
[Methods Enzymol. 1996]Genome Biol. 2004; 5(3):R17.
[Genome Biol. 2004]Mol Microbiol. 2000 Jun; 36(5):1184-5.
[Mol Microbiol. 2000]Genome Biol. 2004; 5(3):R17.
[Genome Biol. 2004]J Mol Biol. 2004 Aug 6; 341(2):419-28.
[J Mol Biol. 2004]Trends Microbiol. 2003 Jun; 11(6):248-53.
[Trends Microbiol. 2003]Proc Natl Acad Sci U S A. 2002 Apr 2; 99(7):4644-9.
[Proc Natl Acad Sci U S A. 2002]Nucleic Acids Res. 2000 Nov 1; 28(21):4299-305.
[Nucleic Acids Res. 2000]Mol Cell Biol. 1989 Jul; 9(7):2854-9.
[Mol Cell Biol. 1989]Mol Microbiol. 1999 Mar; 31(5):1295-305.
[Mol Microbiol. 1999]Mol Microbiol. 2003 May; 48(3):587-98.
[Mol Microbiol. 2003]EMBO Rep. 2001 Feb; 2(2):133-8.
[EMBO Rep. 2001]J Bacteriol. 2001 Mar; 183(5):1813-8.
[J Bacteriol. 2001]EMBO J. 1997 May 15; 16(10):2927-36.
[EMBO J. 1997]Mol Microbiol. 2005 Jun; 56(6):1397-407.
[Mol Microbiol. 2005]Science. 2002 Apr 5; 296(5565):148-51.
[Science. 2002]Genes Dev. 2004 Oct 15; 18(20):2437-68.
[Genes Dev. 2004]Science. 2005 Jul 15; 309(5733):436-42.
[Science. 2005]Mol Microbiol. 2005 Jun; 56(6):1397-407.
[Mol Microbiol. 2005]Cell. 1999 Jul 9; 98(1):1-4.
[Cell. 1999]Genome Res. 2004 Aug; 14(8):1548-54.
[Genome Res. 2004]Science. 2005 Jul 15; 309(5733):436-42.
[Science. 2005]Nucleic Acids Res. 2003 Jan 15; 31(2):653-60.
[Nucleic Acids Res. 2003]