![]() | ![]() |
Formats:
|
||||||||||||||||
Copyright © 2007 by The National Academy of Sciences of the USA Genetics Chromosomal periodicity of evolutionarily conserved gene pairs *Department of Genetics, †Harvard–Massachusetts Institute of Technology Division of Health Sciences and Technology, and ‡Harvard–Partners Center for Genetics and Genomics, Harvard Medical School, Boston, MA 02115; and §Departments of Biology and Biomedical Engineering, and Bioinformatics Program, Boston University, Boston, MA 02215 ¶To whom correspondence should be addressed. E-mail: dsegre/at/bu.edu Edited by John R. Roth, University of California, Davis, CA, and approved May 11, 2007 Author contributions: M.A.W., G.M.C., and D.S. designed research; M.A.W., P.K., and D.S. performed research; P.K. contributed new reagents/analytic tools; M.A.W., P.K., G.M.C., and D.S. analyzed data; and M.A.W. and D.S. wrote the paper. Received December 6, 2006. Freely available online through the PNAS open access option. This article has been cited by other articles in PMC.Abstract Chromosomes are compacted hundreds of times to fit in the cell, packaged into dynamic folds whose structures are largely unknown. Here, we examine patterns in gene locations to infer large-scale features of bacterial chromosomes. Specifically, we analyzed >100 genomes and identified thousands of gene pairs that display two types of evolutionary correlations: a tendency to co-occur and a tendency to be located close together in many genomes. We then analyzed the detailed distribution of these pairs in Escherichia coli and found that genes in a pair tend to be separated by integral multiples of 117 kb along the genome and to be positioned in a 117-kb grid of genomic locations. In addition, the most pair-dense locations coincide with regions of intense transcriptional activity and the positions of top transcribed and conserved genes. These patterns suggest that the E. coli chromosome may be organized into a 117-kb helix-like topology that localizes a subset of the most essential and highly transcribed genes along a specific face of this structure. Our approach indicates an evolutionarily maintained preference in the spacing of genes along the chromosome and offers a general comparative genomics framework for studying chromosome structure, broadly applicable to other organisms. Keywords: chromosome structure, computational genomics, nucleoid, spatial organization The interplay between structure and function of chromosomes is a critical aspect of spatial organization in the cell, intimately involved in transcription (1–3), recombination (4), and replication (1, 2). Despite this importance, the detailed structure of a chromosome in any organism is unknown. Bacteria offer appealing systems in which to study the fundamental factors governing chromosome structure, because they exhibit exquisite spatial organization and contain functional homologs of many eukaryotic DNA-associated proteins, yet they have a small genome generally packaged in a single circular chromosome (2, 5, 6). Bacterial chromosomes must be compacted 1,000-fold to fit within the cell. The resulting structures could therefore be highly disordered; for example, 10 kb of uncompacted DNA (1/400th of the genome) could span the entire cell. However, in vivo the chromosome exhibits a high degree of order. At a local level, it is wound into ≈10-kb supercoiled domains that topologically isolate different regions of the genome from each other (7, 8). At larger scales, certain regions of the genome are physically inaccessible to each other, suggesting that loci undergo limited diffusion (9). More recently, fluorescence microscopy has shown that loci are not randomly positioned in the cell but occupy reproducible 3D positions that undergo specific cell-cycle movements (10–14). In Escherichia coli and Caulobacter crescentus, evidence suggests furthermore that the positions of loci in the cell are linearly correlated with their coordinate along the genome, with the origin and terminus at opposite cell poles (11, 12). In E. coli, recent data confirm this linear correlation but suggest the origin is located at midcell, with the two arcs of the chromosome in two different (longitudinal) halves of the cell (13, 14). At a finer scale, below the resolution of current confocal microscopy, positional correlations and periodicities in sequence (15), expression levels (16–19), and transcription factor-binding sites (17) suggest a functionally important, possibly regular chromosome conformation. However, beyond coarse high-level outlines, the structure has been largely inaccessible to experiment and remains largely unknown. Here, we approach the problem of chromosome structure from an evolutionary perspective. Our method, based on comparative genomics, is similar to statistical coupling analysis in proteins (20). In proteins, the 3D arrangement of specific residues is critical for function; for example, the WW protein domain contains a small 3D network of residues that is crucial for both folding and function (20). Maintaining this arrangement constrains the identities of the amino acids at the involved residues and causes them to coevolve, generating statistical correlations in a multiple sequence alignment (20). In the chromosome, we reasoned analogously that if a particular 3D arrangement of genes is critical for function, such genes would tend to occupy genomic locations where they can achieve this arrangement in the folded chromosome (Fig. 1
In our analysis, we explore this hypothesis by analyzing a large set of statistically correlated (SC) gene pairs. We first select a set of correlated pairs. We then investigate patterns in the position and distance distributions of these pairs along the genome of E. coli. We next explore the functional basis of these distributions by analyzing the levels of conservation and transcription along the genome. Finally, we discuss possible implications of these distributions in terms of geometrical features of the E. coli chromosome fold. Results To begin, we identified a large set of strongly correlated gene pairs. Specifically (Fig. 1 Based on these criteria, we searched across 10 million gene pairs in >100 genomes and selected 22,500 strongly SC pairs [supporting information (SI) Methods and SI Table 1; Fig. 1 We first investigated properties of the SC pairs across all organisms and found that the genes in a pair exhibit a strong preference for positions that are symmetric about the origin of replication (SI Fig. 6). This symmetry is consistent with fluorescence microscopy in C. crescentus (10–12) and with observations of symmetry in genome alignments and gene order (24–26). We next examined the detailed genomic organization of the SC pairs in a single organism, E. coli. First, we analyzed the distribution of distances, i.e., the number of times a particular distance separates genes in an SC pair along each chromosome arc (defined by the origin and terminus of replication; see SI Fig. 6a). Because the SC genes were chosen based on their tendency for closeness in bacterial genomes, the expectation (under a null hypothesis of otherwise randomly positioned genes) is that most distances will be close to zero, and that distances larger than zero will taper smoothly to zero (see SI Fig. 7a). In E. coli, however, we observe a markedly different pattern (Fig. 2
Many different distributions of locations for the SC genes in E. coli could generate the above distance distribution (Fig. 2
To explore the functional basis of these distributions, we examined the relationship between SC genes and transcription in E. coli. Transcription has long been hypothesized to play a role in condensing the chromosome (3). In addition, many of the genes that belong to a large number of SC pairs are known to be highly transcribed. We found that the pair density mirrors the log-phase transcript level (16) along the chromosome (Pearson correlation coefficient R = 0.67, P < 10−44) (Fig. 3 We next sought to understand this link in more detail, in particular the relationship between the periodicities in the SC pairs and the positioning of highly transcribed genes in E. coli; in addition, because the SC pairs were chosen by using orthologs conserved over many genomes, we simultaneously examined the connection between SC pairs and the positioning of highly conserved genes. We therefore constructed two new pair sets in which the gene pairs were selected randomly from E. coli by using probabilities proportional to their level of transcription (transcription pairs) or conservation (conservation pairs) (see Methods). The distance distributions of these new pair sets are therefore enriched in distances that separate highly transcribed or conserved genes along the chromosome, allowing us to examine preferences in the chromosomal spacing of these genes in a manner similar to the SC pairs. We first compared the distance distributions of these two new pair sets with the SC pairs. In contrast to the SC pairs, we found no periodicity in conservation pairs (Fig. 4
However, as we gradually restrict the pair sets to the top genes in each set (i.e., the genes with the highest levels of transcription, conservation, or number of SC pairings), a 117-kb periodicity steadily emerges in both transcription and conservation and grows stronger in all three pair sets (Fig. 4 Discussion The SC gene pairs, which were chosen based only on their evolutionary patterns of chromosomal proximity and co-occurrence across many species, therefore display a strong 117-kb periodicity in genomic distances and locations in E. coli. In addition, the density of SC pairs is highly correlated with transcription levels. Could simple constraints on the sizes or sites of genome rearrangements generate these patterns? General models based on repulsion of gene clusters or preferred sizes for recombination or horizontal gene transfer could account for local spacing along the chromosome; for example, a strong preferred recombination distance of ≈117 kb could generate several gene clusters spaced at 117 kb by splitting a single initial gene cluster by 117-kb recombination events. However, the clusters generated from splitting two different initial clusters would not naturally be in phase with each other. In general, such local constraints cannot easily explain a global periodicity of positions that extends in almost perfect phase along each half of the genome. Even an extreme case of recombination hotspots spaced at n × 117 kb along the chromosome could maintain the 117-kb periodicity only if the SC paired genes were constrained to the very center or edges of each 117-kb stretch. Otherwise, a single inversion would destroy the periodicity. In addition, any horizontal gene transfer would disrupt the periodicity unless the fragment were small ( 117 kb) or ≈117 kb long with SC genes at the center or edges. We cannot rule out the possibility that such rearrangement processes contribute to the observed patterns, e.g., symmetric inversions about the origin of replication (24–26) could explain the observed symmetry of SC paired genes. However, the localization of SC genes in an in-phase set of periodically spaced islands suggests that some selective pressure beyond these processes maintains these genes at these specific locations.Structural constraints due to the spatial organization of the chromosome offer a simple explanation. In-phase positional periodicities in amino acid sequences are a canonical structural motif seen in proteins, where they indicate the presence of a specific face on a periodic structure (27), for example, the face of hydrophobic residues in contact with the membrane in a transmembrane domain (27). Because of the regular period, these spatially contiguous structural faces are composed of residues that are separated by periodic intervals along the sequence. In the E. coli chromosome, the periodic distributions suggest an analogous structural organization, a regular 117-kb looping, and a single structural face of each chromosome half, along which SC pairs are predominantly localized. In Fig. 5
We evaluated the agreement of the SC gene pairs with the structures above by using the 3D distances between the pairs as a metric, while varying the size of the loops. We found a helical period of 117 kb to be the optimum for both arcs (SI Fig. 14). Two important properties emerge from these structural representations. First, the 117-kb looping causes a dramatic concentration of the pair dense regions in space, in a few patches along each face, consistent with spatial colocalization of subsets of SC paired genes. Second, many pairs, for example those separated by >1 Mb, are not physically close on the structure but share the feature of localization in the pair-dense faces. This suggests that the correlations in the SC pairs reflect confinement to these faces, rather than mere physical distance. This would be analogous to the correlations observed in protein residues, which often reflect important substructures regardless of spatial vicinity (20). If the distributions of the SC pairs are the product of a structural periodicity and the localization of SC pairs along specific structural faces of the E. coli chromosome, what could be responsible for these features? Given the correlation between SC pairs and transcription, transcription is an attractive possibility for causing localization: the localization of certain highly transcribed genes along the structural faces (see helical moments in Fig. 5 Spatial localization of highly transcribed genes alone, however, would not generate periodicity. Rather, periodicity requires a second constraint, a regular loop size analogous to the 3.5-residue turn of an α-helix. This suggests an intrinsic property of the chromosome or of its binding proteins (e.g., H-NS MukBEF) (6); for example, the association of chromosomal DNA with proteins that induce a regular curvature would create periodic loops. Helices are also known to be the energetically optimal way of confining a string to certain geometrical spaces (31); thus, a 117-kb looping may be the spontaneous outcome of physicochemical properties including macromolecular crowding, supercoiling, DNA persistence length, and cell dimension. The bacterial chromosome, however, is known to be a highly dynamic structure (2, 3, 12, 13) and the proposed models (Fig. 5 The relationship of the proposed features to existing experimental data also bears discussion. These features are consistent with confocal microscopy (10, 12, 13), recombination (32), transposon insertion (33), and atomic force microscopy (34). In addition, our findings may yield insight into previously observed periodicities of 96 kb in transcription factor-binding sites (17), 90–120 kb in wavelet analysis (15, 16), and 115 kb in transcription levels (18, 19) in E. coli. In particular, our analysis suggests that these periodicities reflect an in-phase 117-kb grid, occupied by top-transcribed, top-conserved, and SC gene pairs. The significance of the pattern in the SC pairs may be due to the special vantage point of comparative genomics, where loci are identified based on the combined results of evolution acting on multiple genomes. Independent of structure, our approach reveals significant gene organization on the chromosome. More general comparative analyses of how genes, gene pairs, or higher multiplets of genes are positioned in the genome should yield further insight into chromosome architecture. Similar methods should also be applicable to eukaryotes. The particular structural faces we propose and the chromosome-wide structural periodicity make specific predictions, which must be tested experimentally. To this effect, we are examining other bacteria for similar patterns (see SI Fig. 15 for C. crescentus, which displays a similar strong periodicity at 113 kb) and have developed a multiplex method of chromosome conformation capture (3C) (35) to measure the distances between thousands of chromosomal loci simultaneously at the resolution of our model. Ultimately, genome sequences and their structures may be highly interdependent aspects of a single finely tuned system. Evolutionary conservation should provide a powerful means of unraveling this interdependence. Methods Selection of SC Gene Pairs. We selected the SC gene pairs based on evolutionary preference for chromosomal proximity and phylogenetic co-occurrence across many genomes, as explained in Fig. 1 Genomic Data. The genomes were obtained from GenBank and consisted of 105 bacterial and three eukaryotic genomes (Saccharomyces cerevisiae, Schizosaccharomyces pombe, and Caenorhabditis elegans, which were included to represent particularly distant species). Chromosomal Proximity. For a pair of genes x and y, we calculated the tendency toward chromosomal proximity by using the difference in the order in which genes appear along the chromosome (gene-order difference) (23). We evaluated the probability
To correct for the variable phylogenetic divergence of query genomes, we constructed a UPGMA (36) phylogenetic tree based on a phylogenetic distance ϕ(g1, g2) between genomes g1 and g2. Note that the use of an alternative phylogenetic reconstruction method (neighbor-joining) does not affect our conclusions. We used a phylogenetic distance based on gene content (37), specifically, the mutual information between E. coli ortholog occurrence vectors in two genomes. The probabilities Pg from each genome were weighted based on the phylogenetic tree, by using an approach similar to the method of phylogenetic contrasts (36) (see SI Methods for details). The orthology mapping was established by using best bidirectional orthologs from Kyoto Encyclopedic of Genes and Genomes (KEGG) Sequence Similarity Database (www.genome.ad.jp/kegg/ssdb). Phylogenetic Co-Occurrence. Phylogenetic profile cooccurrence probability was calculated by using the extended hypergeometric distribution method described in Kharchenko et al. (38), which also includes a correction for the phylogenetic divergence. The orthologs were determined by using best bidirectional BLASTP hits against National Center for Biotechnology Information NR protein data set. Organisms containing orthologs for <1% of E. coli genes were excluded from calculations. Distributions of Distances and Positions and Fourier Transform. We constructed a histogram of the distances between genes for all SC pairs in E. coli. The histogram was transformed into a continuous probability density by using a Gaussian smoothing window (σ = 4 kb) and normalizing the total density over the entire genome to 1. A discrete Fourier transform of the data were computed from 0 to 1,000 kb by using a Tukey window to taper the ends (ratio of 0.5 for tapered to untapered length). The periodicity is independent of the maximum distance value. We calculated the statistical significance by repeating the smoothing and Fourier analysis on 10,000 randomizations in which the positions of the operons involving SC paired genes [determined from Price et al. (39)] were randomized within their chromosomal arc. The P value was determined by counting the number of randomizations with a Fourier peak as strong as or stronger than the 117-kb SC pair peak. The density of SC pairs was computed by counting the number of SC pairs involving genes at each position along the chromosome, smoothing with the Gaussian window (σ = 8 kb), and normalizing by the overall gene density. The 1D grid is defined as a set of positions nτ + p along the chromosome, where τ is the spacing between grid points (the period), p is the offset (or phase) (set separately for each arc), and n is an integer. We evaluate the fit of the distributions to the grid using the sum of the distances of each peak to the nearest grid point (over all choices of p for each τ) as the error measure (see SI Fig. 8). Expression Correlation. We calculated an average of the absolute transcript level for wild-type standard growth conditions (4-morpholinepropanesulfonic acid minimal glucose, doubling time 2–8 h) using 5 Affymetrix (Santa Clara, CA) microarrays data sets extracted from the ASAP database [www.genome.wisc.edu/tools/asap.htm, Allen et al. (16)]. These data were smoothed by using a Gaussian window σ = 6 kb and normalized by the overall gene density as above. We calculated the Pearson correlation coefficient of the smoothed data with the pair position density, sampling once every 12 kb to avoid smoothing artifacts (and averaging over all choices of the sampling phase). P was computed by using Student's t test with n−2 degrees of freedom (where n is the number of data points). Transcription and Conservation Pair Sets. We constructed pair sets based on the levels of transcription (Ti) and conservation (Ci) of genes in E. coli (GE.coli), with i GE.coli by using log-phase transcript level from Allen et al. (16) for transcription and the number of orthologs of a gene (using best bidirectional orthologs from KEGG Sequence Similarity Database) for conservation. Each pair in the transcription pair set was chosen by randomly selecting two genes from GE.coli, where the probability of selecting gene i is pi = Ti/Ttot, with Ttot = ΣTi. Similarly, for selecting pairs in the conservation pair set we used probabilities pi = Ci/Ctot. Distance distributions and Fourier spectra were calculated as for the SC pairs.Pair sets limited to the top k transcribed genes were created by choosing i GE.coli(k, T), where GE.coli(k, T) is the set of top k transcribed genes. Similarly, we defined pairs for the top k conserved genes by sampling from a subset GE.coli(k, C) and for the top k SC genes by taking the subset of the initial SC pairs in which both genes are elements of GE.coli(k, SC), the set of k genes most represented in the initial SC pair set.Supporting Information
Acknowledgments We thank F. J. Isaacs, D. Peer, N. Reppas, J. Shendure, M. Umbarger, and I. Yanai for advice and critical reading of the manuscript. Part of this work was funded by the Department of Energy and National Institutes of Health Grant P50 GM068763. D.S. is also a faculty scholar at Lawrence Livermore National Laboratory. Footnotes The authors declare no conflict of interest. This article is a PNAS Direct Submission. This article contains supporting information online at www.pnas.org/cgi/content/full/0610776104/DC1. References 1. Chakalova L, Debrand E, Mitchell JA, Osborne CS, Fraser P. Nat Rev Genet. 2005;6:669–677. [PubMed] 2. Travers A, Muskhelishvili G. Curr Opin Genet Dev. 2005;15:507–514. [PubMed] 3. Cook PR. Nat Genet. 2002;32:347–352. [PubMed] 4. Branco MR, Pombo A. PLoS Biol. 2006;4:e138. [PubMed] 5. Bates D, Kleckner N. Cell. 2005;121:899–911. [PubMed] 6. Dame RT. Mol Microbiol. 2005;56:858–870. [PubMed] 7. Postow L, Hardy CD, Arsuaga J, Cozzarelli NR. Genes Dev. 2004;18:1766–1779. [PubMed] 8. Higgins NP, Yang X, Fu Q, Roth JR. J Bacteriol. 1996;178:2825–2835. [PubMed] 9. Segall A, Mahan MJ, Roth JR. Science. 1988;241:1314–1318. [PubMed] 10. Teleman AA, Graumann PL, Lin DC, Grossman AD, Losick R. Curr Biol. 1998;8:1102–1109. [PubMed] 11. Niki H, Yamaichi Y, Hiraga S. Genes Dev. 2000;14:212–223. [PubMed] 12. Viollier PH, Thanbichler M, McGrath PT, West L, Meewan M, McAdams HH, Shapiro L. Proc Natl Acad Sci USA. 2004;101:9257–9262. [PubMed] 13. Wang X, Liu X, Possoz C, Sherratt DJ. Genes Dev. 2006;20:1727–1731. [PubMed] 14. Nielsen HJ, Ottesen JR, Youngren B, Austin SJ, Hansen FG. Mol Microbiol. 2006;62:331–338. [PubMed] 15. Allen TE, Price ND, Joyce AR, Palsson BO. PLoS Comput Biol. 2006;2:e2. [PubMed] 16. Allen TE, Herrgard MJ, Liu M, Qiu Y, Glasner JD, Blattner FR, Palsson BO. J Bacteriol. 2003;185:6392–6399. [PubMed] 17. Kepes F. J Mol Biol. 2004;340:957–964. [PubMed] 18. Jeong KS, Ahn J, Khodursky AB. Genome Biol. 2004;5:R86. [PubMed] 19. Carpentier AS, Torresani B, Grossmann A, Henaut A. BMC Genomics. 2005;6:84. [PubMed] 20. Socolich M, Lockless SW, Russ WP, Lee H, Gardner KH, Ranganathan R. Nature. 2005;437:512–518. [PubMed] 21. Vakoc CR, Letting DL, Gheldof N, Sawado T, Bender MA, Groudine M, Weiss MJ, Dekker J, Blobel GA. Mol Cell. 2005;17:453–462. [PubMed] 22. Pellegrini M, Marcotte EM, Thompson MJ, Eisenberg D, Yeates TO. Proc Natl Acad Sci USA. 1999;96:4285–4288. [PubMed] 23. Huynen MA, Bork P. Proc Natl Acad Sci USA. 1998;95:5849–5856. [PubMed] 24. Eisen JA, Heidelberg JF, White O, Salzberg SL. Genome Biol. 2000;1:RESEARCH0011. [PubMed] 25. Makino S, Suzuki M. Science. 2001;292:803. [PubMed] 26. Tillier ER, Collins RA. Nat Genet. 2000;26:195–197. [PubMed] 27. Eisenberg D, Weiss RM, Terwilliger TC. Proc Natl Acad Sci USA. 1984;81:140–144. [PubMed] 28. Cabrera JE, Jin DJ. Mol Microbiol. 2003;50:1493–1505. [PubMed] 29. Liu M, Durfee T, Cabrera JE, Zhao K, Jin DJ, Blattner FR. J Biol Chem. 2005;280:15921–15927. [PubMed] 30. Woldringh CL. Mol Microbiol. 2002;45:17–29. [PubMed] 31. Maritan A, Micheletti C, Trovato A, Banavar JR. Nature. 2000;406:287–290. [PubMed] 32. Valens M, Penaud S, Rossignol M, Cornet F, Boccard F. EMBO J. 2004;23:4330–4341. [PubMed] 33. Manna D, Breier AM, Higgins NP. Proc Natl Acad Sci USA. 2004;101:9780–9785. [PubMed] 34. Kim J, Yoshimura SH, Hizume K, Ohniwa RL, Ishihama A, Takeyasu K. Nucleic Acids Res. 2004;32:1982–1992. [PubMed] 35. Dostie J, Richmond TA, Arnaout RA, Selzer RR, Lee WL, Honan TA, Rubio ED, Krumm A, Lamb J, Nusbaum C, Green RD, Dekker J. Genome Res. 2006;16:1299–1309. [PubMed] 36. Felsenstein J. Am Nat. 1985;125:1–15. 37. Snel B, Bork P, Huynen MA. Nat Genet. 1999;21:108–110. [PubMed] 38. Kharchenko P, Chen L, Freund Y, Vitkup D, Church GM. BMC Bioinformatics. 2006;7:177. [PubMed] 39. Price MN, Huang KH, Alm EJ, Arkin AP. Nucleic Acids Res. 2005;33:880–892. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||
Nat Rev Genet. 2005 Sep; 6(9):669-77.
[Nat Rev Genet. 2005]Curr Opin Genet Dev. 2005 Oct; 15(5):507-14.
[Curr Opin Genet Dev. 2005]Nat Genet. 2002 Nov; 32(3):347-52.
[Nat Genet. 2002]PLoS Biol. 2006 May; 4(5):e138.
[PLoS Biol. 2006]Cell. 2005 Jun 17; 121(6):899-911.
[Cell. 2005]Genes Dev. 2004 Jul 15; 18(14):1766-79.
[Genes Dev. 2004]J Bacteriol. 1996 May; 178(10):2825-35.
[J Bacteriol. 1996]Science. 1988 Sep 9; 241(4871):1314-8.
[Science. 1988]Curr Biol. 1998 Oct 8; 8(20):1102-9.
[Curr Biol. 1998]Genes Dev. 2000 Jan 15; 14(2):212-23.
[Genes Dev. 2000]Nature. 2005 Sep 22; 437(7058):512-8.
[Nature. 2005]Mol Cell. 2005 Feb 4; 17(3):453-62.
[Mol Cell. 2005]Proc Natl Acad Sci U S A. 1999 Apr 13; 96(8):4285-8.
[Proc Natl Acad Sci U S A. 1999]BMC Bioinformatics. 2006 Mar 29; 7():177.
[BMC Bioinformatics. 2006]Proc Natl Acad Sci U S A. 1999 Apr 13; 96(8):4285-8.
[Proc Natl Acad Sci U S A. 1999]Proc Natl Acad Sci U S A. 1998 May 26; 95(11):5849-56.
[Proc Natl Acad Sci U S A. 1998]Curr Biol. 1998 Oct 8; 8(20):1102-9.
[Curr Biol. 1998]Genes Dev. 2000 Jan 15; 14(2):212-23.
[Genes Dev. 2000]Proc Natl Acad Sci U S A. 2004 Jun 22; 101(25):9257-62.
[Proc Natl Acad Sci U S A. 2004]Genome Biol. 2000; 1(6):RESEARCH0011.
[Genome Biol. 2000]Science. 2001 May 4; 292(5518):803.
[Science. 2001]Nat Genet. 2002 Nov; 32(3):347-52.
[Nat Genet. 2002]J Bacteriol. 2003 Nov; 185(21):6392-9.
[J Bacteriol. 2003]Genome Biol. 2004; 5(11):R86.
[Genome Biol. 2004]BMC Genomics. 2005 Jun 6; 6(1):84.
[BMC Genomics. 2005]Genome Biol. 2000; 1(6):RESEARCH0011.
[Genome Biol. 2000]Science. 2001 May 4; 292(5518):803.
[Science. 2001]Nat Genet. 2000 Oct; 26(2):195-7.
[Nat Genet. 2000]Proc Natl Acad Sci U S A. 1984 Jan; 81(1):140-4.
[Proc Natl Acad Sci U S A. 1984]Genes Dev. 2004 Jul 15; 18(14):1766-79.
[Genes Dev. 2004]J Bacteriol. 1996 May; 178(10):2825-35.
[J Bacteriol. 1996]Proc Natl Acad Sci U S A. 2004 Jun 22; 101(25):9257-62.
[Proc Natl Acad Sci U S A. 2004]Genes Dev. 2000 Jan 15; 14(2):212-23.
[Genes Dev. 2000]Genes Dev. 2006 Jul 1; 20(13):1727-31.
[Genes Dev. 2006]Genes Dev. 2006 Jul 1; 20(13):1727-31.
[Genes Dev. 2006]Genes Dev. 2000 Jan 15; 14(2):212-23.
[Genes Dev. 2000]Nature. 2005 Sep 22; 437(7058):512-8.
[Nature. 2005]Mol Microbiol. 2003 Dec; 50(5):1493-505.
[Mol Microbiol. 2003]J Biol Chem. 2005 Apr 22; 280(16):15921-7.
[J Biol Chem. 2005]Nat Genet. 2002 Nov; 32(3):347-52.
[Nat Genet. 2002]Mol Microbiol. 2002 Jul; 45(1):17-29.
[Mol Microbiol. 2002]Mol Microbiol. 2005 May; 56(4):858-70.
[Mol Microbiol. 2005]Nature. 2000 Jul 20; 406(6793):287-90.
[Nature. 2000]Curr Opin Genet Dev. 2005 Oct; 15(5):507-14.
[Curr Opin Genet Dev. 2005]Nat Genet. 2002 Nov; 32(3):347-52.
[Nat Genet. 2002]Proc Natl Acad Sci U S A. 2004 Jun 22; 101(25):9257-62.
[Proc Natl Acad Sci U S A. 2004]Genes Dev. 2006 Jul 1; 20(13):1727-31.
[Genes Dev. 2006]Mol Microbiol. 2006 Oct; 62(2):331-8.
[Mol Microbiol. 2006]Curr Biol. 1998 Oct 8; 8(20):1102-9.
[Curr Biol. 1998]Proc Natl Acad Sci U S A. 2004 Jun 22; 101(25):9257-62.
[Proc Natl Acad Sci U S A. 2004]Genes Dev. 2006 Jul 1; 20(13):1727-31.
[Genes Dev. 2006]EMBO J. 2004 Oct 27; 23(21):4330-41.
[EMBO J. 2004]Proc Natl Acad Sci U S A. 2004 Jun 29; 101(26):9780-5.
[Proc Natl Acad Sci U S A. 2004]Genome Res. 2006 Oct; 16(10):1299-309.
[Genome Res. 2006]Proc Natl Acad Sci U S A. 1998 May 26; 95(11):5849-56.
[Proc Natl Acad Sci U S A. 1998]Nat Genet. 1999 Jan; 21(1):108-10.
[Nat Genet. 1999]BMC Bioinformatics. 2006 Mar 29; 7():177.
[BMC Bioinformatics. 2006]Nucleic Acids Res. 2005; 33(3):880-92.
[Nucleic Acids Res. 2005]J Bacteriol. 2003 Nov; 185(21):6392-9.
[J Bacteriol. 2003]J Bacteriol. 2003 Nov; 185(21):6392-9.
[J Bacteriol. 2003]