PMC Full-Text Search Results

Items: 10

1.
Figure 5

Figure 5. Association between gene repertoire relatedness and phylogenetic distance. From: Organised Genome Dynamics in the Escherichia coli Species Results in Highly Diverse Adaptive Paths.

The horizontal line corresponds to the average relatedness among Escherichia coli/Shigella strains. The log fit shows an R² = 0.26 (p < 0.01), which drops to R² = 0.07 (p < 0.01) if the points before the dashed line are removed.

Marie Touchon, et al. PLoS Genet. 2009 Jan;5(1):e1000344.
2.
Figure 8

Figure 8. Global view of insertion/deletion hot spots. From: Organised Genome Dynamics in the Escherichia coli Species Results in Highly Diverse Adaptive Paths.

Number of genes (ranging from 0 to 200) in indels along the genomes of modern strains according to the ancestral gene order of the core genome. The numbers on the x-axis represent the order of genes in the core genome, which has the same order as E. coli K-12 MG1655.

Marie Touchon, et al. PLoS Genet. 2009 Jan;5(1):e1000344.
3.
Figure 10

Figure 10. Standardized cumulative sum of effective gene conversion rate and G+C content. From: Organised Genome Dynamics in the Escherichia coli Species Results in Highly Diverse Adaptive Paths.

Gene conversion rate (i.e., probability of being involved in a gene conversion event Cgc.Lgc) is shown in blue, and G+C content in red. A decrease in the cumulative sum reflects regions of lower-than-expected values of the statistics. Around the terminus domain, we found a decrease in both recombination and G+C content. Coloured boxes represent the 4 different organisation macrodomains (Right, Ter, Left, Ori). The arrows point towards the origin and terminus of replication.

Marie Touchon, et al. PLoS Genet. 2009 Jan;5(1):e1000344.
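
The Figure 10 caption describes a standardized cumulative sum in which a falling curve marks a region of lower-than-expected values. The caption does not give the exact standardization, so the following is only a minimal sketch under one common assumption: each value is converted to a z-score (subtract the mean, divide by the standard deviation) before summing. The function name and the window values are hypothetical and are not taken from the paper.

```python
from itertools import accumulate
from statistics import mean, pstdev


def standardized_cumsum(values):
    """Cumulative sum of z-scores (assumed standardization, not the paper's)."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        # A constant series never departs from its expectation.
        return [0.0] * len(values)
    return list(accumulate((v - mu) / sigma for v in values))


# Hypothetical G+C content in consecutive windows along a chromosome.
gc_windows = [0.52, 0.51, 0.53, 0.47, 0.46, 0.48, 0.52, 0.51]
curve = standardized_cumsum(gc_windows)

# The curve rises over the first three windows (above-average values),
# falls over the next three (below-average values), then rises again.
for position, value in enumerate(curve):
    print(position, round(value, 2))
```

With this standardization the curve returns to approximately zero at the last window, so local slopes rather than absolute heights carry the signal.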
4.
Figure 2

Figure 2. Frequency of genes within the 20 analysed Escherichia coli genomes. From: Organised Genome Dynamics in the Escherichia coli Species Results in Highly Diverse Adaptive Paths.

At one extreme of the x-axis are the genes present in a single genome which are regarded as strain specific genes (9054 genes: 51% of the pan-genome), while at the opposite end of the scale are situated the genes found in all 20 genomes, which represent the E. coli core-genome (1976 genes: 11% of the pan-genome). Coloured rectangles represent the proportion of insertion sequence (IS)-like elements (yellow), prophage-like elements (green), and genes of unknown/unclassified function (white). Black rectangles represent genes for which a function can be assigned. Strain-specific genes correspond to 2885 IS-like elements (32%), 2352 prophage-like elements (26%), and 3220 genes of unknown/unclassified function (35%).

Marie Touchon, et al. PLoS Genet. 2009 Jan;5(1):e1000344.
5.
Figure 3

Figure 3. Impact of gene conversion rate on phylogenetic reconstruction. From: Organised Genome Dynamics in the Escherichia coli Species Results in Highly Diverse Adaptive Paths.

Sets of 20 sequences of 25 kbp were simulated 100 times under different gene conversion rates with constant tract length (50 bp) and mutation rate. The topology of the “true” genealogy of the sample (as inferred from a single nucleotide on which no gene conversion was allowed) was compared, using Robinson and Foulds distance, to the topology inferred from phylogenetic tree reconstruction using the simulated sequences. Error bars indicate one standard deviation variance, and horizontal bars represent one standard deviation variance from the no-gene-conversion model. A high rate of gene conversion is required to affect the topology of the reconstructed phylogeny. The observed average ratio of gene conversion to mutation (CGC/theta) is indicated by an arrow.

Marie Touchon, et al. PLoS Genet. 2009 Jan;5(1):e1000344.
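
The Figure 3 caption compares tree topologies using the Robinson and Foulds distance. As an illustration only, the sketch below counts the non-trivial bipartitions (splits of the leaf set induced by internal branches) that occur in one tree but not the other. The nested-tuple tree encoding, the function names, and the five-leaf example trees are assumptions made for this sketch; they are not the simulation setup or the software used in the paper.

```python
def leaf_set(tree):
    """All leaf names of a tree written as nested tuples of strings."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaf_set(child) for child in tree))


def bipartitions(tree, all_leaves):
    """Non-trivial splits of the leaf set induced by internal branches."""
    reference = min(all_leaves)
    splits = set()

    def visit(node):
        if isinstance(node, str):
            return frozenset([node])
        clade = frozenset().union(*(visit(child) for child in node))
        # Store each split as the side that excludes the reference leaf,
        # so the same branch is always represented the same way.
        side = all_leaves - clade if reference in clade else clade
        if 1 < len(side) < len(all_leaves) - 1:
            splits.add(side)
        return clade

    visit(tree)
    return splits


def robinson_foulds(tree_a, tree_b):
    """Number of splits present in exactly one of the two trees."""
    leaves = leaf_set(tree_a)
    if leaves != leaf_set(tree_b):
        raise ValueError("trees must share the same leaf set")
    return len(bipartitions(tree_a, leaves) ^ bipartitions(tree_b, leaves))


# Hypothetical "true" and inferred topologies on five sequences.
true_tree = ((("A", "B"), "C"), ("D", "E"))
inferred_tree = ((("A", "C"), "B"), ("D", "E"))
distance = robinson_foulds(true_tree, inferred_tree)
print(distance)
```

A distance of zero means the two topologies agree on every internal branch; each misplaced grouping adds to the count.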
6.
Figure 7

Figure 7. Reconstruction of gains and losses of genes in the evolution of Escherichia coli. From: Organised Genome Dynamics in the Escherichia coli Species Results in Highly Diverse Adaptive Paths.

The cladogram shows the phylogenetic relationships among the 20 E. coli/Shigella genomes rooted on the E. fergusonii genome, as in , ignoring branch lengths for clarity. Each strain and internal node of the tree is labelled with the inferred numbers of genes gained (red: top) and lost (black: top) and the inferred numbers of corresponding events of gene acquisition (red: bottom) and loss (black: bottom) along the branch. Pie charts on each branch represent the functional classification of genes gained based on the colour-scale (details in the keys). The functional classes of known function genes are represented by numbers explained by a key in Supplementary . A similar figure, but displaying the pie charts for genes lost in the branch, is given in supplementary material ().

Marie Touchon, et al. PLoS Genet. 2009 Jan;5(1):e1000344.
7.
Figure 4

Figure 4. Maximum likelihood phylogenetic tree of the 20 Escherichia coli and Shigella strains as reconstructed from the sequences of the 1878 genes of the Escherichia core genome. From: Organised Genome Dynamics in the Escherichia coli Species Results in Highly Diverse Adaptive Paths.

The earliest diverging species, E. fergusonii, was chosen to root the tree. The numbers at the nodes correspond, in black, to the bootstrap values (1000 bootstraps) and, in grey, to a “consensus strength”, which is the number of genes that confirms the bipartition (see ). The latter value is displayed only in instances where consensus and tested trees correspond. The branch length separating E. fergusonii from the E. coli strains is not to scale; the numbers above the branch indicate its length. Phylogenetic group membership of the strains is indicated with bars at the right of the figure.

Marie Touchon, et al. PLoS Genet. 2009 Jan;5(1):e1000344.
8.
Figure 6

Figure 6. Inferred gene content evolution in the lineage of Escherichia coli. From: Organised Genome Dynamics in the Escherichia coli Species Results in Highly Diverse Adaptive Paths.

The cladogram shows the phylogenetic relationships among the 20 E. coli/Shigella genomes rooted on the E. fergusonii genome, as in , but ignoring branch lengths. The major phylogenetic groups are indicated by the vertical lines. Each strain and internal node of the tree is labelled with the number of genes present (as inferred by maximum likelihood: see ). Coloured rectangles represent different gene classes within the gene repertoires of ancestral and modern E. coli. Rectangle widths are proportional to the number of genes. The four different gene classes (by colour) include genes that are: in the core genome (white), not clade-specific (grey), clade-specific but not ubiquitous in the clade (green) and both clade-specific and ubiquitous in the clade (yellow). A clade-specific gene is one that is inferred to be present only in the node and its descendent nodes.

Marie Touchon, et al. PLoS Genet. 2009 Jan;5(1):e1000344.
9.
Figure 1

Figure 1. Escherichia coli core and pan-genome evolution according to the number of sequenced genomes. From: Organised Genome Dynamics in the Escherichia coli Species Results in Highly Diverse Adaptive Paths.

Number of genes in common (left) and total number of non-orthologous genes (right) for a given number of genomes analysed for the different strains of E. coli. The lower and upper edges of the boxes indicate the first quartile (25th percentile of the data) and third quartile (75th percentile), respectively, of 1000 different random input orders of the genomes. The central horizontal line indicates the sample median (50th percentile). The central vertical lines extend from each box as far as the data extend, to a distance of at most 1.5 interquartile ranges (i.e., the distance between the first and third quartile values). At 20 sequenced genomes, the core-genome had 1976 genes (11% of the pan-genome), whereas the pan-genome had (i) 17 838 total genes (black), (ii) 11 432 genes (red) with no strong relation of homology (<80% similarity in sequence), and (iii) 10 131 genes (blue) after removing insertion sequence-like elements (3834, 21% of all genes) and prophage-like elements (3873, 22% of all genes).

Marie Touchon, et al. PLoS Genet. 2009 Jan;5(1):e1000344.
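
The Figure 1 caption summarises core-genome and pan-genome sizes over random input orders of the genomes. The sketch below shows one way such accumulation curves could be computed. It assumes each genome is already reduced to a set of gene-family identifiers, which skips the homology clustering the paper relies on. The function name, the strain names, and the toy gene sets are hypothetical.

```python
import random
from statistics import median


def accumulation_curves(genomes, n_orders=1000, seed=0):
    """Core and pan-genome sizes after adding 1..N genomes in random order.

    genomes: dict mapping a strain name to a set of gene-family identifiers
    (an assumed input format). Returns two lists of lists, indexed by
    (number of genomes added - 1), with one entry per random order.
    """
    rng = random.Random(seed)
    names = list(genomes)
    core_sizes = [[] for _ in names]
    pan_sizes = [[] for _ in names]
    for _ in range(n_orders):
        order = rng.sample(names, len(names))  # one random input order
        core = set(genomes[order[0]])
        pan = set()
        for k, name in enumerate(order):
            core &= genomes[name]  # genes shared by every genome so far
            pan |= genomes[name]   # genes seen in any genome so far
            core_sizes[k].append(len(core))
            pan_sizes[k].append(len(pan))
    return core_sizes, pan_sizes


# Toy gene sets for three hypothetical strains.
toy_genomes = {
    "strain1": {"a", "b", "c"},
    "strain2": {"a", "b", "d"},
    "strain3": {"a", "e"},
}
core_sizes, pan_sizes = accumulation_curves(toy_genomes, n_orders=100)

# Median sizes per number of genomes, as a box plot's central line would show.
for k in range(len(toy_genomes)):
    print(k + 1, median(core_sizes[k]), median(pan_sizes[k]))
```

Once every genome has been added, the order no longer matters, so the spread of each distribution collapses to a single value at the final position.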
10.
Figure 9

Figure 9. The genomic island at the pheV tRNA insertion hot spot in the different Escherichia coli strains. From: Organised Genome Dynamics in the Escherichia coli Species Results in Highly Diverse Adaptive Paths.

The figure provides a synthetic view of the pheV tRNA insertion hot spot in the different studied E. coli strains. This region has been defined using the synteny breaks among 12 E. coli strains. In E. coli K-12 MG1655, the genes immediately flanking the pheV tRNA gene are the ECK2960 gene (speC, ornithine decarboxylase) and the ECK2981 gene (pitB, phosphate transporter). In strain APEC O1, the pheV tRNA gene is absent. As most E. coli genomic regions have a composite structure, e.g., a region partially conserved or found in different synteny groups in other strains (i.e., at different genomic locations), we have manually divided this large genomic island into sub-regions (or modules), which are found in only a subset of the compared E. coli strains. Homologous modules have the same colour code and identifying number throughout. A total of 23 homologous modules were defined. The composition of these modules (i.e., the lists and functional descriptions of the constituent genes) is available in Supplementary Table 7. Black modules are strain-specific. Modules with hatched patterns correspond to repeated regions. Modules with grey dotted patterns are found in other strains but at another genomic location. The pathogenicity island published as PAI-V in UTI89 and 536 or PAI-I in APEC O1 and CFT073 ends just before module number 6.

Marie Touchon, et al. PLoS Genet. 2009 Jan;5(1):e1000344.
