Strain-specific genes of Helicobacter pylori: distribution, function and dynamics

Nucleic Acids Res. 2001 Nov 1;29(21):4395-404. doi: 10.1093/nar/29.21.4395.

Abstract

Whole-genome clustering of the two available genome sequences of Helicobacter pylori strains 26695 and J99 allows the detection of 110 and 52 strain-specific genes, respectively. This set of strain-specific genes was compared with the sets obtained with other computational approaches of direct genome comparison as well as experimental data from microarray analysis. A considerable number of novel function assignments is possible using database-driven sequence annotation, although the function of the majority of the identified genes remains unknown. Using whole-genome clustering, it is also possible to detect species-specific genes by comparing the two H.pylori strains against the genome sequence of Campylobacter jejuni. It is interesting that the majority of strain-specific genes appear to be species specific. Finally, we introduce a novel approach to gene position analysis by employing measures from directional statistics. We show that although the two strains exhibit differences with respect to strain-specific gene distributions, this is due to the extensive genome rearrangements. If these are taken into account, a common pattern for the genome dynamics of the two Helicobacter strains emerges, suggestive of certain spatial constraints that may act as control mechanisms of gene flux.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Bacterial Proteins / chemistry
  • Bacterial Proteins / classification
  • Bacterial Proteins / genetics
  • Bacterial Proteins / metabolism
  • Campylobacter jejuni / genetics
  • Computational Biology
  • Databases, Protein
  • Evolution, Molecular*
  • Gene Order / genetics
  • Genes, Bacterial / genetics*
  • Genome, Bacterial*
  • Genomics*
  • Helicobacter pylori / classification*
  • Helicobacter pylori / genetics*
  • Internet
  • Models, Genetic
  • Molecular Sequence Data
  • Sequence Alignment
  • Species Specificity

Substances

  • Bacterial Proteins