Your browser version may not work well with NCBI's Web applications. More information here...
1: Nature. 1998 Jun 11;393(6685):537-44.Click here to read Links
Erratum in:
Nature 1998 Nov 12;396(6707):190.
Comment in:
Nature. 1998 Jun 11;393(6685):515-6.

Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence.

Sanger Centre, Wellcome Trust Genome Campus, Hinxton, UK. stcole@pasteur.fr

Countless millions of people have died from tuberculosis, a chronic infectious disease caused by the tubercle bacillus. The complete genome sequence of the best-characterized strain of Mycobacterium tuberculosis, H37Rv, has been determined and analysed in order to improve our understanding of the biology of this slow-growing pathogen and to help the conception of new prophylactic and therapeutic interventions. The genome comprises 4,411,529 base pairs, contains around 4,000 genes, and has a very high guanine + cytosine content that is reflected in the biased amino-acid content of the proteins. M. tuberculosis differs radically from other bacteria in that a very large portion of its coding capacity is devoted to the production of enzymes involved in lipogenesis and lipolysis, and to two new families of glycine-rich proteins with a repetitive structure that may represent a source of antigenic variation.

PMID: 9634230 [PubMed - indexed for MEDLINE]