NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.

Metagenomics: Sequences from the Environment [Internet]. Bethesda (MD): National Center for Biotechnology Information (US); 2006.

Cover of Metagenomics

Metagenomics: Sequences from the Environment [Internet].

Show details

Global Ocean Sampling Expedition Microbial Metagenome

Microbial DNA sequences were isolated from seawater samples collected during a global circumnavigation aboard the Sorcerer II in order to analyze the genomic and functional diversity within this particular marine environment. Starting in Halifax, Canada, samples were collected at sites along the U.S. east coast, Gulf of Mexico, Galapagos Islands, central and south Pacific Oceans, Australia, Indian Ocean, South Africa, and across the Atlantic back to the U.S. This large dataset allows investigators to study genetic and biochemical microbial diversity within the marine environment.


A total of 41 different samples were taken from a variety of aquatic habitats collected over 8,000 km. 7.7 million sequencing reads were obtained from size-fractionated samples, yielding 6.4 million contiguous sequences, totaling 5.9 Gbp of nonredundant sequence. These were further processed into about 3 million assemblies (scaffolds).

Isolation Source

Samples were collected as part of the Sorcerer II expedition between August 8, 2003, and May 22, 2004. Most specimens were collected from surface water marine environments at approximately 320 km intervals. 44 samples were obtained from 41 sites, covering a wide range of distinct surface marine environments as well as a few nonmarine aquatic samples for contrast.

Genome Assembly

The 4,124,495 contigs were further assembled into 3,087,206 WGS scaffolds using a overlap cutoff of 98%. 85% of the assembled sequences and 57% of unassembled data is unique at the 98% identity cutoff. Based on clustering and HMM profiling, more than 6.1 million proteins were annotated on this dataset (includes bacterial as well as viral sequences). These are defined as "marine metagenome" within the source. 60 highly abundant ribotypes were identified and found to be associated with open ocean and aquatic samples.


  1. Venter JC, Remington K, Heidelberg JF, Halpern AL, Rusch D, Eisen JA, Wu D, Paulsen I, Nelson KE, Nelson W, Fouts DE, Levy S, Knap AH, Lomas MW, Nealson K, White O, Peterson J, Hoffman J, Parsons R, Baden-Tillson H, Pfannkoch C, Rogers YH, Smith HO. Environmental genome shotgun sequencing of the Sargasso Sea. Science. 2004 Apr 2;304(5667):58-60. [PubMed: 15001713]
  2. Tringe SC, von Mering C, Kobayashi A, Salamov AA, Chen K, Chang HW, Podar M, Short JM, Mathur EJ, Detter JC, Bork P, Hugenholtz P, Rubin EM. Comparative metagenomics of microbial communities. Science. 2005 Apr 22;308(5721):5547. [PubMed: 15845853]
  3. Rusch DB, Halpern AL, Sutton G, Heidelberg KB, Williamson S, Yooseph S, Wu D, Eisen JA, Hoffman JM, Remington K, Beeson K, Tran B, Smith H, Baden-Tillson H, Stewart C, Thorpe J, Freeman J, Andrews-Pfannkoch C, Venter JE, Li K, Kravitz S, Heidelberg JF, Utterback T, Rogers YH, Falcon LI, Souza V, Bonilla-Rosso G, Eguiarte LE, Karl DM, Sathyendranath S, Platt T, Bermingham E, Gallardo V, Tamayo-Castillo G, Ferrari MR, Strausberg RL, Nealson K, Friedman R, Frazier M, Venter JC. The Sorcerer II Global Ocean Sampling Expedition: Northwest Atlantic through Eastern Tropical Pacific. PLoS Biol. 2007 Mar 13;5(3):e77. [PMC free article: PMC1821060] [PubMed: 17355176]
  4. Yooseph S, Sutton G, Rusch DB, Halpern Al, Williamson SJ, Remington K, Eisen JA, Heidelberg KB, Manning G, Li W, Jaroszewski L, Cieplak P, Miller CS, Li H, Mashiyama ST, Joachimiak MP, van Belle C, Chandonia JM, Soergel DA, Zhai Y, Natarajan K, Lee S, Raphael BJ, Bafna V, Friedman R, Brenner SE, Godzik A, Eisenberg D, Dixon JE, Taylor SS, Strausberg RL, Frazier M, Venter JC. The Sorcerer II Global Ocean Sampling Expedition: Expanding the Universe of Protein Families. PLoS Biol. 2007 Mar 13;5(3):e16. [PMC free article: PMC1821046] [PubMed: 17355171]
  5. Kannan N, Taylor SS, Zhai Y, Venter JC, Manning G. Structural and Functional Diversity of the Microbial Kinome. PLoS Biol. 2007 Mar 13;5(3):e17. [PMC free article: PMC1821047] [PubMed: 17355172]


Sequence Analysis

Related information

  • PMC
    PubMed Central citations
  • PubMed
    Links to PubMed

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...