Improving the coverage of the cyanobacterial phylum using diversity-driven genome sequencing

Proc Natl Acad Sci U S A. 2013 Jan 15;110(3):1053-8. doi: 10.1073/pnas.1217107110. Epub 2012 Dec 31.

Abstract

The cyanobacterial phylum encompasses oxygenic photosynthetic prokaryotes with a great breadth of morphologies and ecologies; they play key roles in global carbon and nitrogen cycles. The chloroplasts of all photosynthetic eukaryotes can trace their ancestry to cyanobacteria. Cyanobacteria also attract considerable interest as platforms for "green" biotechnology and biofuels. To explore the molecular basis of their different phenotypes and biochemical capabilities, we sequenced the genomes of 54 phylogenetically and phenotypically diverse cyanobacterial strains. Comparison of cyanobacterial genomes reveals the molecular basis for many aspects of cyanobacterial ecophysiological diversity, as well as the convergence of complex morphologies without the acquisition of novel proteins. This phylum-wide study highlights the benefits of diversity-driven genome sequencing, identifying more than 21,000 cyanobacterial proteins with no detectable similarity to known proteins, and foregrounds the diversity of light-harvesting proteins and gene clusters for secondary metabolite biosynthesis. Additionally, our results provide insight into the distribution of genes of cyanobacterial origin in eukaryotic nuclear genomes. Moreover, this study doubles both the amount and the phylogenetic diversity of cyanobacterial genome sequence data. Given the exponentially growing number of sequenced genomes, this diversity-driven study demonstrates the perspective and insights gained by comparing disparate yet related genomes in a phylum-wide context.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Bacterial Proteins / chemistry
  • Bacterial Proteins / genetics
  • Bacterial Proteins / metabolism
  • Chlorophyll Binding Proteins / chemistry
  • Chlorophyll Binding Proteins / genetics
  • Chlorophyll Binding Proteins / metabolism
  • Cyanobacteria / classification*
  • Cyanobacteria / genetics*
  • Cyanobacteria / metabolism
  • Evolution, Molecular
  • Genetic Variation
  • Genome, Bacterial*
  • Light-Harvesting Protein Complexes / chemistry
  • Light-Harvesting Protein Complexes / genetics
  • Light-Harvesting Protein Complexes / metabolism
  • Models, Molecular
  • Molecular Sequence Data
  • Multigene Family
  • Photosystem I Protein Complex / chemistry
  • Photosystem I Protein Complex / genetics
  • Photosystem I Protein Complex / metabolism
  • Phylogeny
  • Plastids / genetics
  • Sequence Homology, Amino Acid

Substances

  • Bacterial Proteins
  • Chlorophyll Binding Proteins
  • Light-Harvesting Protein Complexes
  • Photosystem I Protein Complex

Associated data

  • GENBANK/CP003495
  • GENBANK/CP003548
  • GENBANK/CP003549
  • GENBANK/CP003550
  • GENBANK/CP003551
  • GENBANK/CP003552
  • GENBANK/CP003553
  • GENBANK/CP003554
  • GENBANK/CP003558
  • GENBANK/CP003559
  • GENBANK/CP003590
  • GENBANK/CP003591
  • GENBANK/CP003592
  • GENBANK/CP003593
  • GENBANK/CP003594
  • GENBANK/CP003595
  • GENBANK/CP003596
  • GENBANK/CP003600
  • GENBANK/CP003601
  • GENBANK/CP003602
  • GENBANK/CP003610
  • GENBANK/CP003611
  • GENBANK/CP003612
  • GENBANK/CP003613
  • GENBANK/CP003614
  • GENBANK/CP003615
  • GENBANK/CP003616
  • GENBANK/CP003617
  • GENBANK/CP003618
  • GENBANK/CP003619
  • GENBANK/CP003630
  • GENBANK/CP003631
  • GENBANK/CP003632
  • GENBANK/CP003633
  • GENBANK/CP003634
  • GENBANK/CP003635
  • GENBANK/CP003636
  • GENBANK/CP003637
  • GENBANK/CP003638
  • GENBANK/CP003642
  • GENBANK/CP003643
  • GENBANK/CP003644
  • GENBANK/CP003645
  • GENBANK/CP003646
  • GENBANK/CP003647
  • GENBANK/CP003648
  • GENBANK/CP003649
  • GENBANK/CP003650
  • GENBANK/CP003653
  • GENBANK/CP003654
  • GENBANK/CP003655
  • GENBANK/CP003656
  • GENBANK/CP003657
  • GENBANK/CP003658
  • GENBANK/CP003659
  • GENBANK/CP003660
  • GENBANK/CP003661
  • GENBANK/CP003662
  • GENBANK/CP003663
  • GENBANK/CP003664
  • GENBANK/CP003665