Abundances of light-sensing proteins in metagenomes. (A) Total number of proteins orthologous to 20 query proteins (columns) in 373 sequenced genomes (top row) and five metagenomes (remaining rows). (B) Number of proteins as a percentage of the total number of predicted proteins per environment. Rows labeled in gray are subsamples. Columns are labeled as follows (see Table 1 for details): psa, photosystem I subunits ABC; psb, photosystem II subunits ABDEHIJKLF; pet, photosynthetic electron transfer subunits A123; apc, allophycocyanin; cpc, phycocyanin; kaiAB, circadian clock regulators; bluf, blue-light flavin adenine dinucleotide-binding domain-containing proteins; slr0359/plpA, blue-light-absorbing phototropins; cph1 and cph2, red- and far-red-absorbing phytochromes; taxD1, photoreceptor for phototaxis; cry, DNA photolyase and cryptochrome families; carot, water-soluble carotenoids as intracellular UV sunscreen; scyto, scytonemin as extracellular sunscreen; taxP1, phototaxis putative regulatory element; taxY1, phototaxis CheY-like protein; taxAY1, phototaxis histidine kinase; rcaE, complementary chromatic adaptation protein. Growth and repair proteins are more abundant in high-light environments than invariable-light ones, whereas sensing and adaptation proteins are more abundant in variable-light environments than in high-light ones. In particular, photolyase DNA repair proteins are overrepresented in the high-UV environment of surface seawater compared to all other environments. BLUF domain blue-light-sensing proteins are extremely rare in both genomes and environments, although the majority are found in surface rather than deep water. The red-light sensors Cph1 and Cph2 are overrepresented in deep water rather than primarily blue surface water. RcaE chromatic adaptation proteins are overrepresented in variable-light environments, such as the deep sea and lower (darker) layers of the microbial mat.