Send to

Choose Destination
J Phycol. 2019 Nov 12. doi: 10.1111/jpy.12947. [Epub ahead of print]

Evidence that inconsistent gene prediction can mislead analysis of dinoflagellate genomes.

Author information

Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, 4072, Australia.
School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, Queensland, 4072, Australia.
Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, New Jersey, 08901, USA.


Comparative algal genomics often relies on predicted genes from de novo assembled genomes. However, the artifacts introduced by different gene-prediction approaches, and their impact on comparative genomic analysis remain poorly understood. Here, using available genome data from six dinoflagellate species in the Symbiodiniaceae, we identified methodological biases in the published genes that were predicted using different approaches and putative contaminant sequences in the published genome assemblies. We developed and applied a comprehensive customized workflow to predict genes from these genomes. The observed variation among predicted genes resulting from our workflow agreed with current understanding of phylogenetic relationships among these taxa, whereas the variation among the previously published genes was largely biased by the distinct approaches used in each instance. Importantly, these biases affect the inference of homologous gene families and synteny among genomes, thus impacting biological interpretation of these data. Our results demonstrate that a consistent gene-prediction approach is critical for comparative analysis of dinoflagellate genomes.


Symbiodiniaceae; algal genomics; dinoflagellate genomics; dinoflagellates; gene prediction


Supplemental Content

Full text links

Icon for Wiley
Loading ...
Support Center