Send to

Choose Destination
PLoS Curr. 2015 Feb 9;7. pii: ecurrents.outbreaks.c7fd7946ba606c982668a96bcba43c90. doi: 10.1371/currents.outbreaks.c7fd7946ba606c982668a96bcba43c90.

High-resolution Genomic Surveillance of 2014 Ebolavirus Using Shared Subclonal Variants.

Author information

Department of Physics and Department of Systems Biology, Columbia University, New York, New York, USA.
Department of Systems Biology and Department of Biomedical Informatics, Columbia University, New York, New York, USA.



Viral outbreaks, such as the 2014 ebolavirus, can spread rapidly and have complex evolutionary dynamics, including coinfection and bulk transmission of multiple viral populations. Genomic surveillance can be hindered when the spread of the outbreak exceeds the evolutionary rate, in which case consensus approaches will have limited resolution. Deep sequencing of infected patients can identify genomic variants present in intrahost populations at subclonal frequencies (i.e. <50%). Shared subclonal variants (SSVs) can provide additional phylogenetic resolution and inform about disease transmission patterns.


We use metrics from population genetics to analyze data from the 2014 ebolavirus outbreak in Sierra Leone and identify phylogenetic signal arising from SSVs. We use methods derived from information theory to measure a lower bound on transmission bottleneck size.


We identify several SSV that shed light on phylogenetic relationships not captured by consensus-based analyses. We find that transmission bottleneck size is larger than one founder population, yet significantly smaller than the intrahost effective population. Our results demonstrate the important role of shared subclonal variants in genomic surveillance.


ebolavirus; genomic surveilance; intrahost variants; transmission bottleneck

Supplemental Content

Full text links

Icon for Public Library of Science Icon for PubMed Central
Loading ...
Support Center