Format

Send to

Choose Destination
J Gen Virol. 2008 Sep;89(Pt 9):2098-107. doi: 10.1099/vir.0.83657-0.

Classification of hepatitis C virus and human immunodeficiency virus-1 sequences with the branching index.

Author information

1
Theoretical Biology & Biophysics, T-10 MS K710, LANL, Los Alamos, NM 87545, USA. phraber@lanl.gov

Abstract

Classification of viral sequences should be fast, objective, accurate and reproducible. Most methods that classify sequences use either pair-wise distances or phylogenetic relations, but cannot discern when a sequence is unclassifiable. The branching index (BI) combines distance and phylogeny methods to compute a ratio that quantifies how closely a query sequence clusters with a subtype clade. In the hypothesis-testing framework of statistical inference, the BI is compared with a threshold to test whether sufficient evidence exists for the query sequence to be classified among known sequences. If above the threshold, the null hypothesis of no support for the subtype relation is rejected and the sequence is taken as belonging to the subtype clade with which it clusters on the tree. This study evaluates statistical properties of the BI for subtype classification in hepatitis C virus (HCV) and human immunodeficiency virus-1 (HIV-1). Pairs of BI values with known positive- and negative-test results were computed from 10,000 random fragments of reference alignments. Sampled fragments were of sufficient length to contain phylogenetic signals that grouped reference sequences together properly into subtype clades. For HCV, a threshold BI of 0.71 yields 95.1% agreement with reference subtypes, with equal false-positive and false-negative rates. For HIV-1, a threshold of 0.66 yields 93.5% agreement. Higher thresholds can be used where lower false-positive rates are required. In synthetic recombinants, regions without breakpoints are recognized accurately; regions with breakpoints do not represent any known subtype uniquely. Web-based services for viral subtype classification with the BI are available online.

PMID:
18753218
PMCID:
PMC2754793
DOI:
10.1099/vir.0.83657-0
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for Ingenta plc Icon for PubMed Central
Loading ...
Support Center