Format

Send to

Choose Destination
ScientificWorldJournal. 2012;2012:365104. doi: 10.1100/2012/365104. Epub 2012 Jun 18.

Mixed sequence reader: a program for analyzing DNA sequences with heterozygous base calling.

Author information

1
Department of Computer Science, National Tsing Hua University, Hsin-Chu, Taiwan.

Abstract

The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR), which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i) physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii) predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determine human papilloma virus (HPV) genotypes by searching current viral databases in cases of double infections; (iv) estimate the copy number of paralogous genes, such as β-defensin 4 (DEFB4) and its paralog HSPDP3.

PMID:
22778697
PMCID:
PMC3385616
DOI:
10.1100/2012/365104
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for Hindawi Limited Icon for PubMed Central
Loading ...
Support Center