Single nucleotide polymorphism hunting in cyberspace

Hum Mutat. 1998;12(4):221-5. doi: 10.1002/(SICI)1098-1004(1998)12:4<221::AID-HUMU1>3.0.CO;2-I.

Abstract

Large-scale sequencing of human cDNA and genomic DNA libraries has produced a large collection of sequence data in public databases. To date, >900,000 human expressed sequence tag (EST) sequences and >80,000,000 bases of genomic DNA sequence have been deposited in Genbank. This ever-expanding data set is a rich source of gene-associated and anonymous single nucleotide polymorphisms (SNPs). DNA sequence variations can be found by comparing the sequences of redundant ESTs and by comparing sequences from overlapping genomic clones. Initial studies have shown that, with proper computer screening, informative SNP markers can be developed from these DNA databases in an efficient and cost-effective manner. Complete public access to these databases will allow individual investigators to add biological value to the human sequence data generated by large-scale sequencing centers.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.
  • Review

MeSH terms

  • DNA / genetics*
  • Databases, Factual*
  • Expressed Sequence Tags
  • Genetic Variation / genetics
  • Genome, Human
  • Humans
  • Polymorphism, Genetic*

Substances

  • DNA