Logo of narLink to Publisher's site
Nucleic Acids Res. 1995 Dec 25; 23(24): 4992–4999.
PMCID: PMC307504

A new DNA sequence assembly program.


We describe the Genome Assembly Program (GAP), a new program for DNA sequence assembly. The program is suitable for large and small projects, a variety of strategies and can handle data from a range of sequencing instruments. It retains the useful components of our previous work, but includes many novel ideas and methods. Many of these methods have been made possible by the program's completely new, and highly interactive, graphical user interface. The program provides many visual clues to the current state of a sequencing project and allows users to interact in intuitive and graphical ways with their data. The program has tools to display and manipulate the various types of data that help to solve and check difficult assemblies, particularly those in repetitive genomes. We have introduced the following new displays: the Contig Selector, the Contig Comparator, the Template Display, the Restriction Enzyme Map and the Stop Codon Map. We have also made it possible to have any number of Contig Editors and Contig Joining Editors running simultaneously even on the same contig. The program also includes a new 'Directed Assembly' algorithm and routines for automatically detecting unfinished segments of sequence, to which it suggests experimental solutions.

Full text

Full text is available as a scanned copy of the original print version. Get a printable copy (PDF file) of the complete article (2.7M), or click on a page image below to browse page by page. Links to PubMed are also available for Selected References.

Selected References

These references are in PubMed. This may not be the complete list of references from this article.
  • Staden R. Automation of the computer handling of gel reading data produced by the shotgun method of DNA sequencing. Nucleic Acids Res. 1982 Aug 11;10(15):4731–4751. [PMC free article] [PubMed]
  • Peltola H, Söderlund H, Ukkonen E. SEQAID: a DNA sequence assembling program based on a mathematical model. Nucleic Acids Res. 1984 Jan 11;12(1 Pt 1):307–321. [PMC free article] [PubMed]
  • Dear S, Staden R. A sequence assembly and editing program for efficient management of large projects. Nucleic Acids Res. 1991 Jul 25;19(14):3907–3911. [PMC free article] [PubMed]
  • Smith S, Welch W, Jakimcius A, Dahlberg T, Preston E, Van Dyke D. High throughput DNA sequencing using an automated electrophoresis analysis system and a novel sequence assembly program. Biotechniques. 1993 Jun;14(6):1014–1018. [PubMed]
  • Huang X. A contig assembly program based on sensitive detection of fragment overlaps. Genomics. 1992 Sep;14(1):18–25. [PubMed]
  • Lawrence CB, Honda S, Parrott NW, Flood TC, Gu L, Zhang L, Jain M, Larson S, Myers EW. The genome reconstruction manager: a software environment for supporting high-throughput DNA sequencing. Genomics. 1994 Sep 1;23(1):192–201. [PubMed]
  • Gleizes A, Hénaut A. A global approach for contig construction. Comput Appl Biosci. 1994 Jul;10(4):401–408. [PubMed]
  • Myers EW, Miller W. Optimal alignments in linear space. Comput Appl Biosci. 1988 Mar;4(1):11–17. [PubMed]
  • Huang X. On global sequence alignment. Comput Appl Biosci. 1994 Jun;10(3):227–235. [PubMed]
  • Staden R. A computer program to enter DNA gel reading data into a computer. Nucleic Acids Res. 1984 Jan 11;12(1 Pt 2):499–503. [PMC free article] [PubMed]
  • Dear S, Staden R. A standard file format for data from DNA sequencing instruments. DNA Seq. 1992;3(2):107–110. [PubMed]
  • Bonfield JK, Staden R. The application of numerical estimates of base calling accuracy to DNA sequencing projects. Nucleic Acids Res. 1995 Apr 25;23(8):1406–1410. [PMC free article] [PubMed]
  • Pearson WR. Using the FASTA program to search protein and DNA sequence databases. Methods Mol Biol. 1994;25:365–389. [PubMed]
  • Prober JM, Trainor GL, Dam RJ, Hobbs FW, Robertson CW, Zagursky RJ, Cocuzza AJ, Jensen MA, Baumeister K. A system for rapid DNA sequencing with fluorescent chain-terminating dideoxynucleotides. Science. 1987 Oct 16;238(4825):336–341. [PubMed]
  • Lee LG, Connell CR, Woo SL, Cheng RD, McArdle BF, Fuller CW, Halloran ND, Wilson RK. DNA sequencing with dye-labeled terminators and T7 DNA polymerase: effect of dyes and dNTPs on incorporation of dye-terminators and probability analysis of termination fragments. Nucleic Acids Res. 1992 May 25;20(10):2471–2483. [PMC free article] [PubMed]
  • Hillier L, Green P. OSP: a computer program for choosing PCR and DNA sequencing primers. PCR Methods Appl. 1991 Nov;1(2):124–128. [PubMed]

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press


Save items

Related citations in PubMed

See reviews...See all...

Cited by other articles in PMC

See all...


  • PubMed
    PubMed citations for these articles

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...