Sequence Set Browser
AACQ00000000.1 Candida albicans SC5314
# of Contigs: 413
# of Proteins: 14,217
Total length: 27,558,918 bp
BioProject: PRJNA10701
BioSample: SAMN02953594
Keywords: WGS
Annotation: Contigs
Organism: Candida albicans SC5314
Biosource: /mol_type = genomic; /strain = SC5314
WGS: AACQ01000001:AACQ01000413
Reference: Annotation of the Genome of Candida albicans: Unpublished
Authors: Dungan,J., Kuo,A., Newport,G., Lan,C.-Y., Iijima,C., Adegbola,O., Roberts,J., Persson,K., Donnelly,S., Favoreto,S., Tzung,K.-W., Jones,T., Scherer,S., Agabian,N.
Submission: Submitted (16-APR-2004) Stanford Genome Technology Center, 855 California Avenue, Palo Alto, CA 94304, USA
Authors: Jones,T., Federspiel,N.A., Chibana,H., Dungan,J., Kalman,S., Magee,B.B., Newport,G., Thorstenson,Y.R., Agabian,N., Magee,P.T., Davis,R.W., Scherer,S.
The Candida albicans SC5314 whole genome shotgun (WGS) project has the project accession AACQ00000000. This version of the project (01) has the accession number AACQ01000000, and consists of sequences AACQ01000001-AACQ01000413.
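The accession range above can be expanded into individual contig accessions. The following is a minimal sketch, assuming each accession consists of a six-character prefix (four-letter project code plus two-digit version) followed by a zero-padded serial number; the function name `expand_wgs_range` is hypothetical and not part of any NCBI tool.

```python
def expand_wgs_range(first: str, last: str) -> list[str]:
    """Expand an inclusive WGS accession range into individual accessions.

    Assumption: the first 6 characters (project code + version, e.g. "AACQ01")
    are shared by both ends, and the remainder is a zero-padded serial number.
    """
    prefix_len = 6
    prefix = first[:prefix_len]
    if prefix != last[:prefix_len] or len(first) != len(last):
        raise ValueError("range endpoints must share prefix and length")

    width = len(first) - prefix_len          # digits in the serial part
    start = int(first[prefix_len:])
    end = int(last[prefix_len:])
    if start > end:
        raise ValueError("range start must not exceed range end")

    # Rebuild each accession with the same zero padding as the endpoints.
    return [f"{prefix}{n:0{width}d}" for n in range(start, end + 1)]


accessions = expand_wgs_range("AACQ01000001", "AACQ01000413")
print(len(accessions))   # 413, matching the number of contigs in the record
print(accessions[0])     # AACQ01000001
print(accessions[-1])    # AACQ01000413
```

The count of 413 agrees with the "# of Contigs" field of this record.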
We developed computational methods to reconstruct the diploid genome sequence, using phrap contigs as a starting point. The genome sequence obtained by these methods is much less fragmented than the original phrap assembly, and is in good agreement with available physical mapping data. It differs from other genome sequences in that both copies of the genome are explicitly represented. Many of the contigs occur in homologous pairs. The DNA sequence of paired homologous contigs is usually similar throughout, although in certain regions, such as the mating-type locus, divergence between the homologs is considerable. Paired homologous contigs have numbers differing by 10000; for example, contigs 10065 and 20065 form a homologous pair. Some contigs do not have homologous partners. Such contigs appear to contain either homozygous sequence or, in a few cases, repeat sequences that occur in multiple locations in the genome. All contigs with numbers less than 10000 are unpaired. In addition, contig 10262 is unpaired.
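The contig numbering convention described above can be sketched in code. This is an illustration only, under two assumptions not stated in the record: that paired contigs fall in the ranges 10000-19999 and 20000-29999 (inferred from the 10065/20065 example), and that the number 20262 has no partner because contig 10262 is listed as unpaired. The name `homolog_partner` is hypothetical, and the function derives partner numbers from the rule alone; it does not check that a contig actually exists in the assembly.

```python
from typing import Optional

PAIR_OFFSET = 10000        # paired homologous contigs differ by this amount
KNOWN_UNPAIRED = {10262}   # exception named explicitly in the record


def homolog_partner(contig: int) -> Optional[int]:
    """Return the number of the homologous partner contig, or None if unpaired.

    Assumption: contigs 10000-19999 pair with contig + 10000, and
    contigs 20000-29999 pair with contig - 10000.
    """
    if contig < PAIR_OFFSET or contig in KNOWN_UNPAIRED:
        return None                      # numbers below 10000 are unpaired
    if contig < 2 * PAIR_OFFSET:
        partner = contig + PAIR_OFFSET
    elif contig < 3 * PAIR_OFFSET:
        partner = contig - PAIR_OFFSET
    else:
        return None                      # outside the assumed paired ranges
    # A contig whose would-be partner is known to be unpaired has no partner.
    return None if partner in KNOWN_UNPAIRED else partner


print(homolog_partner(10065))  # 20065
print(homolog_partner(20065))  # 10065
print(homolog_partner(9999))   # None
print(homolog_partner(10262))  # None
```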