AACQ00000000.1 Candida albicans SC5314
The Candida albicans SC5314 whole genome shotgun (WGS) project has the project accession AACQ00000000. This version of the project (01) has the accession number AACQ01000000, and consists of sequences AACQ01000001-AACQ01000413.
We developed computational methods to reconstruct the diploid genome sequence, using phrap contigs as a starting point. The genome sequence obtained by these methods is much less fragmented than the original phrap assembly, and is in good agreement with available physical mapping data. It differs in various respects from other genome sequences in that both copies of the genome are explicitly represented. Many of the contigs occur in homologous pairs. The DNA sequence of paired homologous contigs is usually similar throughout, although in certain regions, such as the mating-type locus, divergence between the homologs is considerable. Paired homologous contigs have numbers differing by 10000; for example, contigs 10065 and 20065 form a homologous pair. In addition, some contigs do not have homologous partners. Such contigs appear to contain either homozygous sequence, or in a few cases, repeat sequences that occur in multiple locations in the genome. All contigs with numbers less than 10000 are unpaired. In addition, contig 10262 is unpaired.