| |
|
| PubMed | Entrez | BLAST | OMIM | Books | TaxBrowser | Entrez Structure |
|
|
||||||
|
Cn3D feature highlights Frequently Asked Questions Installation and Configuration NCBI's structure database Taxonomy in MMDB Structure comparisons Submit structure database searches Research topics and staff |
|
Discontinuous AlignmentsThe alignments displayed by Cn3D, whether calculated by VAST or BLAST and related algorithms, are discontinuous pairwise alignments. This means that each sequence is related to the master sequence through a set of nonoverlapping alignments separated by unaligned regions. Since the unaligned regions are completely unconstrained, they can be represented by spaces or the sequences can be centered or right- or left-justified; all of these are viewer options in DDV. Similarly, the sequences may have N- or C- termini which are not contained in the alignment, so it is left up to the user whether to display one or both of these "tails". Unaligned regions are shown in lowercase to distinguish them from aligned regions. Also, since gaps in unaligned regions exist purely for display purposes (if one unaligned sequence is longer than another, gaps need to be added to the shorter sequence so that they can be displayed correctly), unaligned gaps are represented by ~ instead of the traditional -.
Pairwise-to-multiple: the Intersect By Master algorithmWhile the input alignments are all pairwise, the display shows what seems to be a multiple alignment. The display is created by combining and truncating the pairwise alignments, so that the residues of the master sequence which are shown as aligned in the final display are those residues which are aligned with all of the other sequences shown. The residues shown as aligned in the other rows are the residues aligned with those parts of the master sequence. To see the intersect by master algorithm at work, hide one or more of the rows in the display (you are not allowed to hide the master row). In most cases, the aligned regions will get bigger and may even merge. This is because the pairwise alignment represented by the row that was hidden did not include some residues of the master sequence that were contained in the other alignments in the set, so those residues were designated "unaligned" by the intersect by master algorithm. When a row is hidden, the alignment that contains that row is removed from the set and the intersect by master algorithm is re-run, so those residues of the master sequence that are contained in all the alignments in the new set are now designated "aligned". Note that sometimes the intersect by master algorithm will find that there are no residues of the master sequence which are aligned to all the other sequences; in these cases a null alignment is generated and only unaligned regions are shown. Example: seq2 Alignment 2 (black is unaligned, yellow is aligned) seq3 seq2 seq3
Importing a sequenceYou can add a sequence to the sequence displayed in UDV or to the alignment displayed by DDV. You can choose to use either gapped or ungapped statistics to calculate the alignment between your sequence and the master sequence. The results from either gapped or ungapped BLAST are processed to give a new discontinuous alignment between your imported sequence and the master sequence.
After a new pairwise alignment is generated by gapped or ungapped BLAST, this alignment is added to the current set of alignments and the intersect by master algorithm is applied. The new alignment can therefore cause aligned regions of the original alignment to shrink, split, or disappear altogether. Definitions
|
Updated 04/22/05 |
| Help Desk | NCBI | NLM | NIH | Credits |