Analysis and Interpretation of Data






The analysis and interpretation of various types of data.  Illustrated here are various ways in which individual entries in sequence and structure databases can be compiled to reveal patterns and trends in biology. For example, sequence families or neighborhoods can be defined and annotated based on the similarity of each sequence to other members of the family. Common sequence features in sequence families can be identified in multiple alignments. These motifs may provide clues to the biochemical function of members of the family. Clustering of sequences into trees that reflect the degree of similarity between each sequence and all of the others in the family reveals evolutionary relationships. Finally, identification of homologs to each gene in well-characterized metabolic pathways provides information about the prevalence of that pathway in other organisms.