(A) Each circle represents a gene, labeled with the source genome according to the first letter of each taxon—C, M, H, and F for Ciona, mouse, human, and fugu, respectively—and further differentiated by numeral. BLASTP was first used to search all vertebrate genes for the one most similar to Ciona's C1 gene, in this case the mouse gene M1. Then other genes are recruited to the cluster if they have a higher similarity score to M1 that that between C1 and M1, indicated here by the red lines. The six genes shown on the right side of the diagram have some sequence similarity to those in the cluster, but less than that between C1 and M1, so are not included. Because the vertebrates are more closely related to each other than any is to Ciona, each cluster will include those genes descended from a single gene in the common chordate ancestor, having arisen by either lineage splitting or gene duplication specific to one or more vertebrates. (See Materials and Methods for more details.)
(B) Evolutionary tree of the genes in this cluster show separate duplications for fugu and for human. Because the maximum likelihood method does not rely solely on sequence similarity, there is no significance to the mouse gene being most similar to C1. The mouse genome simply contained the most slowly evolving vertebrate gene in this multigene family; this can be from any vertebrate taxon with approximately equal likelihood.