U.S. flag

An official website of the United States government

PMC Full-Text Search Results

Items: 5

1.
Figure 4

Figure 4. From: GenDiS database update with improved approach and features to recognize homologous sequences of protein domain superfamilies.

Superfamily description page. Details pertaining to each superfamily like (a) SCOP/Pfam DA, genomes covered, full length are available in the summary page; (b) validated sequences, domain sequence alignment and phylogeny are available for download in the `Downloads’ page; and (c) the DA-distance trees can be visualized in the `Tree file’ page.

Meenakshi S Iyer, et al. Database (Oxford). 2019;2019:baz042.
2.
Figure 1

Figure 1. From: GenDiS database update with improved approach and features to recognize homologous sequences of protein domain superfamilies.

Workflow followed for GenDiS+. The sequence searches were carried out using BRS-MP approach for multi-membered superfamilies, as described in the Materials and Methods section. The hits were validated using a stringent validation using structure-based sequence alignments of superfamily members. From the validated hits, features like DA computation and phylogeny, domain and full-length alignment and phylogeny and conserved motifs were analysed and have been made available in the database.

Meenakshi S Iyer, et al. Database (Oxford). 2019;2019:baz042.
3.
Figure 3

Figure 3. From: GenDiS database update with improved approach and features to recognize homologous sequences of protein domain superfamilies.

Superfamily statistics in GenDiS+. (a) The distribution of homologues across superfamilies. A majority of the superfamilies (89.65%) have up to 20 000 homologues with only 19 (~1%) superfamilies having over 100 000 members. (b) Distribution of domain length across superfamilies. (c) Most of the superfamily homologues have an average length up to 200 residues, with only homologues of multi-domain superfamily class having a domain length above 500 residues.

Meenakshi S Iyer, et al. Database (Oxford). 2019;2019:baz042.
4.
Figure 5

Figure 5. From: GenDiS database update with improved approach and features to recognize homologous sequences of protein domain superfamilies.

A case study of Smad protein MH1 superfamily. (a) The phylogenetic distribution of the SCOP-DA is represented, across different taxa in Cnidaria and Bilateria. As shown, the single-domain form and the combination with SMAD/FHA are prevalent in all orders while the other domain combinations occur only in certain taxa. (b) The phylogenetic representation of Pfam DA indicates that there is a correspondence between DA predictions by both methods in spite of using differently derived HMM libraries.

Meenakshi S Iyer, et al. Database (Oxford). 2019;2019:baz042.
5.
Figure 2

Figure 2. From: GenDiS database update with improved approach and features to recognize homologous sequences of protein domain superfamilies.

Results for browsing by taxonomy. (a) The taxonomy page lists the superphyla, phyla and family under different categories like cellular organisms, viruses, unclassified sequences and artificial sequences. The users can click on the parent nodes to expand the contents. Clicking on the child nodes with a leaf sign leads to a page listing all the species in the node with the complete hierarchy, accession ids and SCOP superfamilies of homologues. Links have been provided to NCBI taxonomy database on clicking the taxids and GenBank/Refseq page on clicking protein accession ids. Results for the expanded branch of the kingdom `Archaea’ has been shown. (b) Users can also browse the database using names for taxa, phyla, family etc. An example for search result of the keyword `Alveolata’ is shown.

Meenakshi S Iyer, et al. Database (Oxford). 2019;2019:baz042.

Supplemental Content

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...
Support Center