PMC Full-Text Search Results

Items: 5

1.
Figure 1

Figure 1. From: Improved human disease candidate gene prioritization using mouse phenotype.

ROC curves of random-gene cross-validation based on score ranks. The blue curve was generated from the 19 disease gene training sets. The black curve, a negative control, was generated from 20 random training sets. See text for the definitions of sensitivity and specificity.

Jing Chen, et al. BMC Bioinformatics. 2007;8:392-392.
2.
Figure 3

Figure 3. From: Improved human disease candidate gene prioritization using mouse phenotype.

ROC curves of random-gene cross-validation based on scores. The red curve was generated using all feature sets (AUC score 0.913). The blue curve was generated without Mouse Phenotype annotations (AUC score 0.893). The orange curve was generated without Mouse Phenotype and PubMed annotations (AUC score 0.888). See text for the definitions of sensitivity and specificity.

Jing Chen, et al. BMC Bioinformatics. 2007;8:392-392.
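The AUC values quoted in the Figure 3 caption summarize a ROC curve as a single number. The sketch below shows one conventional way to build ROC points from scores and compute the area with the trapezoidal rule. It assumes the standard threshold-based definitions of sensitivity and specificity; the paper's own definitions are given in its text and may differ. The function names and the toy data are hypothetical.

```python
def roc_points(scores, labels):
    """Return ROC points as (false positive rate, true positive rate) pairs.

    Assumption: a gene is called positive when its score is >= the threshold.
    labels are truthy for true disease genes and falsy otherwise.
    """
    n_pos = sum(1 for y in labels if y)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one positive and one negative example")

    points = [(0.0, 0.0)]
    # Sweep thresholds from the highest score down to the lowest.
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y)
        fp = sum(1 for s, y in zip(scores, labels) if s >= threshold and not y)
        points.append((fp / n_neg, tp / n_pos))
    return points


def auc(points):
    """Area under the ROC curve by the trapezoidal rule."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0
    return area


# Toy data (hypothetical): two true genes, two decoys.
toy_scores = [0.9, 0.8, 0.7, 0.6]
toy_labels = [1, 0, 1, 0]
toy_auc = auc(roc_points(toy_scores, toy_labels))
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation, which is the scale on which the caption's values of 0.913, 0.893 and 0.888 are read.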
3.
Figure 2

Figure 2. From: Improved human disease candidate gene prioritization using mouse phenotype.

AUC of different feature sets. Red bars indicate the AUC scores based on each feature set, and blue bars are the corresponding random controls. Yellow bars indicate the coverage of each feature set in the whole genome. For example, mouse phenotype (MP) has an AUC score of 0.78 and covers 19% of the genes in the whole genome. For each feature set, the ROC curve was generated using only genes with annotations.

Jing Chen, et al. BMC Bioinformatics. 2007;8:392-392.
4.
Figure 4

Figure 4. From: Improved human disease candidate gene prioritization using mouse phenotype.

Performance of locus-region cross-validation using different feature sets. The left y-axis shows the average rank ratio of the "target" genes in the resulting list; lower values indicate better performance. The right y-axis shows the number of "target" genes ranked in the top 5% across a total of 150 prioritizations; higher values indicate better performance. Removing MP, PubMed, or both resulted in a significant drop in performance.

Jing Chen, et al. BMC Bioinformatics. 2007;8:392-392.
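The two measures in the Figure 4 caption can be illustrated with a short sketch. It assumes that the rank ratio of a "target" gene is its rank divided by the number of genes in the prioritized list, and that "top 5%" means a rank ratio of at most 0.05; the caption does not spell out either definition, so both are assumptions. The function name and the example values are hypothetical.

```python
def summarize_prioritizations(results, top_fraction=0.05):
    """Summarize a set of prioritization runs.

    results: list of (target_rank, list_size) pairs, one per prioritization,
             where target_rank is 1 for the best-ranked gene.
    Returns (average rank ratio, number of targets within the top fraction).
    """
    if not results:
        raise ValueError("no prioritizations given")

    # Assumed definition: rank ratio = rank of the target / size of the list.
    ratios = [rank / size for rank, size in results]
    average_ratio = sum(ratios) / len(ratios)
    top_hits = sum(1 for ratio in ratios if ratio <= top_fraction)
    return average_ratio, top_hits


# Hypothetical example: three runs, each ranking 100 candidate genes.
example_runs = [(1, 100), (4, 100), (50, 100)]
avg_ratio, n_top = summarize_prioritizations(example_runs)
```

A lower average rank ratio and a higher top-5% count both point to better performance, which is why the figure plots them on opposite axes.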
5.
Figure 5

Figure 5. From: Improved human disease candidate gene prioritization using mouse phenotype.

Schematic representation of gene prioritization. (A) Genes in the training set are selected based on their attributes or current gene annotations (genes associated with a disease, phenotype, pathway or a GO term). (B) Test gene source can be candidate genes from linkage analysis studies or genes differentially expressed in a particular disease or phenotype. (C) Enriched terms of the eight gene annotations, namely, GO: Molecular Function, GO: Biological Process, Mouse Phenotype, Pathways, Protein Interactions, Protein Domains and Gene Expression, compiled from various data sources, are obtained for the training set of genes. (D) A similarity score is generated for each annotation of each test gene by comparing to the enriched terms in the training set of genes. The final prioritized gene list is then computed based on the aggregated values of the eight similarity scores.

Jing Chen, et al. BMC Bioinformatics. 2007;8:392-392.
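Steps (C) and (D) of the Figure 5 caption can be sketched in a few lines. The caption does not state how the per-annotation similarity is measured or how the scores are aggregated, so the sketch swaps in two simple stand-ins: Jaccard overlap between a test gene's terms and the enriched terms, and an unweighted mean across annotation types. Both choices, along with every name and data value below, are assumptions for illustration and not the paper's method.

```python
def jaccard(gene_terms, enriched_terms):
    """Stand-in similarity: Jaccard overlap of two term sets (assumption)."""
    gene_terms, enriched_terms = set(gene_terms), set(enriched_terms)
    union = gene_terms | enriched_terms
    if not union:
        return 0.0
    return len(gene_terms & enriched_terms) / len(union)


def prioritize(test_genes, enriched):
    """Rank test genes by an aggregated similarity score.

    test_genes: {gene: {annotation_type: set of terms}}
    enriched:   {annotation_type: set of terms enriched in the training set}
    Returns a list of (gene, score) pairs, best first.
    """
    ranked = []
    for gene, annotations in test_genes.items():
        # One similarity score per annotation type the gene is annotated with.
        scores = [
            jaccard(terms, enriched.get(annotation_type, set()))
            for annotation_type, terms in annotations.items()
        ]
        # Stand-in aggregation: unweighted mean; unannotated genes score 0.
        total = sum(scores) / len(scores) if scores else 0.0
        ranked.append((gene, total))
    # Highest score first; ties broken by gene name for a stable order.
    return sorted(ranked, key=lambda pair: (-pair[1], pair[0]))


# Hypothetical toy input with two annotation types.
toy_enriched = {"Mouse Phenotype": {"a", "b"}, "GO: Biological Process": {"x"}}
toy_genes = {
    "G1": {"Mouse Phenotype": {"a", "b"}, "GO: Biological Process": {"x"}},
    "G2": {"Mouse Phenotype": {"a"}, "GO: Biological Process": {"y"}},
    "G3": {},
}
toy_ranking = prioritize(toy_genes, toy_enriched)
```

In this toy input, G1 shares all of its terms with the enriched set and ranks first, while the unannotated G3 ranks last.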