Format

Send to

Choose Destination
BMC Bioinformatics. 2019 Oct 28;20(1):523. doi: 10.1186/s12859-019-3137-2.

Domainoid: domain-oriented orthology inference.

Author information

1
Department of Biochemistry and Biophysics, Science for Life Laboratory, Stockholm University, Box 1031, 17121, Solna, Sweden.
2
Experimental and Clinical Research Cente, a joint cooperation of Max-Delbrück Center for Molecular Medicine and Charité-Universitätsmedizin Berlin, 13125, Berlin, Germany.
3
European Molecular Biology Laboratory, Structural and Computational Biology Unit, 69117, Heidelberg, Germany.
4
Department of Biochemistry and Biophysics, Science for Life Laboratory, Stockholm University, Box 1031, 17121, Solna, Sweden. erik.sonnhammer@scilifelab.se.

Abstract

BACKGROUND:

Orthology inference is normally based on full-length protein sequences. However, most proteins contain independently folding and recurring regions, domains. The domain architecture of a protein is vital for its function, and recombination events mean individual domains can have different evolutionary histories. It has previously been shown that orthologous proteins may differ in domain architecture, creating challenges for orthology inference methods operating on full-length sequences. We have developed Domainoid, a new tool aiming to overcome these challenges faced by full-length orthology methods by inferring orthology on the domain level. It employs the InParanoid algorithm on single domains separately, to infer groups of orthologous domains.

RESULTS:

This domain-oriented approach allows detection of discordant domain orthologs, cases where different domains on the same protein have different evolutionary histories. In addition to domain level analysis, protein level orthology based on the fraction of domains that are orthologous can be inferred. Domainoid orthology assignments were compared to those yielded by the conventional full-length approach InParanoid, and were validated in a standard benchmark.

CONCLUSIONS:

Our results show that domain-based orthology inference can reveal many orthologous relationships that are not found by full-length sequence approaches.

AVAILABILITY:

https://bitbucket.org/sonnhammergroup/domainoid/.

KEYWORDS:

Domain ortholog; Orthology; Protein domain

Supplemental Content

Full text links

Icon for BioMed Central Icon for PubMed Central
Loading ...
Support Center