Format

Send to

Choose Destination
See comment in PubMed Commons below
Bioinformatics. 1999 Mar;15(3):219-27.

EDITtoTrEMBL: a distributed approach to high-quality automated protein sequence annotation.

Author information

1
European Bioinformatics Institute, Hinxton, UK. moeller@ebi.ac.uk

Abstract

SUMMARY:

Many databases in molecular biology face the problem that the ever increasing rate of data production can no longer be handled by traditional methods, especially human curation. Therefore, a number of projects are currently investigating methods for automated sequence annotation. This paper describes the EBI's approach to this problem for protein sequences by integration of arbitrary analysis programs into a distributed and highly flexible environment. Our software framework allows an individual treatment of sequences depending on their particular properties, which is achieved through a high-level description of the preconditions and capabilities of analysing modules. This not only improves the overall performance of the annotation process, as unnecessary steps are avoided, but also enhances its quality since dependencies between different modules are taken into account. We have implemented a prototype and use it in the production of TrEMBL releases.

AVAILABILITY:

Upon request.

PMID:
10222409
[Indexed for MEDLINE]
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Loading ...
    Support Center