Format

Send to

Choose Destination
Mol Biosyst. 2009 Dec;5(12):1593-605. doi: 10.1039/b907946g. Epub 2009 Oct 5.

Supervised learning with decision tree-based methods in computational and systems biology.

Author information

1
Department of EE and CS & GIGA-Research, University of Li├Ęge, Belgium. p.geurts@ulg.ac.be

Abstract

At the intersection between artificial intelligence and statistics, supervised learning allows algorithms to automatically build predictive models from just observations of a system. During the last twenty years, supervised learning has been a tool of choice to analyze the always increasing and complexifying data generated in the context of molecular biology, with successful applications in genome annotation, function prediction, or biomarker discovery. Among supervised learning methods, decision tree-based methods stand out as non parametric methods that have the unique feature of combining interpretability, efficiency, and, when used in ensembles of trees, excellent accuracy. The goal of this paper is to provide an accessible and comprehensive introduction to this class of methods. The first part of the review is devoted to an intuitive but complete description of decision tree-based methods and a discussion of their strengths and limitations with respect to other supervised learning methods. The second part of the review provides a survey of their applications in the context of computational and systems biology.

PMID:
20023720
DOI:
10.1039/b907946g
[Indexed for MEDLINE]

Supplemental Content

Full text links

Icon for Royal Society of Chemistry
Loading ...
Support Center