![]() | ![]() |
Formats:
|
||||||||||
Copyright © 2009 by The National Academy of Sciences of the USA Cell Biology Scoring diverse cellular morphologies in image-based screens with iterative feedback and machine learning aThe Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, MA 02142; bWhitehead Institute for Biomedical Research, Massachusetts Institute of Technology, 9 Cambridge Center, Cambridge, MA 02142; cComputer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, 32 Vassar Street, Cambridge, MA 02139; dDana-Farber Cancer Institute and Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, 250 Longwood Avenue, Boston, MA 02115; and eDepartment of Biology, Massachusetts Institute of Technology, 31 Ames Street, Cambridge, MA 02139 2To whom correspondence should be addressed. E-mail: anne/at/broad.mit.edu Edited by Edward M. Scolnick, The Broad Institute, Cambridge, MA, and approved December 12, 2008 Author contributions: T.R.J., A.E.C., D.E.R., P.G., and D.M.S. designed research; T.R.J., A.E.C., M.R.L., J.M., S.J.S., J.K.G., A.B.C., and U.S.E. performed research; T.R.J., A.E.C., and P.G. analyzed data; and T.R.J. and A.E.C. wrote the paper. 1T.R.J. and A.E.C. contributed equally to this work. 3Present address: Banting and Best Department of Medical Research, Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Room 802, Toronto, Ontario, Canada M5S 3E1. Received September 8, 2008. Freely available online through the PNAS open access option. This article has been cited by other articles in PMC.Abstract Many biological pathways were first uncovered by identifying mutants with visible phenotypes and by scoring every sample in a screen via tedious and subjective visual inspection. Now, automated image analysis can effectively score many phenotypes. In practical application, customizing an image-analysis algorithm or finding a sufficient number of example cells to train a machine learning algorithm can be infeasible, particularly when positive control samples are not available and the phenotype of interest is rare. Here we present a supervised machine learning approach that uses iterative feedback to readily score multiple subtle and complex morphological phenotypes in high-throughput, image-based screens. First, automated cytological profiling extracts hundreds of numerical descriptors for every cell in every image. Next, the researcher generates a rule (i.e., classifier) to recognize cells with a phenotype of interest during a short, interactive training session using iterative feedback. Finally, all of the cells in the experiment are automatically classified and each sample is scored based on the presence of cells displaying the phenotype. By using this approach, we successfully scored images in RNA interference screens in 2 organisms for the prevalence of 15 diverse cellular morphologies, some of which were previously intractable. Keywords: high-content screening, high-throughput image analysis, phenotype The history of biology has been dramatically shaped by classic visual screens in model organisms, including Drosophila melanogaster (1–3), Saccharomyces cerevisiae (4), Caenorhabditis elegans (5), and the zebrafish Danio rerio (6, 7). In each case, biological pathways were discovered because researchers were intrigued by groups of peculiar-looking mutants and identified the genes underlying their phenotypes. Because researchers have favored the extensive study of relatively few genes (8), classic, wide-net approaches like screening are as relevant as ever to probe known biological pathways and discover new ones. Modern technology now enables large-scale experiments in cultured cells to identify human genes that underlie biological processes via RNAi. Automation also allows the screening of chemical libraries to identify perturbants useful as research tools or drugs. Despite these advances, scoring cells in images for rare and unusual morphologies has, in general, remained a significant bottleneck (9–12). Cell image analysis allows accurate identification and measurement of cells' features, enabling automated analysis of certain phenotypes that were previously intractable (13–26). However, many interesting phenotypes require the assessment of several measured features of cells. Machine learning methods that select and combine multiple features for automated cell classification have been used to score many phenotypes (15–26). These methods require the provision of example cells that do and do not display the morphology of interest (i.e., positive and negative cells). Finding positive cells is straightforward when positive control samples are available and most of the cells therein show the phenotype. However, when this is not the case, as in classic exploratory screens, finding a sufficient number of positive cells can be prohibitively difficult. Even when positive control samples are available, using positive example cells from only those samples can lead to inaccurate scoring because of overfitting of the machine learning algorithm. Here we describe our approach to scoring multiple complex and subtle phenotypes in large-scale, image-based screens. It is particularly effective when positive control samples are not available or not highly penetrant, as is often the case in RNAi and chemical screens. Our approach uses: (a) a biologist's ability to identify an “interesting” phenotype, (b) automatic measurement of multiple features for each cell, (c) a computer's ability to rapidly test multiple combinations of features using machine learning algorithms, and (d) a computer's ability to quickly and objectively classify millions of individual cells based on their measured features. We used our approach to score 15 diverse cellular phenotypes in large-scale RNAi screens in human and D. melanogaster cells, demonstrating that automated scoring for image-based chemical and genetic screens for multiple complex, low-penetrance phenotypes is now feasible. Results Overview of the Approach. We have developed and validated a method for researchers to rapidly train a computer to score unusual cell morphologies automatically (Fig. 1
Once a few dozen individual cells have been classified by the researcher, a machine learning algorithm is used to determine a tentative rule (i.e., a classifier) that distinguishes the cytoprofiles of the positive and negative example cells, using the GentleBoosting algorithm applied to regression stumps (28). Other machine learning methods are likely to be equally effective, based on their performance in previous work (15–24). The system then presents the researcher with a new batch of cells, which it has classified based on the tentative rule, and the researcher corrects errors. The corrections are used to refine the rule. After several rounds of error correction and rule refinement, the researcher has classified a few hundred cells, and these are used to produce a rule specific to the phenotype of interest. In the final step (Fig. 1
Scoring RNAi Screens for Diverse Phenotypes in Human Cells. We used this iterative approach to recognize and score 14 diverse phenotypes (Figs. 3
Nearly every phenotype we attempted to score could be scored accurately without customization of the image processing. That is, the standard cytoprofiles were sufficient for accurate classification in all but the Peas in a Pod phenotype. We added one feature (angle between a nucleus' 2 nearest neighbors) to the image-analysis step to better identify this phenotype (Fig. 4 Features from the cytoprofiles that were used to classify cells for each phenotype usually included a mixture of measurements of intensity, texture, and area/shape (Fig. S2 and SI Text). Some features were unexpected, implying that choosing features manually by using biological or image-analysis expertise would have overlooked useful features. The features also served to generate hypotheses about phenotypes that were otherwise uncharacterized. For example, cells showing the actin blebs and peripheral actin phenotypes tend to have 4N DNA content, indicating an unexpected relationship to the cell cycle (Fig. S3). For most phenotypes, we knew of no samples that could be considered positive controls, precluding our use of existing methods that require highly penetrant controls (15, 19, 20). Typically, our only exemplars were unusual phenotypes that we observed at a low frequency in WT cells. Factors like cell cycle, local environment, stochastic noise, and epigenetics all play a role in generating nonuniform populations of cells (29, 30). We therefore wondered whether any samples would have an unusually high proportion of cells showing these naturally occurring rare morphologies. Interestingly, every phenotype we pursued yielded at least some RNAi samples in which the phenotype was significantly enriched. This is consistent with the possibility that the number of phenotypic states that are possible for a cell is fairly limited, and natural variation in mRNA expression levels can push cells into one of these states, even without the influence of RNAi. In any event, the system enabled us to indulge our curiosity by pursuing unusual and uncharacterized cellular morphologies, as in classic genetic screens. Validation, Comparison to Previous Methods, and Flexibility. We tested our method's accuracy at ranking samples by having researchers score samples (that is, images showing a population of cells) by eye. The biologically relevant score for a sample is enrichment of cells that display the phenotype, rather than a hard “positive” or “negative” label, because samples in screens typically do not fall into clear positive and negative classes (particularly when judged by different researchers), but instead fall along a continuum (31). Our goal is to bring highly enriched samples to the attention of the researcher; therefore, our validation design (forced choice, described in Methods) (32) aimed to test whether top-ranking samples were indeed enriched relative to samples scored as neutral. The results for actin blebs are shown in detail in Fig. 2 Among all 360 samples identified as “hits” across the different phenotypes (Figs. 3 The phenotypes we chose were particularly challenging because their average penetrance was low (0.2–6.1%), and even the strongest hits for some phenotypes contained <5% positive cells. All phenotypes were, nonetheless, readily scored by our method. Previous approaches (15, 19, 20) have succeeded on highly penetrant phenotypes where positive control samples are known, but none of the phenotypes in our study had positive control samples available, and most were low-penetrance. We chose 4 of the phenotypes in this study and retrospectively tested a positive control-based method on them (Fig. S4). The method worked well on the most highly penetrant, straightforward phenotype, large spread cells (Fig. S4A), but was inferior on the other 3 phenotypes of greater morphological complexity and lower penetrance, in some cases even failing to highly rank the training samples (Fig. S4 B–D). Overfitting is a concern when using machine learning algorithms, but boosting variants are fairly resistant to it (28). Cross-validation results (Fig. S5) show that this is also the case for our approach. The classification accuracy is typically not significantly reduced as the number of individual regression stumps forming a rule for a phenotype increases. To increase the coverage of the training set and guard against training to a too-narrow definition of a phenotype, it is useful to inspect images of the top-ranked samples (or positive control samples, if available), in which positively classified cells are marked. From these images, it is easy to identify false-negative cells and add them to the training set during the iterative training phase. We considered whether a rule will generalize to new experiments. A rule trained on one experiment is unlikely to be applicable to experiments involving different assay protocols, cellular stains, or image acquisition instrumentation, although with our approach, the time required to generate a new rule for the new experiment is minimal. For replicate experiments, creating a training set from one replicate and applying the rule it generates to another replicate risks negatively impacting its accuracy because of undetected experimental variation (Fig. S6B). The more robust approach is to create a training set spanning all replicates (Fig. S6A). Lastly, we tested our method's flexibility by applying it to another large-scale image set. Previously, 288 genes were screened for a metaphase phenotype by RNAi in Drosophila by using living-cell microarrays (33). In our previous work, we identified cells in metaphase by empirically applying sequential gates based on 4 measured features of the DNA stain of each cell. This process took more than a week. With our new approach, we identified metaphase nuclei and accurately scored the entire screen within 4 h, of which only 1 h was hands-on time (Fig. S7 and Fig. S8). The top of the rank-ordered list of genes from the screen (SI Text) contained widerborst (CG5643, the one hit in our original study), as well as other cell-cycle-related genes, e.g., polo (CG12306) and microtubule star (CG7109). The gene most deenriched for metaphase nuclei was Nima-related kinase 2 (Nek2, CG17256; “Nima” derives from “never in mitosis”). As was the case for complex human phenotypes (Fig. S4), providing the positive control sample images directly to the machine learning algorithm was unsuccessful (Fig. S9). Discussion Together, this work indicates that automated scoring of a wide variety of morphologies can be accomplished quickly and easily, even when a phenotype is rare in the WT population and positive control samples are not available. Specifically, the approach is scalable to large-scale, image-based screens (chemical or genetic) in which multiple complex phenotypes are examined. Whereas screening for perturbations of general cellular functions like cell division has yielded large networks of genes (14, 34), the ability to identify more subtle and rare cellular morphologies should yield more tightly focused families of genes worthy of study (35). In particular, morphologies of unknown biological significance are likely to lead to the study of entirely new pathways in the spirit of classic genetic screens. The approach described here is compatible with automated image analysis systems and, importantly, is robust to the occasional segmentation errors produced by such systems. Previous work has demonstrated that machine learning algorithms can be successfully trained by using all cells from positive and negative control samples to create a training set, even for some phenotypes that cannot be visually distinguished by humans (25). Here we showed that, whereas this approach can be successful for highly penetrant phenotypes (Fig. S4A), it is not suitable when the phenotype is less penetrant (Fig. S4 B–D and Fig. S9). We have addressed these challenging situations, thus enabling screens for low-penetrance phenotypes that lack positive control samples. Even when positive control samples are available, leveraging the user's visual perception to select individual example cells helps prevent the machine learning algorithm from focusing on aspects of morphology that are irrelevant to the biological question at hand or from becoming tuned to cells that display some complex combination of phenotypes as the positive control samples (i.e., pleiotropic effects) rather than the specific phenotype of interest. The machine learning approach presented here has been implemented and released as the “Classifier” feature in an open-source software package we developed previously for visualizing and exploring data from image-based screens, called CellProfiler Analyst (33). Methods Algorithms and Software. The software packages used in this work, CellProfiler and CellProfiler Analyst, are open-source (available from the Broad Institute at www.cellprofiler.org). The image-analysis pipeline, which can exactly recreate the analysis in CellProfiler, is provided along with a text description (SI Text). Based on code from Torralba et al. (36), the Classifier functionality was developed as a feature in CellProfiler Analyst for this study; its usage is described in a manual and an online demonstration video (available from the Broad Institute at www.cellprofiler.org/examples). The time to compute a rule is on the order of a few seconds, and grows linearly with training set size and the number of features. Using the rule to classify 8 million cells in a database takes ≈2 min, with the same orders of growth, primarily limited by disk transfer speed, as the full dataset must be read to classify every cell. Image processing times to identify and measure cells using CellProfiler are currently on the order of 10 s to several minutes per image, depending on the particular experimental and image analysis used (for example, ≈2.5 min per 3-channel, 512- × 512-image on a 2.4-GHz Intel CPU with 8 gigabytes of RAM for the human cell images in this study). Cluster computing prevents this from becoming a bottleneck. RNAi Screens, Images, and Cytological Profiles. Images used in the human screens presented here have been previously described (14). Cells were stained for DNA (Hoechst), actin (phalloidin), and phospho-histone-H3 serine 10 (antibody). Approximately 5 separate lentiviral-delivered shRNAs were tested for each of 1,028 genes, mostly kinases and phosphatases, with 2 samples for each shRNA (one with and one without the selection reagent for the shRNA, puromycin) and with 4 images captured per sample. We used the samples treated with puromycin (the selection agent for the shRNA vector) for the validation step shown in Figs. 3 Supporting Information
Acknowledgments. We thank InHan Kang (Massachusetts Institute of Technology) for creating the CellProfiler Analyst software infrastructure and engineering some of the machine learning functionality; the RNAi Consortium and the Broad Institute RNAi Platform for investment of time and resources in the project; the Broad Institute Imaging Platform members for image analysis, statistical analysis, and software engineering (especially Adam Fraser, Adam Papallo, and Martha Vokes); Shomit Sengupta for microscopy and assay guidance; Aviv Regev and Eric Lander for helpful comments on the manuscript; and David Bonnett, Renee Butterfield, Dan Card, Dianne Carpenter, Seth Carpenter, Christopher Lewis, and Themba Nyathi for scoring images for the project. This work was supported by the Broad Institute, the RNAi Consortium, a Novartis fellowship from the Life Sciences Research Foundation (to A.E.C.), a Society for Biomolecular Screening Academic grant (to A.E.C.), a L'Oreal for Women in Science fellowship (to A.E.C.), the Massachusetts Institute of Technology Department of Electrical Engineering and Computer Science/Whitehead/Broad Training Program in Computational Biology, National Institutes of Health Grant DK070069–01 (to T.R.J.), National Science Foundation CAREER Award 0642971 (to P.G.), National Institute of General Medical Sciences Grant R01 GM0725555 (to D.M.S.) and National Institute of Allergy and Infectious Diseases Grant RO1 AI047389 (to D.M.S.). Footnotes The authors declare no conflict of interest. This article is a PNAS Direct Submission. This article contains supporting information online at www.pnas.org/cgi/content/full/0808843106/DCSupplemental. References 1. Nusslein-Volhard C, Wieschaus E. Mutations affecting segment number and polarity in Drosophila. Nature. 1980;287:795–801. [PubMed] 2. Morgan TH. The origin of five mutations in eye color in Drosophila and their modes of inheritance. Science. 1911;33:534–537. [PubMed] 3. Muller H. Artificial Transmutation of the Gene. Science. 1927;66:84–87. [PubMed] 4. Hartwell LH, Culotti J, Reid B. Genetic control of the cell-division cycle in yeast. I. Detection of mutants. Proc Natl Acad Sci USA. 1970;66:352–359. [PubMed] 5. Brenner S. The genetics of Caenorhabditis elegans. Genetics. 1974;77:71–94. [PubMed] 6. Haffter P, et al. The identification of genes with unique and essential functions in the development of the zebrafish, Danio rerio. Development. 1996;123:1–36. [PubMed] 7. Driever W, et al. A genetic screen for mutations affecting embryogenesis in zebrafish. Development. 1996;123:37–46. [PubMed] 8. Su AI, Hogenesch JB. Power-law-like distributions in biomedical publications and research funding. Genome Biol. 2007;8:404. [PubMed] 9. Eggert US, Mitchison TJ. Small molecule screening by imaging. Curr Opin Chem Biol. 2006;10:232–237. [PubMed] 10. Carpenter AE. Image-based chemical screening. Nat Chem Biol. 2007;3:461–465. [PubMed] 11. Carpenter AE, Sabatini DM. Systematic genome-wide screens of gene function. Nat Rev Genet. 2004;5:11–22. [PubMed] 12. Kiger A, et al. A functional genomic analysis of cell morphology using RNA interference. J Biol. 2003;2(4):27. [PubMed] 13. Carpenter AE, et al. CellProfiler: image analysis software for identifying and quantifying cell phenotypes. Genome Biol. 2006;7(10):R100. [PubMed] 14. Moffat J, et al. A lentiviral RNAi library for human and mouse genes applied to an arrayed viral high-content screen. Cell. 2006;124:1283–1298. [PubMed] 15. Bakal C, Aach J, Church G, Perrimon N. Quantitative morphological signatures define local signaling networks regulating cell morphology. Science. 2007;316:1753–1756. [PubMed] 16. Neumann B, et al. High-throughput RNAi screening by time-lapse imaging of live human cells. Nat Methods. 2006;3:385–390. [PubMed] 17. Orlov N, Johnston J, Macura T, Shamir L, Goldberg I. Computer Vision for Microscopy Applications. In: Goro Obinata, Ashish Dutta., editors. Vision Systems: Segmentation and Pattern Recognition. Vienna: I-Tech; 2007. pp. 221–242. 18. Lin C, Mak W, Hong P, Sepp K, Perrimon N. Intelligent Interfaces for Mining Large-Scale RNAi-HCS Image Databases. IEEE 7th International Conference on Bioinformatics and Biomedical Engineering; Washington DC: IEEE; 2007. 19. Chen X, Murphy RF. Automated interpretation of protein subcellular location patterns. Int Rev Cytol. 2006;249:193–227. [PubMed] 20. Loo LH, Wu LF, Altschuler SJ. Image-based multivariate profiling of drug responses from single cells. Nat Methods. 2007;4:445–453. [PubMed] 21. Adams CL, et al. Compound classification using image-based cellular phenotypes. Methods Enzymol. 2006;414:440–468. [PubMed] 22. Tanaka M, et al. An unbiased cell morphology-based screen for new, biologically active small molecules. PLoS Biol. 2005;3(5):e128. [PubMed] 23. Young DW, et al. Integrating high-content screening and ligand-target prediction to identify mechanism of action. Nat Chem Biol. 2008;4:59–68. [PubMed] 24. Wang J, et al. Cellular phenotype recognition for high-content RNA interference genome-wide screening. J Biomol Screen. 2008;13:29–39. [PubMed] 25. Boland MV, Murphy RF. A neural network classifier capable of recognizing the patterns of all major subcellular structures in fluorescence microscope images of HeLa cells. Bioinformatics. 2001;17:1213–1223. [PubMed] 26. Boland MV, Markey MK, Murphy RF. Automated recognition of patterns characteristic of subcellular structures in fluorescence microscopy images. Cytometry. 1998;33:366–375. [PubMed] 27. Perlman ZE, et al. Multidimensional drug profiling by automated microscopy. Science. 2004;306:1194–1198. [PubMed] 28. Friedman JH, Hastie T, Tibshirani R. Additive logistic regression: A statistical view of boosting. Ann Stat. 2000;28(2):337–407. 29. Sigal A, et al. Dynamic proteomics in individual human cells uncovers widespread cell-cycle dependence of nuclear proteins. Nat Methods. 2006;3:525–531. [PubMed] 30. Levsky JM, Singer RH. Gene expression and the myth of the average cell. Trends Cell Biol. 2003;13:4–6. [PubMed] 31. Friedman A, Perrimon N. Genetic screening for signal transduction in the era of network biology. Cell. 2007;128:225–231. [PubMed] 32. Wichmann FA, Graf ABA, Simoncelli EP, Bülthoff HH, Schölkopf B. Machine learning applied to perception: Decision images for gender classification. Adv Neural Info Processing Syst. 2004;17:1489–1496. 33. Jones TR, et al. CellProfiler Analyst: Data exploration and analysis software for complex image-based screens. BMC Bioinformatics. 2008;9:482. [PubMed] 34. Mukherji M, et al. Genome-wide functional analysis of human cell-cycle regulators. Proc Natl Acad Sci USA. 2006;103:14819–14824. [PubMed] 35. Echeverri CJ, et al. Minimizing the risk of reporting false positives in large-scale RNAi screens. Nat Methods. 2006;3:777–779. [PubMed] 36. Torralba A, Murphy KP, Freeman WT. Sharing visual features for multiclass and multiview object detection. IEEE Trans Pattern Anal Machine Intell. 2007;29:854–869. |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||
Nature. 1980 Oct 30; 287(5785):795-801.
[Nature. 1980]Science. 1911 Apr 7; 33(849):534-537.
[Science. 1911]Science. 1927 Jul 22; 66(1699):84-87.
[Science. 1927]Proc Natl Acad Sci U S A. 1970 Jun; 66(2):352-9.
[Proc Natl Acad Sci U S A. 1970]Genetics. 1974 May; 77(1):71-94.
[Genetics. 1974]Curr Opin Chem Biol. 2006 Jun; 10(3):232-7.
[Curr Opin Chem Biol. 2006]Nat Chem Biol. 2007 Aug; 3(8):461-5.
[Nat Chem Biol. 2007]Nat Rev Genet. 2004 Jan; 5(1):11-22.
[Nat Rev Genet. 2004]J Biol. 2003; 2(4):27.
[J Biol. 2003]Genome Biol. 2006; 7(10):R100.
[Genome Biol. 2006]Genome Biol. 2006; 7(10):R100.
[Genome Biol. 2006]Science. 2004 Nov 12; 306(5699):1194-8.
[Science. 2004]Science. 2007 Jun 22; 316(5832):1753-6.
[Science. 2007]Nat Methods. 2006 May; 3(5):385-90.
[Nat Methods. 2006]Int Rev Cytol. 2006; 249():193-227.
[Int Rev Cytol. 2006]Nat Methods. 2007 May; 4(5):445-53.
[Nat Methods. 2007]Methods Enzymol. 2006; 414():440-68.
[Methods Enzymol. 2006]Cell. 2006 Mar 24; 124(6):1283-98.
[Cell. 2006]Science. 2007 Jun 22; 316(5832):1753-6.
[Science. 2007]Int Rev Cytol. 2006; 249():193-227.
[Int Rev Cytol. 2006]Nat Methods. 2007 May; 4(5):445-53.
[Nat Methods. 2007]Nat Methods. 2006 Jul; 3(7):525-31.
[Nat Methods. 2006]Trends Cell Biol. 2003 Jan; 13(1):4-6.
[Trends Cell Biol. 2003]Cell. 2007 Jan 26; 128(2):225-31.
[Cell. 2007]Science. 2007 Jun 22; 316(5832):1753-6.
[Science. 2007]Int Rev Cytol. 2006; 249():193-227.
[Int Rev Cytol. 2006]Nat Methods. 2007 May; 4(5):445-53.
[Nat Methods. 2007]BMC Bioinformatics. 2008 Nov 15; 9():482.
[BMC Bioinformatics. 2008]Cell. 2006 Mar 24; 124(6):1283-98.
[Cell. 2006]Proc Natl Acad Sci U S A. 2006 Oct 3; 103(40):14819-24.
[Proc Natl Acad Sci U S A. 2006]Nat Methods. 2006 Oct; 3(10):777-9.
[Nat Methods. 2006]Bioinformatics. 2001 Dec; 17(12):1213-23.
[Bioinformatics. 2001]BMC Bioinformatics. 2008 Nov 15; 9():482.
[BMC Bioinformatics. 2008]Cell. 2006 Mar 24; 124(6):1283-98.
[Cell. 2006]BMC Bioinformatics. 2008 Nov 15; 9():482.
[BMC Bioinformatics. 2008]