Machine Learning and Assay Development for Image-based Phenotypic Profiling of Drug Treatments

Jonathan Z. Sexton; Reid Fursmidt; Matthew J. O’Meara; Wienand Omta; Arvind Rao; David A. Egan; Steven A. Haney

Machine Learning and Assay Development for Image-based Phenotypic Profiling of Drug Treatments

Review

In: Assay Guidance Manual [Internet]. Bethesda (MD): Eli Lilly & Company and the National Center for Advancing Translational Sciences; 2004.

2023 Mar 15.

Authors

Jonathan Z. Sexton¹, Reid Fursmidt¹, Matthew J. O’Meara¹, Wienand Omta², Arvind Rao¹, David A. Egan², Steven A. Haney

Book Editors

Sarine Markossian¹, Abigail Grossman¹, Michelle Arkin², Douglas Auld³, Chris Austin⁴, Jonathan Baell⁵, Kyle Brimacombe¹, Thomas D.Y. Chung⁶, Nathan P. Coussens⁷, Jayme L. Dahlin⁸, Viswanath Devanarayan⁹, Timothy L. Foley¹⁰, Marcie Glicksman¹¹, Kirill Gorshkov¹², Joseph V. Haas¹³, Matthew D. Hall¹, Samuel Hoare¹⁴, James Inglese¹, Philip W. Iversen¹⁵, Madhu Lal-Nag¹⁶, Zhuyin Li¹², Jason R. Manro¹³, James McGee¹³, Owen McManus¹⁷, Mackenzie Pearson¹³, Terry Riss¹⁸, Peter Saradjian¹⁹, G. Sitta Sittampalam¹, Mike Tarselli²⁰, O. Joseph Trask Jr.²¹, Jeffrey R. Weidner²², Mary Jo Wildey²³, Kelli Wilson¹, Menghang Xia¹, Xin Xu¹

Affiliations

¹ University of Michigan
² Core Life Analytics

Book Affiliations

¹ National Center for Advancing Translational Sciences, National Institutes of Health
² University of California, San Francisco
³ Novartis Institutes for Biomedical Research
⁴ GSK plc
⁵ Lyterian Therapeutics
⁶ Sanford Burnham Prebys Medical Discovery Institute
⁷ Frederick National Laboratory for Cancer Research
⁸ Agios Pharmaceuticals
⁹ Eisai, Inc
¹⁰ Pfizer Inc
¹¹ EnClear Therapies
¹² Bristol Myers Squibb
¹³ Eli Lilly and Company
¹⁴ Pharmechanics, LLC
¹⁵ Luther College
¹⁶ InSphero AG
¹⁷ Quiver Bioscience
¹⁸ Promega Corporation
¹⁹ Beth Israel Deaconess Medical Center
²⁰ TetraScience
²¹ Revvity, Inc.
²² QualSci Consulting, LLC
²³ Merck & Co., Inc.

PMID: 36921077
Bookshelf ID: NBK589577

Excerpt

High content imaging produces significant volumes of data on individual cells. The number of discrete measurements per cell can be in the thousands for highly multiplexed assays. Typically, one of these measurements is used as the assay metric (as examples: cell cycle phase, post-translational modification of a protein reporter expression, shape change), and one or a few others can be used as counter screen measures (frequently cell death or stress, sometimes activity of an orthogonal pathway). However, the rich data sets from high content studies can be combined using machine learning to integrate many features into an assay metric that can enhance assay performance and increase the information content in a screen, allowing a more judicious choice of hits. This chapter will review the potential benefits of implementing a machine learning approach in screening, including examples of where it provides more information or better hit selection than single metrics. Examples of machine learning in screening design that cover a few of the key methods, including regression analyses, decision trees, linear discriminant analyses and support vector machines are provided. This chapter will emphasize key elements of assay validation, including increasing general reproducibility and robustness through tracking algorithm performance, and linking feature measurements to the underlying biology. The basic role of an assay to identify perturbations that are most similar to a positive control are covered in depth. The ability to identify novel phenotypes, such as encountered in phenotypic profiling or “Cell Painting”, are also presented.

Sections

Publication types

Review