18
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Predicting compound activity from phenotypic profiles and chemical structures

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Predicting assay results for compounds virtually using chemical structures and phenotypic profiles has the potential to reduce the time and resources of screens for drug discovery. Here, we evaluate the relative strength of three high-throughput data sources—chemical structures, imaging (Cell Painting), and gene-expression profiles (L1000)—to predict compound bioactivity using a historical collection of 16,170 compounds tested in 270 assays for a total of 585,439 readouts. All three data modalities can predict compound activity for 6–10% of assays, and in combination they predict 21% of assays with high accuracy, which is a 2 to 3 times higher success rate than using a single modality alone. In practice, the accuracy of predictors could be lower and still be useful, increasing the assays that can be predicted from 37% with chemical structures alone up to 64% when combined with phenotypic data. Our study shows that unbiased phenotypic profiling can be leveraged to enhance compound bioactivity prediction to accelerate the early stages of the drug-discovery process.

          Abstract

          Experimental assays are used to determine if compounds cause a desired activity in cells. Here the authors demonstrate that computational methods can predict compound bioactivity given their chemical structure, imaging and gene expression data from historic screening libraries.

          Related collections

          Most cited references43

          • Record: found
          • Abstract: found
          • Article: not found

          A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles

          We previously piloted the concept of a Connectivity Map (CMap), whereby genes, drugs, and disease states are connected by virtue of common gene-expression signatures. Here, we report more than a 1,000-fold scale-up of the CMap as part of the NIH LINCS Consortium, made possible by a new, low-cost, high-throughput reduced representation expression profiling method that we term L1000. We show that L1000 is highly reproducible, comparable to RNA sequencing, and suitable for computational inference of the expression levels of 81% of non-measured transcripts. We further show that the expanded CMap can be used to discover mechanism of action of small molecules, functionally annotate genetic variants of disease genes, and inform clinical trials. The 1.3 million L1000 profiles described here, as well as tools for their analysis, are available at https://clue.io.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            CellProfiler 3.0: Next-generation image processing for biology

            CellProfiler has enabled the scientific research community to create flexible, modular image analysis pipelines since its release in 2005. Here, we describe CellProfiler 3.0, a new version of the software supporting both whole-volume and plane-wise analysis of three-dimensional (3D) image stacks, increasingly common in biomedical research. CellProfiler’s infrastructure is greatly improved, and we provide a protocol for cloud-based, large-scale image processing. New plugins enable running pretrained deep learning models on images. Designed by and for biologists, CellProfiler equips researchers with powerful computational tools via a well-documented user interface, empowering biologists in all fields to create quantitative, reproducible image analysis workflows.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              New substructure filters for removal of pan assay interference compounds (PAINS) from screening libraries and for their exclusion in bioassays.

              This report describes a number of substructural features which can help to identify compounds that appear as frequent hitters (promiscuous compounds) in many biochemical high throughput screens. The compounds identified by such substructural features are not recognized by filters commonly used to identify reactive compounds. Even though these substructural features were identified using only one assay detection technology, such compounds have been reported to be active from many different assays. In fact, these compounds are increasingly prevalent in the literature as potential starting points for further exploration, whereas they may not be.
                Bookmark

                Author and article information

                Contributors
                jcaicedo@broad.mit.edu
                Journal
                Nat Commun
                Nat Commun
                Nature Communications
                Nature Publishing Group UK (London )
                2041-1723
                8 April 2023
                8 April 2023
                2023
                : 14
                : 1967
                Affiliations
                [1 ]GRID grid.66859.34, ISNI 0000 0004 0546 1623, Broad Institute of MIT and Harvard, ; Cambridge, USA
                [2 ]GRID grid.418331.c, ISNI 0000 0001 2195 9606, Biological Research Centre, ; Szeged, Hungary
                [3 ]GRID grid.47840.3f, ISNI 0000 0001 2181 7878, University of California, ; Berkeley, USA
                Author information
                http://orcid.org/0000-0001-9615-0799
                http://orcid.org/0000-0002-2629-361X
                http://orcid.org/0000-0002-1800-5112
                http://orcid.org/0000-0003-3150-3025
                http://orcid.org/0000-0003-1555-8261
                http://orcid.org/0000-0002-1277-4631
                Article
                37570
                10.1038/s41467-023-37570-1
                10082762
                37031208
                c41ffd60-4ecd-4531-aa84-e15cb36e94f5
                © The Author(s) 2023

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 24 May 2022
                : 23 March 2023
                Funding
                Funded by: The Broad Institute Schmidt Fellowship program
                Categories
                Article
                Custom metadata
                © The Author(s) 2023

                Uncategorized
                machine learning,cheminformatics
                Uncategorized
                machine learning, cheminformatics

                Comments

                Comment on this article