2
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      OLB-AC: toward optimizing ligand bioactivities through deep graph learning and activity cliffs

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Motivation

          Deep graph learning (DGL) has been widely employed in the realm of ligand-based virtual screening. Within this field, a key hurdle is the existence of activity cliffs (ACs), where minor chemical alterations can lead to significant changes in bioactivity. In response, several DGL models have been developed to enhance ligand bioactivity prediction in the presence of ACs. Yet, there remains a largely unexplored opportunity within ACs for optimizing ligand bioactivity, making it an area ripe for further investigation.

          Results

          We present a novel approach to simultaneously predict and optimize ligand bioactivities through DGL and ACs (OLB-AC). OLB-AC possesses the capability to optimize ligand molecules located near ACs, providing a direct reference for optimizing ligand bioactivities with the matching of original ligands. To accomplish this, a novel attentive graph reconstruction neural network and ligand optimization scheme are proposed. Attentive graph reconstruction neural network reconstructs original ligands and optimizes them through adversarial representations derived from their bioactivity prediction process. Experimental results on nine drug targets reveal that out of the 667 molecules generated through OLB-AC optimization on datasets comprising 974 low-activity, noninhibitor, or highly toxic ligands, 49 are recognized as known highly active, inhibitor, or nontoxic ligands beyond the datasets’ scope. The 27 out of 49 matched molecular pairs generated by OLB-AC reveal novel transformations not present in their training sets. The adversarial representations employed for ligand optimization originate from the gradients of bioactivity predictions. Therefore, we also assess OLB-AC’s prediction accuracy across 33 different bioactivity datasets. Results show that OLB-AC achieves the best Pearson correlation coefficient ( r 2) on 27/33 datasets, with an average improvement of 7.2%–22.9% against the state-of-the-art bioactivity prediction methods.

          Availability and implementation

          The code and dataset developed in this work are available at github.com/Yueming-Yin/OLB-AC.

          Related collections

          Most cited references41

          • Record: found
          • Abstract: found
          • Article: not found

          PubChem in 2021: new data content and improved web interfaces

          Abstract PubChem (https://pubchem.ncbi.nlm.nih.gov) is a popular chemical information resource that serves the scientific community as well as the general public, with millions of unique users per month. In the past two years, PubChem made substantial improvements. Data from more than 100 new data sources were added to PubChem, including chemical-literature links from Thieme Chemistry, chemical and physical property links from SpringerMaterials, and patent links from the World Intellectual Properties Organization (WIPO). PubChem's homepage and individual record pages were updated to help users find desired information faster. This update involved a data model change for the data objects used by these pages as well as by programmatic users. Several new services were introduced, including the PubChem Periodic Table and Element pages, Pathway pages, and Knowledge panels. Additionally, in response to the coronavirus disease 2019 (COVID-19) outbreak, PubChem created a special data collection that contains PubChem data related to COVID-19 and the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2).
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            ChEMBL: towards direct deposition of bioassay data

            Abstract ChEMBL is a large, open-access bioactivity database (https://www.ebi.ac.uk/chembl), previously described in the 2012, 2014 and 2017 Nucleic Acids Research Database Issues. In the last two years, several important improvements have been made to the database and are described here. These include more robust capture and representation of assay details; a new data deposition system, allowing updating of data sets and deposition of supplementary data; and a completely redesigned web interface, with enhanced search and filtering capabilities.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              ADMETlab 2.0: an integrated online platform for accurate and comprehensive predictions of ADMET properties

              Because undesirable pharmacokinetics and toxicity of candidate compounds are the main reasons for the failure of drug development, it has been widely recognized that absorption, distribution, metabolism, excretion and toxicity (ADMET) should be evaluated as early as possible. In silico ADMET evaluation models have been developed as an additional tool to assist medicinal chemists in the design and optimization of leads. Here, we announced the release of ADMETlab 2.0, a completely redesigned version of the widely used AMDETlab web server for the predictions of pharmacokinetics and toxicity properties of chemicals, of which the supported ADMET-related endpoints are approximately twice the number of the endpoints in the previous version, including 17 physicochemical properties, 13 medicinal chemistry properties, 23 ADME properties, 27 toxicity endpoints and 8 toxicophore rules (751 substructures). A multi-task graph attention framework was employed to develop the robust and accurate models in ADMETlab 2.0. The batch computation module was provided in response to numerous requests from users, and the representation of the results was further optimized. The ADMETlab 2.0 server is freely available, without registration, at https://admetmesh.scbdd.com/ . Graphical Abstract ADMETlab 2.0 assists medicinal chemists in the design and optimization of lead compounds.
                Bookmark

                Author and article information

                Contributors
                Role: Associate Editor
                Journal
                Bioinformatics
                Bioinformatics
                bioinformatics
                Bioinformatics
                Oxford University Press
                1367-4803
                1367-4811
                June 2024
                18 June 2024
                18 June 2024
                : 40
                : 6
                : btae365
                Affiliations
                School of Telecommunications and Information Engineering, Nanjing University of Posts and Telecommunications , Nanjing 210003, China
                College of Computing and Data Science, Nanyang Technological University , 639798, Singapore
                School of Telecommunications and Information Engineering, Nanjing University of Posts and Telecommunications , Nanjing 210003, China
                School of Telecommunications and Information Engineering, Nanjing University of Posts and Telecommunications , Nanjing 210003, China
                School of Telecommunications and Information Engineering, Nanjing University of Posts and Telecommunications , Nanjing 210003, China
                Lee Kong Chian School of Medicine, Nanyang Technological University , 637551, Singapore
                School of Biological Sciences, Nanyang Technological University , 637551, Singapore
                Center for Biomedical Informatics, Nanyang Technological University , 637551, Singapore
                Center for AI in Medicine, Nanyang Technological University , 639798, Singapore
                Division of Neurology, Department of Brain Sciences, Faculty of Medicine, Imperial College London , London W12 0NN, U.K
                College of Computing and Data Science, Nanyang Technological University , 639798, Singapore
                School of Computer Science, Nanjing University of Posts and Telecommunications , Nanjing 210023, China
                Author notes
                Corresponding authors. School of Telecommunications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210003, China. Tel: +86(0)13813814879, E-mail: huhf@ 123456njupt.edu.cn (H.H.); School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China. Tel: +86(0)13951913724, E-mail: jansen@ 123456njupt.edu.cn (J.W.)
                Author information
                https://orcid.org/0000-0002-9060-3248
                https://orcid.org/0000-0002-6585-3106
                https://orcid.org/0000-0003-3863-7501
                https://orcid.org/0000-0002-7941-9722
                Article
                btae365
                10.1093/bioinformatics/btae365
                11208724
                38889277
                5d26379a-aa30-4db1-90f7-28ae6d8d4c1f
                © The Author(s) 2024. Published by Oxford University Press.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 19 October 2023
                : 14 May 2024
                : 03 June 2024
                : 14 June 2024
                : 26 June 2024
                Page count
                Pages: 13
                Funding
                Funded by: National Natural Science Foundation of China, DOI 10.13039/501100001809;
                Award ID: 62371245
                Categories
                Original Paper
                Structural Bioinformatics
                AcademicSubjects/SCI01060

                Bioinformatics & Computational biology
                Bioinformatics & Computational biology

                Comments

                Comment on this article