10
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      ToxiM: A Toxicity Prediction Tool for Small Molecules Developed Using Machine Learning and Chemoinformatics Approaches

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The experimental methods for the prediction of molecular toxicity are tedious and time-consuming tasks. Thus, the computational approaches could be used to develop alternative methods for toxicity prediction. We have developed a tool for the prediction of molecular toxicity along with the aqueous solubility and permeability of any molecule/metabolite. Using a comprehensive and curated set of toxin molecules as a training set, the different chemical and structural based features such as descriptors and fingerprints were exploited for feature selection, optimization and development of machine learning based classification and regression models. The compositional differences in the distribution of atoms were apparent between toxins and non-toxins, and hence, the molecular features were used for the classification and regression. On 10-fold cross-validation, the descriptor-based, fingerprint-based and hybrid-based classification models showed similar accuracy (93%) and Matthews's correlation coefficient (0.84). The performances of all the three models were comparable (Matthews's correlation coefficient = 0.84–0.87) on the blind dataset. In addition, the regression-based models using descriptors as input features were also compared and evaluated on the blind dataset. Random forest based regression model for the prediction of solubility performed better ( R 2 = 0.84) than the multi-linear regression (MLR) and partial least square regression (PLSR) models, whereas, the partial least squares based regression model for the prediction of permeability (caco-2) performed better ( R 2 = 0.68) in comparison to the random forest and MLR based regression models. The performance of final classification and regression models was evaluated using the two validation datasets including the known toxins and commonly used constituents of health products, which attests to its accuracy. The ToxiM web server would be a highly useful and reliable tool for the prediction of toxicity, solubility, and permeability of small molecules.

          Related collections

          Most cited references35

          • Record: found
          • Abstract: found
          • Article: not found

          The meaning and use of the area under a receiver operating characteristic (ROC) curve.

          A representation and interpretation of the area under a receiver operating characteristic (ROC) curve obtained by the "rating" method, or by mathematical predictions based on patient characteristics, is presented. It is shown that in such a setting the area represents the probability that a randomly chosen diseased subject is (correctly) rated or ranked with greater suspicion than a randomly chosen non-diseased subject. Moreover, this probability of a correct ranking is the same quantity that is estimated by the already well-studied nonparametric Wilcoxon statistic. These two relationships are exploited to (a) provide rapid closed-form expressions for the approximate magnitude of the sampling variability, i.e., standard error that one uses to accompany the area under a smoothed ROC curve, (b) guide in determining the size of the sample required to provide a sufficiently reliable estimate of this area, and (c) determine how large sample sizes should be to ensure that one can statistically detect differences in the accuracy of diagnostic techniques.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            DeepTox: Toxicity Prediction using Deep Learning

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Particulate air pollution and acute health effects.

              Epidemiological studies have consistently shown an association between particulate air pollution and not only exacerbations of illness in people with respiratory disease but also rises in the numbers of deaths from cardiovascular and respiratory disease among older people. Meta-analyses of these studies indicate that the associations are unlikely to be explained by any confounder, and suggest that they represent cause and effect. We propose that the explanation lies in the nature of the urban particulate cloud, which may contain up to 100000 nanometer-sized particles per mL, in what may be a gravimetric concentration of only 100-200 micrograms/m3 of pollutant. We suggest that such ultra-fine particles are able to provoke alveolar inflammation, with release of mediators capable, in susceptible individuals, of causing exacerbations of lung disease and of increasing blood coagulability, thus also explaining the observed increases in cardiovascular deaths associated with urban pollution episodes. This hypothesis is testable both experimentally and epidemiologically.
                Bookmark

                Author and article information

                Contributors
                Journal
                Front Pharmacol
                Front Pharmacol
                Front. Pharmacol.
                Frontiers in Pharmacology
                Frontiers Media S.A.
                1663-9812
                30 November 2017
                2017
                : 8
                : 880
                Affiliations
                Metagenomics and Systems Biology Laboratory, Department of Biological Sciences, Indian Institute of Science Education and Research , Bhopal, India
                Author notes

                Edited by: Vivek K. Bajpai, Dongguk University Seoul, South Korea

                Reviewed by: Marina Evans, Environmental Protection Agency, United States; Christoph Helma, In Silico Toxicology (Switzerland), Switzerland

                *Correspondence: Vineet K. Sharma vineetks@ 123456iiserb.ac.in

                This article was submitted to Predictive Toxicology, a section of the journal Frontiers in Pharmacology

                †These authors have contributed equally to this work.

                Article
                10.3389/fphar.2017.00880
                5714866
                29249969
                702c17e4-195d-46ca-9461-b5f9bba82311
                Copyright © 2017 Sharma, Srivastava, Roy and Sharma.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 12 August 2017
                : 14 November 2017
                Page count
                Figures: 9, Tables: 6, Equations: 1, References: 47, Pages: 18, Words: 8591
                Categories
                Pharmacology
                Technology Report

                Pharmacology & Pharmaceutical medicine
                machine leaning,toxicity prediction,chemoinformatics,solubility,permeability,regression model,classification models

                Comments

                Comment on this article