2
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Machine learning based personalized drug response prediction for lung cancer patients

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Lung cancers with a mutated epidermal growth factor receptor (EGFR) are a major contributor to cancer fatalities globally. Targeted tyrosine kinase inhibitors (TKIs) have been developed against EGFR and show encouraging results for survival rate and quality of life. However, drug resistance may affect treatment plans and treatment efficacy may be lost after about a year. Predicting the response to EGFR-TKIs for EGFR-mutated lung cancer patients is a key research area. In this study, we propose a personalized drug response prediction model (PDRP), based on molecular dynamics simulations and machine learning, to predict the response of first generation FDA-approved small molecule EGFR-TKIs, Gefitinib/Erlotinib, in lung cancer patients. The patient’s mutation status is taken into consideration in molecular dynamics (MD) simulation. Each patient’s unique mutation status was modeled considering MD simulation to extract molecular-level geometric features. Moreover, additional clinical features were incorporated into machine learning model for drug response prediction. The complete feature set includes demographic and clinical information (DCI), geometrical properties of the drug-target binding site, and the binding free energy of the drug-target complex from the MD simulation. PDRP incorporates an XGBoost classifier, which achieves state-of-the-art performance with 97.5% accuracy, 93% recall, 96.5% precision, and 94% F1-score, for a 4-class drug response prediction task. We found that modeling the geometry of the binding pocket combined with binding free energy is a good predictor for drug response. However, we observed that clinical information had a little impact on the performance of the model. The proposed model could be tested on other types of cancers. We believe PDRP will support the planning of effective treatment regimes based on clinical-genomic information. The source code and related files are available on GitHub at:   https://github.com/rizwanqureshi123/PDRP/.

          Related collections

          Most cited references57

          • Record: found
          • Abstract: found
          • Article: not found

          The Protein Data Bank.

          The Protein Data Bank (PDB; http://www.rcsb.org/pdb/ ) is the single worldwide archive of structural data of biological macromolecules. This paper describes the goals of the PDB, the systems in place for data deposition and access, how to obtain further information, and near-term plans for the future development of the resource.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            PTRAJ and CPPTRAJ: Software for Processing and Analysis of Molecular Dynamics Trajectory Data.

            We describe PTRAJ and its successor CPPTRAJ, two complementary, portable, and freely available computer programs for the analysis and processing of time series of three-dimensional atomic positions (i.e., coordinate trajectories) and the data therein derived. Common tools include the ability to manipulate the data to convert among trajectory formats, process groups of trajectories generated with ensemble methods (e.g., replica exchange molecular dynamics), image with periodic boundary conditions, create average structures, strip subsets of the system, and perform calculations such as RMS fitting, measuring distances, B-factors, radii of gyration, radial distribution functions, and time correlations, among other actions and analyses. Both the PTRAJ and CPPTRAJ programs and source code are freely available under the GNU General Public License version 3 and are currently distributed within the AmberTools 12 suite of support programs that make up part of the Amber package of computer programs (see http://ambermd.org ). This overview describes the general design, features, and history of these two programs, as well as algorithmic improvements and new features available in CPPTRAJ.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              EGF receptor gene mutations are common in lung cancers from "never smokers" and are associated with sensitivity of tumors to gefitinib and erlotinib.

              Somatic mutations in the tyrosine kinase (TK) domain of the epidermal growth factor receptor (EGFR) gene are reportedly associated with sensitivity of lung cancers to gefitinib (Iressa), kinase inhibitor. In-frame deletions occur in exon 19, whereas point mutations occur frequently in codon 858 (exon 21). We found from sequencing the EGFR TK domain that 7 of 10 gefitinib-sensitive tumors had similar types of alterations; no mutations were found in eight gefitinib-refractory tumors (P = 0.004). Five of seven tumors sensitive to erlotinib (Tarceva), a related kinase inhibitor for which the clinically relevant target is undocumented, had analogous somatic mutations, as opposed to none of 10 erlotinib-refractory tumors (P = 0.003). Because most mutation-positive tumors were adenocarcinomas from patients who smoked <100 cigarettes in a lifetime ("never smokers"), we screened EGFR exons 2-28 in 15 adenocarcinomas resected from untreated never smokers. Seven tumors had TK domain mutations, in contrast to 4 of 81 non-small cell lung cancers resected from untreated former or current smokers (P = 0.0001). Immunoblotting of lysates from cells transiently transfected with various EGFR constructs demonstrated that, compared to wild-type protein, an exon 19 deletion mutant induced diminished levels of phosphotyrosine, whereas the phosphorylation at tyrosine 1092 of an exon 21 point mutant was inhibited at 10-fold lower concentrations of drug. Collectively, these data show that adenocarcinomas from never smokers comprise a distinct subset of lung cancers, frequently containing mutations within the TK domain of EGFR that are associated with gefitinib and erlotinib sensitivity.
                Bookmark

                Author and article information

                Contributors
                talam@hbku.edu.qa
                Journal
                Sci Rep
                Sci Rep
                Scientific Reports
                Nature Publishing Group UK (London )
                2045-2322
                7 November 2022
                7 November 2022
                2022
                : 12
                : 18935
                Affiliations
                [1 ]GRID grid.452146.0, ISNI 0000 0004 1789 3191, College of Science and Engineering, , Hamad Bin Khalifa University, ; Doha, Qatar
                [2 ]FAST National University of Computer and Emerging Sciences, Karachi, Pakistan
                [3 ]GRID grid.35030.35, ISNI 0000 0004 1792 6846, Department of Electrical Engineering, , City University of Hong Kong, ; Kowloon, Hong Kong
                [4 ]GRID grid.35030.35, ISNI 0000 0004 1792 6846, Center for Intelligent Multidimensional Data Analysis (CIMDA), , City University of Hong Kong, ; Kowloon, Hong Kong
                [5 ]GRID grid.10784.3a, ISNI 0000 0004 1937 0482, Department of Biomedical Engineering, , The Chinese University of Hong Kong, ; Shatin, Hong Kong, SAR China
                Article
                23649
                10.1038/s41598-022-23649-0
                9640729
                36344580
                a514e8cc-1526-45c7-9a53-3da5122bc60a
                © The Author(s) 2022

                Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 30 April 2022
                : 3 November 2022
                Categories
                Article
                Custom metadata
                © The Author(s) 2022

                Uncategorized
                computer science,lung cancer
                Uncategorized
                computer science, lung cancer

                Comments

                Comment on this article