4
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Quantitative missense variant effect prediction using large-scale mutagenesis data

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          SUMMARY

          Large datasets describing the quantitative effects of mutations on protein function are becoming increasingly available. Here, we leverage these datasets to develop Envision, which predicts the magnitude of a missense variant’s molecular effect. Envision combines 21,026 variant effect measurements from nine large-scale experimental mutagenesis datasets, a hitherto untapped training resource, with a supervised, stochastic gradient boosting learning algorithm. Envision outperforms other missense variant effect predictors both on large-scale mutagenesis data and on an independent test dataset comprising 2,312 TP53 variants whose effects were measured using a low-throughput approach. This data set was never used for hyperparameter tuning or model training, and thus serves as an independent validation set. Envision prediction accuracy is also more consistent across amino acids than other predictors. Finally, we demonstrate that Envision’s performance improves as more large-scale mutagenesis data is incorporated. We precompute Envision predictions for every possible single amino acid variant in human, mouse, frog, zebrafish, fruit fly, worm and yeast proteomes ( https://envision.gs.washington.edu/).

          eTOC BLURB

          We present Envision, an accurate predictor of protein variant molecular effect trained using large-scale experimental mutagenesis data.

          Related collections

          Author and article information

          Journal
          101656080
          43733
          Cell Syst
          Cell Syst
          Cell systems
          2405-4712
          2405-4720
          13 November 2017
          06 December 2017
          24 January 2018
          24 January 2019
          : 6
          : 1
          : 116-124.e3
          Affiliations
          [1 ]Department of Genome Sciences, University of Washington, Seattle, WA, 98105, USA
          [2 ]Howard Hughes Medical Institute, Seattle, WA, USA
          [3 ]Department of Bioengineering, University of Washington, Seattle, WA, USA
          Author notes
          Correspondence to: Douglas M. Fowler, dfowler@ 123456u.washington.edu
          [+]

          Lead contact

          Article
          PMC5799033 PMC5799033 5799033 nihpa919665
          10.1016/j.cels.2017.11.003
          5799033
          29226803
          52627091-c1d0-4c4b-bbb3-e0b473bc625a
          History
          Categories
          Article

          Comments

          Comment on this article