13
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence

      research-article
      1 , 2 , 1 , 1 , 3 ,
      BMC Genomics
      BioMed Central
      VarI-SIG 2016: identification and annotation of genetic variants in the context of structure, function, and disease (VarI-SIG 2016)
      09 July 2016
      In silico pathogenicity predictors, Protein sequence variants, Molecular diagnostics, Missense variants, Next-generation sequencing

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Strict guidelines delimit the use of computational information in the clinical setting, due to the still moderate accuracy of in silico tools. These guidelines indicate that several tools should always be used and that full coincidence between them is required if we want to consider their results as supporting evidence in medical decision processes. Application of this simple rule certainly decreases the error rate of in silico pathogenicity assignments. However, when predictors disagree this rule results in the rejection of potentially valuable information for a number of variants. In this work, we focus on these variants of the protein sequence and develop specific predictors to help improve the success rate of their annotation.

          Results

          We have used a set of 59,442 protein sequence variants (15,723 pathological and 43,719 neutral) from 228 proteins to identify those cases for which pathogenicity predictors disagree. We have repeated this process for all the possible combinations of five known methods (SIFT, PolyPhen-2, PON-P2, CADD and MutationTaster2). For each resulting subset we have trained a specific pathogenicity predictor. We find that these specific predictors are able to discriminate between neutral and pathogenic variants, with a success rate different from random. They tend to outperform the constitutive methods but this trend decreases as the performance of the constitutive predictor improves (e.g. with PON-P2 and PolyPhen-2). We also find that specific methods outperform standard consensus methods (Condel and CAROL).

          Conclusion

          Focusing development efforts on the case of variants for which known methods disagree we may obtain pathogenicity predictors with improved performances. Although we have not yet reached the success rate that allows the use of this computational evidence in a clinical setting, the simplicity of the approach indicates that more advanced methods may reach this goal in a close future.

          Electronic supplementary material

          The online version of this article (doi:10.1186/s12864-017-3914-0) contains supplementary material, which is available to authorized users.

          Related collections

          Most cited references22

          • Record: found
          • Abstract: found
          • Article: not found

          Amino acid substitution matrices from protein blocks.

          Methods for alignment of protein sequences typically measure similarity by using a substitution matrix with scores for all possible exchanges of one amino acid with another. The most widely used matrices are based on the Dayhoff model of evolutionary rates. Using a different approach, we have derived substitution matrices from about 2000 blocks of aligned sequence segments characterizing more than 500 groups of related proteins. This led to marked improvements in alignments and in searches using queries from each of the groups.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            The real cost of sequencing: scaling computation to keep pace with data generation

            As the cost of sequencing continues to decrease and the amount of sequence data generated grows, new paradigms for data storage and analysis are increasingly important. The relative scaling behavior of these evolving technologies will impact genomics research moving forward.
              Bookmark
              • Record: found
              • Abstract: found
              • Book: not found

              Neural Networks for Pattern Recognition

              This book provides the first comprehensive treatment of feed-forward neural networks from the perspective of statistical pattern recognition. After introducing the basic concepts of pattern recognition, the book describes techniques for modelling probability density functions, and discusses the properties and relative merits of the multi-layer perceptron and radial basis function network models. It also motivates the use of various forms of error functions, and reviews the principal algorithms for error function minimization. As well as providing a detailed discussion of learning and generalization in neural networks, the book also covers the important topics of data processing, feature extraction, and prior knowledge. The book concludes with an extensive treatment of Bayesian techniques and their applications to neural networks.
                Bookmark

                Author and article information

                Contributors
                xavier.delacruz@vhir.org
                Conference
                BMC Genomics
                BMC Genomics
                BMC Genomics
                BioMed Central (London )
                1471-2164
                11 August 2017
                11 August 2017
                2017
                : 18
                Issue : Suppl 5 Issue sponsor : Publication of this supplement has not been supported by sponsorship. Information about the source of funding for publication charges can be found in the individual articles. The articles have undergone the journal's standard peer review process for supplements. The Supplement Editors declare that they have no competing interests.
                : 569
                Affiliations
                [1 ]GRID grid.7080.f, Research Unit in Translational Bioinformatics, Vall d’Hebron Institute of Research (VHIR), , Universitat Autònoma de Barcelona, ; Barcelona, Spain
                [2 ]ISNI 0000 0004 1757 9848, GRID grid.428973.3, Department of Molecular Genomics, , Instituto de Biología Molecular de Barcelona (IBMB), Consejo Superior de Investigaciones Científicas (CSIC), ; Barcelona, Spain
                [3 ]ISNI 0000 0000 9601 989X, GRID grid.425902.8, , ICREA, ; Barcelona, Spain
                Article
                3914
                10.1186/s12864-017-3914-0
                5558188
                28812538
                e73aec26-62f3-46de-b32b-8d073aee68c8
                © The Author(s). 2017

                Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                VarI-SIG 2016: identification and annotation of genetic variants in the context of structure, function, and disease
                VarI-SIG 2016
                Orlando, Florida, USA
                09 July 2016
                History
                Categories
                Research
                Custom metadata
                © The Author(s) 2017

                Genetics
                in silico pathogenicity predictors,protein sequence variants,molecular diagnostics,missense variants,next-generation sequencing

                Comments

                Comment on this article