4
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Discovery of Novel Conotoxin Candidates Using Machine Learning

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Cone snails (genus Conus) are venomous marine snails that inject prey with a lethal cocktail of conotoxins, small, secreted, and cysteine-rich peptides. Given the diversity and often high affinity for their molecular targets, consisting of ion channels, receptors or transporters, many conotoxins have become invaluable pharmacological probes, drug leads, and therapeutics. Transcriptome sequencing of Conus venom glands followed by de novo assembly and homology-based toxin identification and annotation is currently the state-of-the-art for discovery of new conotoxins. However, homology-based search techniques, by definition, can only detect novel toxins that are homologous to previously reported conotoxins. To overcome these obstacles for discovery, we have created ConusPipe, a machine learning tool that utilizes prominent chemical characters of conotoxins to predict whether a certain transcript in a Conus transcriptome, which has no otherwise detectable homologs in current reference databases, is a putative conotoxin. By using ConusPipe on RNASeq data of 10 species, we report 5148 new putative conotoxin transcripts that have no homologues in current reference databases. 896 of these were identified by at least three out of four models used. These data significantly expand current publicly available conotoxin datasets and our approach provides a new computational avenue for the discovery of novel toxin families.

          Related collections

          Most cited references26

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          nhmmer: DNA homology search with profile HMMs

          Summary: Sequence database searches are an essential part of molecular biology, providing information about the function and evolutionary history of proteins, RNA molecules and DNA sequence elements. We present a tool for DNA/DNA sequence comparison that is built on the HMMER framework, which applies probabilistic inference methods based on hidden Markov models to the problem of homology search. This tool, called nhmmer, enables improved detection of remote DNA homologs, and has been used in combination with Dfam and RepeatMasker to improve annotation of transposable elements in the human genome. Availability: nhmmer is a part of the new HMMER3.1 release. Source code and documentation can be downloaded from http://hmmer.org. HMMER3.1 is freely licensed under the GNU GPLv3 and should be portable to any POSIX-compliant operating system, including Linux and Mac OS/X. Contact: wheelert@janelia.hhmi.org
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression

            N. Altman (1992)
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Cross-Validation of Regression Models

                Bookmark

                Author and article information

                Journal
                Toxins (Basel)
                Toxins (Basel)
                toxins
                Toxins
                MDPI
                2072-6651
                01 December 2018
                December 2018
                : 10
                : 12
                : 503
                Affiliations
                [1 ]Eccles Institute of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA; liqing850104@ 123456gmail.com
                [2 ]Huntsman Cancer Institute, University of Utah, Salt Lake City, UT 84112, USA
                [3 ]Department of Biology, University of Utah, Salt Lake City, UT 84112, USA; maren.watkins@ 123456hsc.utah.edu (M.W.); s.robinson@ 123456imb.uq.edu.au (S.D.R.)
                [4 ]Department of Biochemistry, University of Utah, Salt Lake City, UT 84112, USA
                [5 ]USTAR Center for Genetic Discovery, University of Utah, Salt Lake City, UT 84112, USA
                Author notes
                [†]

                Equal contribution.

                Article
                toxins-10-00503
                10.3390/toxins10120503
                6315676
                30513724
                0154280e-6bdc-4339-bd9f-b395477c0df0
                © 2018 by the authors.

                Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( http://creativecommons.org/licenses/by/4.0/).

                History
                : 29 September 2018
                : 22 November 2018
                Categories
                Article

                Molecular medicine
                machine learning,conotoxins,cone snails,venom,drug discovery
                Molecular medicine
                machine learning, conotoxins, cone snails, venom, drug discovery

                Comments

                Comment on this article