52
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Machine learning in genetics and genomics

      research-article
      Nature reviews. Genetics

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The field of machine learning promises to enable computers to assist humans in making sense of large, complex data sets. In this review, we outline some of the main applications of machine learning to genetic and genomic data. In the process, we identify some recurrent challenges associated with this type of analysis and provide general guidelines to assist in the practical application of machine learning to real genetic and genomic data.

          Related collections

          Most cited references28

          • Record: found
          • Abstract: found
          • Article: not found

          Gene Ontology: tool for the unification of biology

          Genomic sequencing has made it clear that a large fraction of the genes specifying the core biological functions are shared by all eukaryotes. Knowledge of the biological role of such shared proteins in one organism can often be transferred to other organisms. The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing. To this end, three independent ontologies accessible on the World-Wide Web (http://www.geneontology.org) are being constructed: biological process, molecular function and cellular component.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            What is a support vector machine?

            Support vector machines (SVMs) are becoming popular in a wide variety of biological applications. But, what exactly are SVMs and how do they work? And what are their most promising applications in the life sciences?
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              A genomic code for nucleosome positioning.

              Eukaryotic genomes are packaged into nucleosome particles that occlude the DNA from interacting with most DNA binding proteins. Nucleosomes have higher affinity for particular DNA sequences, reflecting the ability of the sequence to bend sharply, as required by the nucleosome structure. However, it is not known whether these sequence preferences have a significant influence on nucleosome position in vivo, and thus regulate the access of other proteins to DNA. Here we isolated nucleosome-bound sequences at high resolution from yeast and used these sequences in a new computational approach to construct and validate experimentally a nucleosome-DNA interaction model, and to predict the genome-wide organization of nucleosomes. Our results demonstrate that genomes encode an intrinsic nucleosome organization and that this intrinsic organization can explain approximately 50% of the in vivo nucleosome positions. This nucleosome positioning code may facilitate specific chromosome functions including transcription factor binding, transcription initiation, and even remodelling of the nucleosomes themselves.
                Bookmark

                Author and article information

                Journal
                100962779
                22269
                Nat Rev Genet
                Nat. Rev. Genet.
                Nature reviews. Genetics
                1471-0056
                1471-0064
                29 December 2016
                07 May 2015
                June 2015
                02 January 2017
                : 16
                : 6
                : 321-332
                Affiliations
                Department of Computer Science and Engineering University of Washington Genome Sciences, Foege Building 3720 15th Ave NE Seattle, WA 98195-5065
                Department of Genome Sciences Department of Computer Science and Engineering Genome Sciences, Box 355065 Foege Building, S220B 3720 15th Ave NE Seattle, WA 98195-5065 University of Washington
                Author notes
                Article
                PMC5204302 PMC5204302 5204302 nihpa839467
                10.1038/nrg3920
                5204302
                25948244
                e8468b9b-c8b2-46eb-ba8d-d7a7c10f4ebe
                History
                Categories
                Article

                Comments

                Comment on this article