161
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Fitting a mixture model by expectation maximization to discover motifs in biopolymers.

      Read this article at

      ScienceOpenPubMed
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The algorithm described in this paper discovers one or more motifs in a collection of DNA or protein sequences by using the technique of expectation maximization to fit a two-component finite mixture model to the set of sequences. Multiple motifs are found by fitting a mixture model to the data, probabilistically erasing the occurrences of the motif thus found, and repeating the process to find successive motifs. The algorithm requires only a set of unaligned sequences and a number specifying the width of the motifs as input. It returns a model of each motif and a threshold which together can be used as a Bayes-optimal classifier for searching for occurrences of the motif in other databases. The algorithm estimates how many times each motif occurs in each sequence in the dataset and outputs an alignment of the occurrences of the motif. The algorithm is capable of discovering several different motifs with differing numbers of occurrences in a single dataset.

          Related collections

          Author and article information

          Journal
          Proc Int Conf Intell Syst Mol Biol
          Proceedings. International Conference on Intelligent Systems for Molecular Biology
          1553-0833
          1553-0833
          1994
          : 2
          Affiliations
          [1 ] Department of Computer Science and Engineering, University of California at San Diego, La Jolla 92093-0114, USA.
          Article
          7584402
          6dd99bd4-7f4e-4a15-8b2e-ddbcf6e9a13a
          History

          Comments

          Comment on this article