8
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Counting of oligomers in sequences generated by markov chains for DNA motif discovery.

      1 ,
      Journal of bioinformatics and computational biology

      Read this article at

      ScienceOpenPubMed
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          By means of the technique of the imbedded Markov chain, an efficient algorithm is proposed to exactly calculate first, second moments of word counts and the probability for a word to occur at least once in random texts generated by a Markov chain. A generating function is introduced directly from the imbedded Markov chain to derive asymptotic approximations for the problem. Two Z-scores, one based on the number of sequences with hits and the other on the total number of word hits in a set of sequences, are examined for discovery of motifs on a set of promoter sequences extracted from A. thaliana genome. Source code is available at http://www.itp.ac.cn/zheng/oligo.c.

          Related collections

          Author and article information

          Journal
          J Bioinform Comput Biol
          Journal of bioinformatics and computational biology
          0219-7200
          0219-7200
          Feb 2009
          : 7
          : 1
          Affiliations
          [1 ] Beijing Genomics Institute, Shenzhen (BGI-SZ), Shenzhen 518083, China.
          Article
          S0219720009003935
          19226659
          a50b0658-a62c-48a6-8d91-082e45f09520
          History

          Comments

          Comment on this article