161
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Fast-Find: A novel computational approach to analyzing combinatorial motifs

      product-review
      1 , 2 , 3 , 2 ,
      BMC Bioinformatics
      BioMed Central

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Many vital biological processes, including transcription and splicing, require a combination of short, degenerate sequence patterns, or motifs, adjacent to defined sequence features. Although these motifs occur frequently by chance, they only have biological meaning within a specific context. Identifying transcripts that contain meaningful combinations of patterns is thus an important problem, which existing tools address poorly.

          Results

          Here we present a new approach, Fast-FIND ( Fast- Fully Indexed Nucleotide Database), that uses a relational database to support rapid indexed searches for arbitrary combinations of patterns defined either by sequence or composition. Fast-FIND is easy to implement, takes less than a second to search the entire Drosophila genome sequence for arbitrary patterns adjacent to sites of alternative polyadenylation, and is sufficiently fast to allow sensitivity analysis on the patterns. We have applied this approach to identify transcripts that contain combinations of sequence motifs for RNA-binding proteins that may regulate alternative polyadenylation.

          Conclusion

          Fast-FIND provides an efficient way to identify transcripts that are potentially regulated via alternative polyadenylation. We have used it to generate hypotheses about interactions between specific polyadenylation factors, which we will test experimentally.

          Related collections

          Most cited references31

          • Record: found
          • Abstract: found
          • Article: not found

          Transcription regulation and animal diversity.

          Whole-genome sequence assemblies are now available for seven different animals, including nematode worms, mice and humans. Comparative genome analyses reveal a surprising constancy in genetic content: vertebrate genomes have only about twice the number of genes that invertebrate genomes have, and the increase is primarily due to the duplication of existing genes rather than the invention of new ones. How, then, has evolutionary diversity arisen? Emerging evidence suggests that organismal complexity arises from progressively more elaborate regulation of gene expression.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            On schemes of combinatorial transcription logic.

            Cells receive a wide variety of cellular and environmental signals, which are often processed combinatorially to generate specific genetic responses. Here we explore theoretically the potentials and limitations of combinatorial signal integration at the level of cis-regulatory transcription control. Our analysis suggests that many complex transcription-control functions of the type encountered in higher eukaryotes are already implementable within the much simpler bacterial transcription system. Using a quantitative model of bacterial transcription and invoking only specific protein-DNA interaction and weak glue-like interaction between regulatory proteins, we show explicit schemes to implement regulatory logic functions of increasing complexity by appropriately selecting the strengths and arranging the relative positions of the relevant protein-binding DNA sequences in the cis-regulatory region. The architectures that emerge are naturally modular and evolvable. Our results suggest that the transcription regulatory apparatus is a "programmable" computing machine, belonging formally to the class of Boltzmann machines. Crucial to our results is the ability to regulate gene expression at a distance. In bacteria, this can be achieved for isolated genes via DNA looping controlled by the dimerization of DNA-bound proteins. However, if adopted extensively in the genome, long-distance interaction can cause unintentional intergenic cross talk, a detrimental side effect difficult to overcome by the known bacterial transcription-regulation systems. This may be a key factor limiting the genome-wide adoption of complex transcription control in bacteria. Implications of our findings for combinatorial transcription control in eukaryotes are discussed.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Formation of mRNA 3' ends in eukaryotes: mechanism, regulation, and interrelationships with other steps in mRNA synthesis.

              Formation of mRNA 3' ends in eukaryotes requires the interaction of transacting factors with cis-acting signal elements on the RNA precursor by two distinct mechanisms, one for the cleavage of most replication-dependent histone transcripts and the other for cleavage and polyadenylation of the majority of eukaryotic mRNAs. Most of the basic factors have now been identified, as well as some of the key protein-protein and RNA-protein interactions. This processing can be regulated by changing the levels or activity of basic factors or by using activators and repressors, many of which are components of the splicing machinery. These regulatory mechanisms act during differentiation, progression through the cell cycle, or viral infections. Recent findings suggest that the association of cleavage/polyadenylation factors with the transcriptional complex via the carboxyl-terminal domain of the RNA polymerase II (Pol II) large subunit is the means by which the cell restricts polyadenylation to Pol II transcripts. The processing of 3' ends is also important for transcription termination downstream of cleavage sites and for assembly of an export-competent mRNA. The progress of the last few years points to a remarkable coordination and cooperativity in the steps leading to the appearance of translatable mRNA in the cytoplasm.
                Bookmark

                Author and article information

                Journal
                BMC Bioinformatics
                BMC Bioinformatics
                BioMed Central (London )
                1471-2105
                2006
                4 January 2006
                : 7
                : 1
                Affiliations
                [1 ]Department of Computer Science, University of Colorado, Boulder, CO 80309, USA
                [2 ]Department of Molecular, Cellular and Developmental Biology, University of Colorado, Boulder, CO 80309, USA
                [3 ]Department of Chemistry and Biochemistry, University of Colorado at Boulder, Boulder, CO 80309, USA
                Article
                1471-2105-7-1
                10.1186/1471-2105-7-1
                1360682
                16393334
                40bc5890-ed10-45a2-a260-7e93b3aee14f
                Copyright © 2006 Hamady et al; licensee BioMed Central Ltd.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 28 July 2005
                : 4 January 2006
                Categories
                Software

                Bioinformatics & Computational biology
                Bioinformatics & Computational biology

                Comments

                Comment on this article