5
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Predicting Preference of Transcription Factors for Methylated DNA Using Sequence Information

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Transcription factors play key roles in cell-fate decisions by regulating 3D genome conformation and gene expression. The traditional view is that methylation of DNA hinders transcription factors binding to them, but recent research has shown that many transcription factors prefer to bind to methylated DNA. Therefore, identifying such transcription factors and understanding their functions is a stepping-stone for studying methylation-mediated biological processes. In this paper, a two-step discriminated method was proposed to recognize transcription factors and their preference for methylated DNA based only on sequences information. In the first step, the proposed model was used to discriminate transcription factors from non-transcription factors. The areas under the curve (AUCs) are 0.9183 and 0.9116, respectively, for the 5-fold cross-validation test and independent dataset test. Subsequently, for the classification of transcription factors that prefer methylated DNA and transcription factors that prefer non-methylated DNA, our model could produce the AUCs of 0.7744 and 0.7356, respectively, for the 5-fold cross-validation test and independent dataset test. Based on the proposed model, a user-friendly web server called TFPred was built, which can be freely accessed at http://lin-group.cn/server/TFPred/.

          Graphical Abstract

          Abstract

          Transcription factors binding to methylated DNA perform special and unclear functions. Lin and colleagues developed a machine-learning-based method to predict transcription factors and their preference for methylated DNA, which will help the discovery of methylated DNA-bound transcription factors and the study of their functions.

          Related collections

          Most cited references68

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          CD-HIT Suite: a web server for clustering and comparing biological sequences

          Summary: CD-HIT is a widely used program for clustering and comparing large biological sequence datasets. In order to further assist the CD-HIT users, we significantly improved this program with more functions and better accuracy, scalability and flexibility. Most importantly, we developed a new web server, CD-HIT Suite, for clustering a user-uploaded sequence dataset or comparing it to another dataset at different identity levels. Users can now interactively explore the clusters within web browsers. We also provide downloadable clusters for several public databases (NCBI NR, Swissprot and PDB) at different identity levels. Availability: Free access at http://cd-hit.org Contact: liwz@sdsc.edu Supplementary information: Supplementary data are available at Bioinformatics online.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Transcriptional Addiction in Cancer.

            Cancer arises from genetic alterations that invariably lead to dysregulated transcriptional programs. These dysregulated programs can cause cancer cells to become highly dependent on certain regulators of gene expression. Here, we discuss how transcriptional control is disrupted by genetic alterations in cancer cells, why transcriptional dependencies can develop as a consequence of dysregulated programs, and how these dependencies provide opportunities for novel therapeutic interventions in cancer.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Impact of cytosine methylation on DNA binding specificities of human transcription factors.

              The majority of CpG dinucleotides in the human genome are methylated at cytosine bases. However, active gene regulatory elements are generally hypomethylated relative to their flanking regions, and the binding of some transcription factors (TFs) is diminished by methylation of their target sequences. By analysis of 542 human TFs with methylation-sensitive SELEX (systematic evolution of ligands by exponential enrichment), we found that there are also many TFs that prefer CpG-methylated sequences. Most of these are in the extended homeodomain family. Structural analysis showed that homeodomain specificity for methylcytosine depends on direct hydrophobic interactions with the methylcytosine 5-methyl group. This study provides a systematic examination of the effect of an epigenetic DNA modification on human TF binding specificity and reveals that many developmentally important proteins display preference for mCpG-containing sequences.
                Bookmark

                Author and article information

                Contributors
                Journal
                Mol Ther Nucleic Acids
                Mol Ther Nucleic Acids
                Molecular Therapy. Nucleic Acids
                American Society of Gene & Cell Therapy
                2162-2531
                31 July 2020
                04 December 2020
                31 July 2020
                : 22
                : 1043-1050
                Affiliations
                [1 ]Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
                Author notes
                []Corresponding author: Hao Lin, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China. hlin@ 123456uestc.edu.cn
                Article
                S2162-2531(20)30224-9
                10.1016/j.omtn.2020.07.035
                7691157
                8699dc55-7b2a-42da-a02a-80e50d2ede46
                © 2020 The Author(s)

                This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

                History
                : 27 May 2020
                : 28 July 2020
                Categories
                Original Article

                Molecular medicine
                transcription factors,methylated dna,machine learning,sequence feature,web server

                Comments

                Comment on this article