9
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Long thought “relics” of evolution, not until recently have pseudogenes been of medical interest regarding regulation in cancer. Often, these regulatory roles are a direct by-product of their close sequence homology to protein-coding genes. Novel pseudogene-gene (PGG) functional associations can be identified through the integration of biomedical data, such as sequence homology, functional pathways, gene expression, pseudogene expression, and microRNA expression. However, not all of the information has been integrated, and almost all previous pseudogene studies relied on 1:1 pseudogene–parent gene relationships without leveraging other homologous genes/pseudogenes.

          Results

          We produce PGG families that expand beyond the current 1:1 paradigm. First, we construct expansive PGG databases by (i) CUDAlign graphics processing unit (GPU) accelerated local alignment of all pseudogenes to gene families (totaling 1.6 billion individual local alignments and >40,000 GPU hours) and (ii) BLAST-based assignment of pseudogenes to gene families. Second, we create an open-source web application (PseudoFuN [Pseudogene Functional Networks]) to search for integrative functional relationships of sequence homology, microRNA expression, gene expression, pseudogene expression, and gene ontology. We produce four “flavors” of CUDAlign-based databases (>462,000,000 PGG pairwise alignments and 133,770 PGG families) that can be queried and downloaded using PseudoFuN. These databases are consistent with previous 1:1 PGG annotation and also are much more powerful including millions of de novo PGG associations. For example, we find multiple known (e.g., miR-20a- PTEN- PTENP1) and novel (e.g., miR-375- SOX15- PPP4R1L) microRNA-gene-pseudogene associations in prostate cancer. PseudoFuN provides a “one stop shop” for identifying and visualizing thousands of potential regulatory relationships related to pseudogenes in The Cancer Genome Atlas cancers.

          Conclusions

          Thousands of new PGG associations can be explored in the context of microRNA-gene-pseudogene co-expression and differential expression with a simple-to-use online tool by bioinformaticians and oncologists alike.

          Related collections

          Most cited references51

          • Record: found
          • Abstract: found
          • Article: not found

          Gene Ontology: tool for the unification of biology

          Genomic sequencing has made it clear that a large fraction of the genes specifying the core biological functions are shared by all eukaryotes. Knowledge of the biological role of such shared proteins in one organism can often be transferred to other organisms. The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing. To this end, three independent ontologies accessible on the World-Wide Web (http://www.geneontology.org) are being constructed: biological process, molecular function and cellular component.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Identification of common molecular subsequences.

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              A coding-independent function of gene and pseudogene mRNAs regulates tumour biology

              The canonical role of messenger RNA (mRNA) is to deliver protein-coding information to sites of protein synthesis. However, given that microRNAs bind to RNAs, we hypothesized that RNAs possess a biological role in cancer cells that relies upon their ability to compete for microRNA binding and is independent of their protein-coding function. As a paradigm for the protein-coding-independent role of RNAs, we describe the functional relationship between the mRNAs produced by the PTEN tumour suppressor gene and its pseudogene (PTENP1) and the critical consequences of this interaction. We find that PTENP1 is biologically active as determined by its ability to regulate cellular levels of PTEN, and that it can exert a growth-suppressive role. We also show that PTENP1 locus is selectively lost in human cancer. We extend our analysis to other cancer-related genes that possess pseudogenes, such as oncogenic KRAS. Further, we demonstrate that the transcripts of protein coding genes such as PTEN are also biologically active. Together, these findings attribute a novel biological role to expressed pseudogenes, as they can regulate coding gene expression, and reveal a non-coding function for mRNAs.
                Bookmark

                Author and article information

                Journal
                Gigascience
                Gigascience
                gigascience
                GigaScience
                Oxford University Press
                2047-217X
                May 2019
                26 April 2019
                26 April 2019
                : 8
                : 5
                : giz046
                Affiliations
                [1 ]Department of Biomedical Informatics, College of Medicine, The Ohio State University, 1800 Cannon Drive, Columbus, OH 43210, USA
                [2 ]Ohio Supercomputer Center, 1224 Kinnear Road, Columbus, OH 43212, USA
                [3 ]School of Electrical and Computer Engineering, Purdue University, 465 Northwestern Avenue, West Lafayette, IN 47907, USA
                [4 ]Department of Medicine, Indiana University School of Medicine, 545 Barnhill Drive, Indianapolis, IN 46202, USA
                [5 ]Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
                [6 ]Division of Pharmaceutics and Pharmaceutical Chemistry, College of Pharmacy, The Ohio State University, 500 West 12 th Avenue, Columbus, OH 43210, USA
                [7 ]Regenstrief Institute, Indiana University, 1101 West 10 th Street, Indianapolis, IN 46262, USA
                [8 ]The Ohio State University Comprehensive Cancer Center (OSUCCC - James), 460 West 10 th Avenue, Columbus, OH 43210, USA
                Author notes
                Correspondence address. Yan Zhang, E-mail: yan.zhang@ 123456osumc.edu

                Mailing Address: 250 Lincoln Tower, 1800 Cannon Drive, Columbus, OH 43210

                Author information
                http://orcid.org/0000-0002-4628-2256
                http://orcid.org/0000-0002-3357-5121
                Article
                giz046
                10.1093/gigascience/giz046
                6486473
                31029062
                f2f8b26b-4221-4a1d-b8bf-be29ca7a9a2d
                © The Author(s) 2019. Published by Oxford University Press.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 22 September 2018
                : 13 December 2018
                : 29 March 2019
                Page count
                Pages: 13
                Funding
                Funded by: U.S. National Library of Medicine 10.13039/100000092
                Award ID: 4T15LM011270-05
                Funded by: National Research Service
                Award ID: 1F31LM013056-01
                Funded by: Comprehensive Cancer Center 10.13039/100008308
                Award ID: P30CA016058
                Categories
                Technical Note

                pseudogenes,database,functional prediction,gene regulation,network analysis,high-performance computing,graphics processing unit,competing endogenous rna

                Comments

                Comment on this article