26
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A computational approach to identify transposable element insertions in cancer cells

      abstract
      1 , 2 , 1 , 1 , 3 ,
      Genome Biology
      BioMed Central
      Beyond the Genome 2011
      19-22 September 2011

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background Transposable elements (TEs) in the human genome may contribute to molecular evolution, hereditary diseases and cancer [1-3]. Therefore, analyzing the impact of TEs in the genome is necessary to better characterize genetic events related to tumorigenesis. Here, we used a computational approach to identify TE insertions in publicly available data for exome sequences in lymphoblastoid and breast tumor cells derived from the same patient. Methods A total of 29,340, sequences from the cell lines HCC1954 (18,365,271) and HCC1954BL (10,975,107) were used to investigate gene fusion with TEs (gfTEs) [4,5]. The RepeatMasker and Burrows-Wheeler Alignment (BWA) tools were used to identify and to map gfTEs, respectively. We also used BEDTools to find overlaps between gfTEs and genome annotations. Human mRNAs and RepeatMasker tracks were downloaded in BED format from the GRCh37/ hg19 assembly. Repbase was used to filter the eukaryotic TEs. Results RepeatMasker was used to identify gfTEs in the exome reads. Next, the repeat masked reads were aligned against the reference genome using BWA. Finally, we filtered the aligned reads to exclude those without TEs (length of Ns <15, Ns means block of nucleotides masked), those with alignments showing low sequence identity (<95%) or those with a small hit length (<50 nucleotides). The study focused on the detection of TEs in coding sequence gene regions. A total of 3,307,608 reads were excluded, and 23,841 reads were predicted as cancer-specific gfTEs. Table 1 shows the number of gfTEs distributed among the TE families and highlights the members with higher frequency in both cell lines. Insertions of LINE/L1 and SINE/Alu were the most frequent. The Gene Ontology analysis for the biological process and molecular function terms showed a bias toward membrane receptor and cell adhesion proteins. Table 1 Number of genes containing insertion of TEs from different families Class/Family HCC1954BL (N) HCC1954 (T) DNA 4 3 DNA/MuDR 5 1 DNA/PiggyBac 2 2 DNA/TcMar-Mariner 10 9 DNA/TcMar-Tc2 6 8 DNA/TcMar-Tigger 90 96 DNA/hAT 2 8 DNA/hAT-Blackjack 7 19 DNA/hAT-Charlie 107 137 DNA/hAT-Tip100 12 19 LINE/CR1 23 25 LINE/Dong-R4 1 1 LINE/L1 863 641 LINE/L2 163 175 LINE/RTE 9 13 LINE/RTE-BovB 1 0 LTR 1 2 LTR/ERV1 134 145 LTR/ERVK 11 17 LTR/ERVL 70 77 LTR/ERVL-MaLR 148 186 LTR/Gypsy 6 7 Other 5 4 RNA 1 3 SINE 6 17 SINE/Alu 264 406 SINE/Deu 5 14 SINE/MIR 109 145 SINE/tRNA 0 3 Satellite 7 15 Satellite/acro 2 1 Satellite/centr 52 112 Unknown 6 8 rRNA 14 11 scRNA 4 2 snRNA 0 2 srpRNA 4 1 tRNA 0 1 Total 2.154 2.340 Conclusions We used a computational approach to identify putative cancer-specific gfTEs using human exome capture sequences. Interestingly, the total number of gfTEs was similar in normal and tumor cell lines, but the Gene Ontology analysis revealed an enrichment of insertions in genes encoding protein receptors and cell adhesion molecules. These results suggest that TEs could be contributing to cancer development.

          Related collections

          Most cited references3

          • Record: found
          • Abstract: found
          • Article: not found

          Retrotransposable elements and human disease.

          Nearly 50% of the human genome is composed of fossils from the remains of past transposable element duplication. Mobilization continues in the genomes of extant humans but is now restricted to retrotransposons, a class of mobile elements that move via a copy and paste mechanism. Currently active retrotransposable elements include Long INterspersed Elements (LINEs), Short INterspersed Elements (SINEs) and SVA (SINE/VNTR/Alu) elements. Retrotransposons are responsible for creating genetic variation and on occasion, disease-causing mutations, within the human genome. Approximately 0.27% of all human disease mutations are attributable to retrotransposable elements. Different mechanisms of genome alteration created by retrotransposable elements include insertional mutagenesis, recombination, retrotransposition-mediated and gene conversion-mediated deletion, and 3' transduction. Although researchers in the field of human genetics have discovered many mutational mechanisms for retrotransposable elements, their contribution to genetic variation within humans is still being resolved.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Systematic detection of putative tumor suppressor genes through the combined use of exome and transcriptome sequencing

            Background To identify potential tumor suppressor genes, genome-wide data from exome and transcriptome sequencing were combined to search for genes with loss of heterozygosity and allele-specific expression. The analysis was conducted on the breast cancer cell line HCC1954, and a lymphoblast cell line from the same individual, HCC1954BL. Results By comparing exome sequences from the two cell lines, we identified loss of heterozygosity events at 403 genes in HCC1954 and at one gene in HCC1954BL. The combination of exome and transcriptome sequence data also revealed 86 and 50 genes with allele specific expression events in HCC1954 and HCC1954BL, which comprise 5.4% and 2.6% of genes surveyed, respectively. Many of these genes identified by loss of heterozygosity and allele-specific expression are known or putative tumor suppressor genes, such as BRCA1, MSH3 and SETX, which participate in DNA repair pathways. Conclusions Our results demonstrate that the combined application of high throughput sequencing to exome and allele-specific transcriptome analysis can reveal genes with known tumor suppressor characteristics, and a shortlist of novel candidates for the study of tumor suppressor activities.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Distinct patterns of somatic alterations in a lymphoblastoid and a tumor genome derived from the same individual

              Although patterns of somatic alterations have been reported for tumor genomes, little is known on how they compare with alterations present in non-tumor genomes. A comparison of the two would be crucial to better characterize the genetic alterations driving tumorigenesis. We sequenced the genomes of a lymphoblastoid (HCC1954BL) and a breast tumor (HCC1954) cell line derived from the same patient and compared the somatic alterations present in both. The lymphoblastoid genome presents a comparable number and similar spectrum of nucleotide substitutions to that found in the tumor genome. However, a significant difference in the ratio of non-synonymous to synonymous substitutions was observed between both genomes (P = 0.031). Protein–protein interaction analysis revealed that mutations in the tumor genome preferentially affect hub-genes (P = 0.0017) and are co-selected to present synergistic functions (P < 0.0001). KEGG analysis showed that in the tumor genome most mutated genes were organized into signaling pathways related to tumorigenesis. No such organization or synergy was observed in the lymphoblastoid genome. Our results indicate that endogenous mutagens and replication errors can generate the overall number of mutations required to drive tumorigenesis and that it is the combination rather than the frequency of mutations that is crucial to complete tumorigenic transformation.
                Bookmark

                Author and article information

                Conference
                Genome Biol
                Genome Biol
                Genome Biology
                BioMed Central
                1465-6906
                1465-6914
                2011
                19 September 2011
                : 12
                : Suppl 1
                : P28
                Affiliations
                [1 ]Regional Blood Center of Ribeirão Preto, Molecular Biology and Bioinformatics Laboratory, Ribeirão Preto, São Paulo 14051-140, Brazil
                [2 ]Barão de Mauá University, Ribeirão Preto, São Paulo 14026-150, Brazil
                [3 ]Department of Genetics, Medical School of Ribeirão Preto, University of São Paulo, Ribeirão Preto, São Paulo 14049-900, Brazil
                Article
                gb-2011-12-s1-p28
                10.1186/gb-2011-12-s1-p28
                3439072
                7a64551c-bfe2-497b-a452-2cec099f82f5
                Copyright ©2011 Silva et al; licensee BioMed Central Ltd.

                This is an open access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                Beyond the Genome 2011
                Washington, DC, USA
                19-22 September 2011
                History
                Categories
                Poster Presentation

                Genetics
                Genetics

                Comments

                Comment on this article