25
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Generation of Long Insert Pairs Using a Cre-LoxP Inverse PCR Approach

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Large insert mate pair reads have a major impact on the overall success of de novo assembly and the discovery of inherited and acquired structural variants. The positional information of mate pair reads generally improves genome assembly by resolving repeat elements and/or ordering contigs. Currently available methods for building such libraries have one or more of limitations, such as relatively small insert size; unable to distinguish the junction of two ends; and/or low throughput. We developed a new approach, Cre- LoxP Inverse PCR Paired- End (CLIP-PE), which exploits the advantages of (1) Cre-LoxP recombination system to efficiently circularize large DNA fragments, (2) inverse PCR to enrich for the desired products that contain both ends of the large DNA fragments, and (3) the use of restriction enzymes to introduce a recognizable junction site between ligated fragment ends and to improve the self-ligation efficiency. We have successfully created CLIP-PE libraries up to 22 kb that are rich in informative read pairs and low in small fragment background. These libraries have demonstrated the ability to improve genome assemblies. The CLIP-PE methodology can be implemented with existing and future next-generation sequencing platforms.

          Related collections

          Most cited references10

          • Record: found
          • Abstract: found
          • Article: not found

          Next-generation DNA sequencing of paired-end tags (PET) for transcriptome and genome analyses.

          Comprehensive understanding of functional elements in the human genome will require thorough interrogation and comparison of individual human genomes and genomic structures. Such an endeavor will require improvements in the throughputs and costs of DNA sequencing. Next-generation sequencing platforms have impressively low costs and high throughputs but are limited by short read lengths. An immediate and widely recognized solution to this critical limitation is the paired-end tag (PET) sequencing for various applications, collectively called the PET sequencing strategy, in which short and paired tags are extracted from the ends of long DNA fragments for ultra-high-throughput sequencing. The PET sequences can be accurately mapped to the reference genome, thus demarcating the genomic boundaries of PET-represented DNA fragments and revealing the identities of the target DNA elements. PET protocols have been developed for the analyses of transcriptomes, transcription factor binding sites, epigenetic sites such as histone modification sites, and genome structures. The exclusive advantage of the PET technology is its ability to uncover linkages between the two ends of DNA fragments. Using this unique feature, unconventional fusion transcripts, genome structural variations, and even molecular interactions between distant genomic elements can be unraveled by PET analysis. Extensive use of PET data could lead to efficient assembly of individual human genomes, transcriptomes, and interactomes, enabling new biological and clinical insights. With its versatile and powerful nature for DNA analysis, the PET sequencing strategy has a bright future ahead.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Bacteriophage P1 site-specific recombination. I. Recombination between loxP sites.

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Gene identification signature (GIS) analysis for transcriptome characterization and genome annotation.

              We have developed a DNA tag sequencing and mapping strategy called gene identification signature (GIS) analysis, in which 5' and 3' signatures of full-length cDNAs are accurately extracted into paired-end ditags (PETs) that are concatenated for efficient sequencing and mapped to genome sequences to demarcate the transcription boundaries of every gene. GIS analysis is potentially 30-fold more efficient than standard cDNA sequencing approaches for transcriptome characterization. We demonstrated this approach with 116,252 PET sequences derived from mouse embryonic stem cells. Initial analysis of this dataset identified hundreds of previously uncharacterized transcripts, including alternative transcripts of known genes. We also uncovered several intergenically spliced and unusual fusion transcripts, one of which was confirmed as a trans-splicing event and was differentially expressed. The concept of paired-end ditagging described here for transcriptome analysis can also be applied to whole-genome analysis of cis-regulatory and other DNA elements and represents an important technological advance for genome annotation.
                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                PLoS One
                plos
                plosone
                PLoS ONE
                Public Library of Science (San Francisco, USA )
                1932-6203
                2012
                9 January 2012
                : 7
                : 1
                : e29437
                Affiliations
                [1]Joint Genome Institute, United States Department of Energy, Walnut Creek, California, United States of America
                UCLA-DOE Institute for Genomics and Proteomics, United States of America
                Author notes

                Conceived and designed the experiments: ZP ZZ TZ JFC FC. Performed the experiments: ZP ZZ NN. Analyzed the data: ZP ZZ NN JLF AC JFC ACC LAP FC. Contributed reagents/materials/analysis tools: ZP ZZ NN JLF AC. Wrote the paper: ZP ZZ JLF TZ FC. Supervised the whole project: FC.

                Article
                PONE-D-11-15709
                10.1371/journal.pone.0029437
                3253782
                22253722
                615d2f43-5ad1-4c30-a770-7bb9adb22ec6
                Peng et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
                History
                : 12 August 2011
                : 29 November 2011
                Page count
                Pages: 8
                Categories
                Research Article
                Biology
                Computational Biology
                Genomics
                Genetics
                Molecular Genetics
                Genomics
                Science Policy
                Technology Development

                Uncategorized
                Uncategorized

                Comments

                Comment on this article