113
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      PEMer: a computational framework with simulation-based error models for inferring genomic structural variants from massive paired-end sequencing data

      product-review

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Paired-End Mapper (PEMer) enables mapping of genomic structural variants at considerably enhanced sensitivity, specificity and resolution over previous approaches.

          Abstract

          Personal-genomics endeavors, such as the 1000 Genomes project, are generating maps of genomic structural variants by analyzing ends of massively sequenced genome fragments. To process these we developed Paired-End Mapper ( PEMer; http://sv.gersteinlab.org/pemer). This comprises an analysis pipeline, compatible with several next-generation sequencing platforms; simulation-based error models, yielding confidence-values for each structural variant; and a back-end database. The simulations demonstrated high structural variant reconstruction efficiency for PEMer's coverage-adjusted multi-cutoff scoring-strategy and showed its relative insensitivity to base-calling errors.

          Related collections

          Most cited references29

          • Record: found
          • Abstract: not found
          • Article: not found

          Identification of common molecular subsequences.

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Global variation in copy number in the human genome.

            Copy number variation (CNV) of DNA sequences is functionally significant but has yet to be fully ascertained. We have constructed a first-generation CNV map of the human genome through the study of 270 individuals from four populations with ancestry in Europe, Africa or Asia (the HapMap collection). DNA from these individuals was screened for CNV using two complementary technologies: single-nucleotide polymorphism (SNP) genotyping arrays, and clone-based comparative genomic hybridization. A total of 1,447 copy number variable regions (CNVRs), which can encompass overlapping or adjacent gains or losses, covering 360 megabases (12% of the genome) were identified in these populations. These CNVRs contained hundreds of genes, disease loci, functional elements and segmental duplications. Notably, the CNVRs encompassed more nucleotide content per genome than SNPs, underscoring the importance of CNV in genetic diversity and evolution. The data obtained delineate linkage disequilibrium patterns for many CNVs, and reveal marked variation in copy number among populations. We also demonstrate the utility of this resource for genetic disease studies.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Structural variation in the human genome.

              The first wave of information from the analysis of the human genome revealed SNPs to be the main source of genetic and phenotypic human variation. However, the advent of genome-scanning technologies has now uncovered an unexpectedly large extent of what we term 'structural variation' in the human genome. This comprises microscopic and, more commonly, submicroscopic variants, which include deletions, duplications and large-scale copy-number variants - collectively termed copy-number variants or copy-number polymorphisms - as well as insertions, inversions and translocations. Rapidly accumulating evidence indicates that structural variants can comprise millions of nucleotides of heterogeneity within every genome, and are likely to make an important contribution to human diversity and disease susceptibility.
                Bookmark

                Author and article information

                Journal
                Genome Biol
                Genome Biology
                BioMed Central
                1465-6906
                1465-6914
                2009
                23 February 2009
                : 10
                : 2
                : R23
                Affiliations
                [1 ]Gene Expression Unit, European Molecular Biology Laboratory (EMBL), Meyerhofstr., Heidelberg, 69117, Germany
                [2 ]EMBL Outstation Hinxton, EMBL-European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
                [3 ]Molecular Biophysics and Biochemistry Department, Yale University, Whitney Ave, New Haven, CT 06520, USA
                [4 ]Department of Molecular, Cellular, and Developmental Biology, Yale University, Whitney Ave, New Haven, CT 06520, USA
                [5 ]Department of Computer Science, Yale University, Prospect Street, New Haven, CT 06511, USA
                [6 ]Program in Computational Biology and Bioinformatics, Yale University, Whitney Ave, New Haven, CT 06520, USA
                Article
                gb-2009-10-2-r23
                10.1186/gb-2009-10-2-r23
                2688268
                19236709
                fe0fd73d-d082-4b09-a50c-0274b2a00a2d
                Copyright © 2009 Korbel et al.; licensee BioMed Central Ltd.

                This is an open access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 1 September 2008
                : 22 December 2008
                : 23 February 2009
                Categories
                Software

                Genetics
                Genetics

                Comments

                Comment on this article