128
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Combining tumor genome simulation with crowdsourcing to benchmark somatic single-nucleotide-variant detection

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The detection of somatic mutations from cancer genome sequences is key to understanding the genetic basis of disease progression, patient survival and response to therapy. Benchmarking is needed for tool assessment and improvement but is complicated by a lack of gold standards, by extensive resource requirements and by difficulties in sharing personal genomic information. To resolve these issues, we launched the ICGC-TCGA DREAM Somatic Mutation Calling Challenge, a crowdsourced benchmark of somatic mutation detection algorithms. Here we report the BAMSurgeon tool for simulating cancer genomes and the results of 248 analyses of three in silico tumors created with it. Different algorithms exhibit characteristic error profiles, and, intriguingly, false positives show a trinucleotide profile very similar to one found in human tumors. Although the three simulated tumors differ in sequence contamination (deviation from normal cell sequence) and in subclonality, an ensemble of pipelines outperforms the best individual pipeline in all cases. BAMSurgeon is available at https://github.com/adamewing/bamsurgeon/.

          Related collections

          Most cited references40

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          The Sequence Alignment/Map format and SAMtools

          Summary: The Sequence Alignment/Map (SAM) format is a generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by different sequencing platforms. It is flexible in style, compact in size, efficient in random access and is the format in which alignments from the 1000 Genomes Project are released. SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments. Availability: http://samtools.sourceforge.net Contact: rd@sanger.ac.uk
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Fast gapped-read alignment with Bowtie 2.

            As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Fast and accurate short read alignment with Burrows–Wheeler transform

              Motivation: The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals. Results: We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ∼10–20× faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package. Availability: http://maq.sourceforge.net Contact: rd@sanger.ac.uk
                Bookmark

                Author and article information

                Journal
                101215604
                32338
                Nat Methods
                Nat. Methods
                Nature methods
                1548-7091
                1548-7105
                27 April 2016
                18 May 2015
                July 2015
                04 May 2016
                : 12
                : 7
                : 623-630
                Affiliations
                [1 ]Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, California, USA
                [2 ]Mater Research Institute, University of Queensland, Woolloongabba, Queensland, Australia
                [3 ]Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada
                [4 ]Sage Bionetworks, Seattle, Washington, USA
                [5 ]A list of members and affiliations appears at the end of the paper
                [6 ]IBM Computational Biology Center, T.J. Watson Research Center, Yorktown Heights, New York, USA
                [7 ]Computational Biology Program, Oregon Health & Science University, Portland, Oregon, USA
                [8 ]Department of Biomedical Engineering, Oregon Health & Science University, Portland, Oregon, USA
                [9 ]Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
                [10 ]Department of Pharmacology & Toxicology, University of Toronto, Toronto, Ontario, Canada
                Author notes
                Correspondence should be addressed to P.C.B. ( paul.boutros@ 123456oicr.on.ca ) or A.D.E. ( adam.ewing@ 123456mater.uq.edu.au )
                [11]

                These authors contributed equally to this work.

                [12]

                These authors jointly supervised this work.

                ICGC-TCGA DREAM Somatic Mutation Calling Challenge participants

                Liu Xi 13, Ninad Dewal 13, Yu Fan 14, Wenyi Wang 14, David Wheeler 13,15, Andreas Wilm 16, Grace Hui Ting 16, Chenhao Li 16, Denis Bertrand 16, Niranjan Nagarajan 16, Qing-Rong Chen 17, Chih-Hao Hsu 17, Ying Hu 17, Chunhua Yan 17, Warren Kibbe 17, Daoud Meerzaman 17, Kristian Cibulskis 18, Mara Rosenberg 18, Louis Bergelson 18, Adam Kiezun 18, Amie Radenbaugh 1, Anne-Sophie Sertier 19, Anthony Ferrari 19, Laurie Tonton 19, Kunal Bhutani 20, Nancy F Hansen 21, Difei Wang 22,23, Lei Song 23, Zhongwu Lai 24, Yang Liao 25, Wei Shi 26, José Carbonell-Caballero 27, Joaquín Dopazo 27, Cheryl C K Lau 3 & Justin Guinney 4

                13Team Wang Wheeler HGSC, Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA. 14Team Wang Wheeler HGSC, Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA. 15Team Wang Wheeler HGSC, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA. 16Team LoFreq Somatic GIS, Genome Institute of Singapore, Computational and Systems Biology, Singapore. 17Team DMUT, Center for Biomedical Informatics and Information Technology, National Cancer Institute, National Institutes of Health (NIH), Bethesda, Maryland, USA. 18Team Broad, The Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA. 19Team SLC Platform, Synergie Lyon Cancer Foundation, Centre Léon Bérard, Lyon, France. 20Team Virmid, Bioinformatics and Systems Biology, University of California, San Diego, La Jolla, California, USA. 21Team Shimmer, National Human Genome Research Institute, NIH, Bethesda, Maryland, USA. 22Team 2014, Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University Medical Center, Washington, DC, USA. 23Team 2014, Innovation Center for Biomedical Informatics, Georgetown University Medical Center, Washington, DC, USA. 24Team AstraZeneca, AstraZeneca, Waltham, Massachusetts, USA. 25Team WEHI-Subread, Department of Medical Biology, The University of Melbourne, Melbourne, Victoria, Australia. 26Team WEHI-Subread, Department of Computing and Information Systems, The University of Melbourne, Melbourne, Victoria, Australia. 27Team Germmatic, Functional Genomics Node (INB) at Príncipe Felipe Research Center (CIPF), Valencia, Spain.

                Article
                NIHMS781059
                10.1038/nmeth.3407
                4856034
                25984700
                ba4d4a52-639a-4e0f-b873-0653def9077c

                Reprints and permissions information is available online at http://www.nature.com/reprints/index.html.

                This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/.

                History
                Categories
                Article

                Life sciences
                Life sciences

                Comments

                Comment on this article