1
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Characterizing allele- and haplotype-specific copy numbers in single cells with CHISEL

      research-article
      1 , 1 , 2
      Nature biotechnology

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Single-cell barcoding technologies enable genome sequencing of thousands of individual cells in parallel, but with extremely low sequencing coverage (<0.05 × ) per cell. While the total copy number of large multi-megabase segments can be derived from such data, important allele-specific mutations – such as copy-neutral loss-of-heterozygosity (LOH) in cancer – are missed. We introduce Copy-number Haplotype Inference in Single-cells using Evolutionary Links (CHISEL), a method to infer allele- and haplotype-specific copy numbers in single cells and subpopulations of cells by aggregating sparse signal across hundreds or thousands of individual cells. We applied CHISEL to 10 single-cell sequencing datasets of ≈2 000 cells from two breast cancer patients. We identified extensive allele-specific copy-number aberrations (CNAs) in these samples, including copy-neutral LOHs, whole-genome duplications (WGDs), and mirrored-subclonal CNAs. These allele-specific CNAs affect genomic regions containing well-known breast cancer genes. We also refined the reconstruction of tumor evolution, timing allele-specific CNAs before and after WGDs, identifying low-frequency subpopulations distinguished by unique CNAs, and uncovering evidence of convergent evolution.

          Related collections

          Most cited references66

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          The Sequence Alignment/Map format and SAMtools

          Summary: The Sequence Alignment/Map (SAM) format is a generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by different sequencing platforms. It is flexible in style, compact in size, efficient in random access and is the format in which alignments from the 1000 Genomes Project are released. SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments. Availability: http://samtools.sourceforge.net Contact: rd@sanger.ac.uk
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data.

            Heng Li (2011)
            Most existing methods for DNA sequence analysis rely on accurate sequences or genotypes. However, in applications of the next-generation sequencing (NGS), accurate genotypes may not be easily obtained (e.g. multi-sample low-coverage sequencing or somatic mutation discovery). These applications press for the development of new methods for analyzing sequence data with uncertainty. We present a statistical framework for calling SNPs, discovering somatic mutations, inferring population genetical parameters and performing association tests directly based on sequencing data without explicit genotyping or linkage-based imputation. On real data, we demonstrate that our method achieves comparable accuracy to alternative methods for estimating site allele count, for inferring allele frequency spectrum and for association mapping. We also highlight the necessity of using symmetric datasets for finding somatic mutations and confirm that for discovering rare events, mismapping is frequently the leading source of errors. http://samtools.sourceforge.net. hengli@broadinstitute.org.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Next-generation genotype imputation service and methods.

              Genotype imputation is a key component of genetic association studies, where it increases power, facilitates meta-analysis, and aids interpretation of signals. Genotype imputation is computationally demanding and, with current tools, typically requires access to a high-performance computing cluster and to a reference panel of sequenced genomes. Here we describe improvements to imputation machinery that reduce computational requirements by more than an order of magnitude with no loss of accuracy in comparison to standard imputation tools. We also describe a new web-based service for imputation that facilitates access to new reference panels and greatly improves user experience and productivity.
                Bookmark

                Author and article information

                Journal
                9604648
                20305
                Nat Biotechnol
                Nat Biotechnol
                Nature biotechnology
                1087-0156
                1546-1696
                10 September 2020
                February 2021
                02 September 2020
                25 January 2023
                : 39
                : 2
                : 207-214
                Affiliations
                [1 ]Department of Computer Science, Princeton University, Princeton, NJ, USA
                [2 ]Member, Rutgers Cancer Institute of New Jersey, New Brunswick, NJ, USA
                Author notes

                Author contributions

                S.Z. and B.J.R. conceived the project, developed the theory and algorithms, and wrote the paper; S.Z. implemented the algorithms and performed the analyses.

                [* ]Correspondence: braphael@ 123456princeton.edu
                Article
                NIHMS1618000
                10.1038/s41587-020-0661-6
                9876616
                32879467
                bd2daffe-6b0f-4bdb-910c-97dd6b498f76

                Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms

                History
                Categories
                Article

                Biotechnology
                Biotechnology

                Comments

                Comment on this article