7
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Targeted nanopore sequencing by real-time mapping of raw electrical signal with UNCALLED

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Conventional targeted sequencing methods eliminate many of the benefits of nanopore sequencing, such as the ability to accurately detect structural variants (SVs) or epigenetic modifications. The ReadUntil method allows nanopore devices to selectively eject reads from pores in real-time, which could enable purely computational targeted sequencing. However this requires rapid identification of on-target reads, and most mapping methods require computationally intensive basecalling. We present UNCALLED ( github.com/skovaka/UNCALLED), an open-source mapper that rapidly matches streaming nanopore current signals to a reference sequence. UNCALLED probabilistically considers k-mers that the signal could represent, and then prunes the candidates based on the reference encoded within an FM-index. We used UNCALLED to deplete sequencing of known bacterial genomes within a metagenomics community, enriching the remaining species by 4.46 fold. UNCALLED also enriched 148 human genes associated with hereditary cancers to 29.6x coverage using one MinION flowcell, enabling accurate detection of SNPs, indels, SVs, and methylation in these genes.

          Related collections

          Most cited references50

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          BEDTools: a flexible suite of utilities for comparing genomic features

          Motivation: Testing for correlations between different sets of genomic features is a fundamental task in genomics research. However, searching for overlaps between features with existing web-based methods is complicated by the massive datasets that are routinely produced with current sequencing technologies. Fast and flexible tools are therefore required to ask complex questions of these data in an efficient manner. Results: This article introduces a new software suite for the comparison, manipulation and annotation of genomic features in Browser Extensible Data (BED) and General Feature Format (GFF) format. BEDTools also supports the comparison of sequence alignments in BAM format to both BED and GFF features. The tools are extremely efficient and allow the user to compare large datasets (e.g. next-generation sequencing data) with both public and custom genome annotation tracks. BEDTools can be combined with one another as well as with standard UNIX commands, thus facilitating routine genomics tasks as well as pipelines that can quickly answer intricate questions of large genomic datasets. Availability and implementation: BEDTools was written in C++. Source code and a comprehensive user manual are freely available at http://code.google.com/p/bedtools Contact: aaronquinlan@gmail.com; imh4y@virginia.edu Supplementary information: Supplementary data are available at Bioinformatics online.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            HISAT: a fast spliced aligner with low memory requirements.

            HISAT (hierarchical indexing for spliced alignment of transcripts) is a highly efficient system for aligning reads from RNA sequencing experiments. HISAT uses an indexing scheme based on the Burrows-Wheeler transform and the Ferragina-Manzini (FM) index, employing two types of indexes for alignment: a whole-genome FM index to anchor each alignment and numerous local FM indexes for very rapid extensions of these alignments. HISAT's hierarchical index for the human genome contains 48,000 local FM indexes, each representing a genomic region of ∼64,000 bp. Tests on real and simulated data sets showed that HISAT is the fastest system currently available, with equal or better accuracy than any other method. Despite its large number of indexes, HISAT requires only 4.3 gigabytes of memory. HISAT supports genomes of any size, including those larger than 4 billion bases.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Minimap2: pairwise alignment for nucleotide sequences

              Heng Li (2018)
              Recent advances in sequencing technologies promise ultra-long reads of ∼100 kb in average, full-length mRNA or cDNA reads in high throughput and genomic contigs over 100 Mb in length. Existing alignment programs are unable or inefficient to process such data at scale, which presses for the development of new alignment algorithms.
                Bookmark

                Author and article information

                Journal
                9604648
                20305
                Nat Biotechnol
                Nat Biotechnol
                Nature biotechnology
                1087-0156
                1546-1696
                11 October 2020
                30 November 2020
                April 2021
                04 November 2021
                : 39
                : 4
                : 431-441
                Affiliations
                [1. ]Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
                [2. ]Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
                [3. ]Department of Biology, Johns Hopkins University, Baltimore, MD, USA
                [4. ]Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
                Author notes

                Contributions

                SK and MCS designed UNCALLED. SK implemented UNCALLED. BN and SK benchmarked UNCALLED. YF performed all sequencing library preparation. SK computed enrichment levels for all experiments and performed small variant and structural variant detection and analysis. YF performed methylation detection and analysis. WT supervised sequencing runs and advised on the experimental design. MCS supervised the entire project. All authors contributed to writing the manuscript. All authors read and approve the final manuscript.

                Corresponding author: Sam Kovaka ( skovaka1@ 123456jhu.edu )
                Article
                NIHMS1636148
                10.1038/s41587-020-0731-9
                8567335
                33257863
                a228cc5f-f9bc-48b5-8af0-a292ddc2dc72

                Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms

                History
                Categories
                Article

                Biotechnology
                Biotechnology

                Comments

                Comment on this article