15
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Dual redundant sequencing strategy: Full‐length gene characterisation of 1056 novel and confirmatory HLA alleles

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The high‐throughput department of DKMS Life Science Lab encounters novel human leukocyte antigen ( HLA) alleles on a daily basis. To characterise these alleles, we have developed a system to sequence the whole gene from 5′‐ to 3′‐ UTR for the HLA loci A, B, C, DQB1 and DPB1 for submission to the European Molecular Biology Laboratory – European Nucleotide Archive ( EMBL‐ENA) and the IPD‐IMGT/ HLA Database. Our workflow is based on a dual redundant sequencing strategy. Using shotgun sequencing on an Illumina MiSeq instrument and single molecule real‐time ( SMRT) sequencing on a PacBio RS II instrument, we are able to achieve highly accurate HLA full‐length consensus sequences. Remaining conflicts are resolved using the R package DR2S (Dual Redundant Reference Sequencing). Given the relatively high throughput of this strategy, we have developed the semi‐automated web service TypeLoader, to aid in the submission of sequences to the EMBL‐ENA and the IPD‐IMGT/ HLA Database. In the IPD‐IMGT/ HLA Database release 3.24.0 (April 2016; prior to the submission of the sequences described here), only 5.2% of all known HLA alleles have been fully characterised together with intronic and UTR sequences. So far, we have applied our strategy to characterise and submit 1056 HLA alleles, thereby more than doubling the number of fully characterised alleles. Given the increasing application of next generation sequencing ( NGS) for full gene characterisation in clinical practice, extending the HLA database concomitantly is highly desirable. Therefore, we propose this dual redundant sequencing strategy as a workflow for submission of novel full‐length alleles and characterisation of sequences that are as yet incomplete. This would help to mitigate the predominance of partially known alleles in the database.

          Related collections

          Most cited references15

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Comparison of Next-Generation Sequencing Systems

          With fast development and wide applications of next-generation sequencing (NGS) technologies, genomic sequence information is within reach to aid the achievement of goals to decode life mysteries, make better crops, detect pathogens, and improve life qualities. NGS systems are typically represented by SOLiD/Ion Torrent PGM from Life Sciences, Genome Analyzer/HiSeq 2000/MiSeq from Illumina, and GS FLX Titanium/GS Junior from Roche. Beijing Genomics Institute (BGI), which possesses the world's biggest sequencing capacity, has multiple NGS systems including 137 HiSeq 2000, 27 SOLiD, one Ion Torrent PGM, one MiSeq, and one 454 sequencer. We have accumulated extensive experience in sample handling, sequencing, and bioinformatics analysis. In this paper, technologies of these systems are reviewed, and first-hand data from extensive experience is summarized and analyzed to discuss the advantages and specifics associated with each sequencing system. At last, applications of NGS are summarized.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Gene map of the extended human MHC.

            The major histocompatibility complex (MHC) is the most important region in the vertebrate genome with respect to infection and autoimmunity, and is crucial in adaptive and innate immunity. Decades of biomedical research have revealed many MHC genes that are duplicated, polymorphic and associated with more diseases than any other region of the human genome. The recent completion of several large-scale studies offers the opportunity to assimilate the latest data into an integrated gene map of the extended human MHC. Here, we present this map and review its content in relation to paralogy, polymorphism, immune function and disease.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Improving PacBio Long Read Accuracy by Short Read Alignment

              The recent development of third generation sequencing (TGS) generates much longer reads than second generation sequencing (SGS) and thus provides a chance to solve problems that are difficult to study through SGS alone. However, higher raw read error rates are an intrinsic drawback in most TGS technologies. Here we present a computational method, LSC, to perform error correction of TGS long reads (LR) by SGS short reads (SR). Aiming to reduce the error rate in homopolymer runs in the main TGS platform, the PacBio® RS, LSC applies a homopolymer compression (HC) transformation strategy to increase the sensitivity of SR-LR alignment without scarifying alignment accuracy. We applied LSC to 100,000 PacBio long reads from human brain cerebellum RNA-seq data and 64 million single-end 75 bp reads from human brain RNA-seq data. The results show LSC can correct PacBio long reads to reduce the error rate by more than 3 folds. The improved accuracy greatly benefits many downstream analyses, such as directional gene isoform detection in RNA-seq study. Compared with another hybrid correction tool, LSC can achieve over double the sensitivity and similar specificity.
                Bookmark

                Author and article information

                Contributors
                albrecht@dkms-lab.de
                Journal
                HLA
                HLA
                10.1111/(ISSN)2059-2310
                TAN
                Hla
                Blackwell Publishing Ltd (Oxford, UK )
                2059-2302
                2059-2310
                25 May 2017
                August 2017
                : 90
                : 2 ( doiID: 10.1111/tan.2017.90.issue-2 )
                : 79-87
                Affiliations
                [ 1 ] DKMS Life Science Lab Dresden Germany
                [ 2 ] Deep Sequencing Group CRTD – Center for Regenerative Therapies Dresden Dresden Germany
                [ 3 ] DNA Sequencing, Max Planck Institute of Molecular Cell Biology and Genetics Dresden Germany
                [ 4 ] DKMS Tübingen Germany
                Author notes
                [*] [* ] Correspondence

                Viviane Albrecht, DKMS Life Science Lab, Dresden, Germany.

                Email: albrecht@ 123456dkms-lab.de

                Author information
                http://orcid.org/0000-0001-6304-3989
                http://orcid.org/0000-0003-1593-6461
                http://orcid.org/0000-0003-3000-3205
                http://orcid.org/0000-0002-6442-9573
                Article
                TAN13057
                10.1111/tan.13057
                6084308
                28547825
                ee6ba57b-0987-4731-8f94-7ffc2782ca9a
                © 2017 The Authors. HLA: Immune Response Genetics Published by John Wiley & Sons Ltd.

                This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.

                History
                : 05 September 2016
                : 19 April 2017
                : 01 May 2017
                Page count
                Figures: 4, Tables: 5, Pages: 1, Words: 5697
                Categories
                Original Article
                Original Articles
                Custom metadata
                2.0
                tan13057
                tan13057-hdr-0001
                August 2017
                Converter:WILEY_ML3GV2_TO_NLMPMC version:version=5.4.4 mode:remove_FC converted:09.08.2018

                full‐length gene sequencing,hla typing,ngs,novel hla alleles,pacbio

                Comments

                Comment on this article