18
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Identification of Known and Novel Recurrent Viral Sequences in Data from Multiple Patients and Multiple Cancers

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Virus discovery from high throughput sequencing data often follows a bottom-up approach where taxonomic annotation takes place prior to association to disease. Albeit effective in some cases, the approach fails to detect novel pathogens and remote variants not present in reference databases. We have developed a species independent pipeline that utilises sequence clustering for the identification of nucleotide sequences that co-occur across multiple sequencing data instances. We applied the workflow to 686 sequencing libraries from 252 cancer samples of different cancer and tissue types, 32 non-template controls, and 24 test samples. Recurrent sequences were statistically associated to biological, methodological or technical features with the aim to identify novel pathogens or plausible contaminants that may associate to a particular kit or method. We provide examples of identified inhabitants of the healthy tissue flora as well as experimental contaminants. Unmapped sequences that co-occur with high statistical significance potentially represent the unknown sequence space where novel pathogens can be identified.

          Related collections

          Most cited references28

          • Record: found
          • Abstract: found
          • Article: not found

          Characterization and complete genome sequence of a novel coronavirus, coronavirus HKU1, from patients with pneumonia.

          Despite extensive laboratory investigations in patients with respiratory tract infections, no microbiological cause can be identified in a significant proportion of patients. In the past 3 years, several novel respiratory viruses, including human metapneumovirus, severe acute respiratory syndrome (SARS) coronavirus (SARS-CoV), and human coronavirus NL63, were discovered. Here we report the discovery of another novel coronavirus, coronavirus HKU1 (CoV-HKU1), from a 71-year-old man with pneumonia who had just returned from Shenzhen, China. Quantitative reverse transcription-PCR showed that the amount of CoV-HKU1 RNA was 8.5 to 9.6 x 10(6) copies per ml in his nasopharyngeal aspirates (NPAs) during the first week of the illness and dropped progressively to undetectable levels in subsequent weeks. He developed increasing serum levels of specific antibodies against the recombinant nucleocapsid protein of CoV-HKU1, with immunoglobulin M (IgM) titers of 1:20, 1:40, and 1:80 and IgG titers of <1:1,000, 1:2,000, and 1:8,000 in the first, second and fourth weeks of the illness, respectively. Isolation of the virus by using various cell lines, mixed neuron-glia culture, and intracerebral inoculation of suckling mice was unsuccessful. The complete genome sequence of CoV-HKU1 is a 29,926-nucleotide, polyadenylated RNA, with G+C content of 32%, the lowest among all known coronaviruses with available genome sequence. Phylogenetic analysis reveals that CoV-HKU1 is a new group 2 coronavirus. Screening of 400 NPAs, negative for SARS-CoV, from patients with respiratory illness during the SARS period identified the presence of CoV-HKU1 RNA in an additional specimen, with a viral load of 1.13 x 10(6) copies per ml, from a 35-year-old woman with pneumonia. Our data support the existence of a novel group 2 coronavirus associated with pneumonia in humans.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            A mathematical theory of communication

            C. Shannon (2001)
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Cloning of a human parvovirus by molecular screening of respiratory tract samples.

              The identification of new virus species is a key issue for the study of infectious disease but is technically very difficult. We developed a system for large-scale molecular virus screening of clinical samples based on host DNA depletion, random PCR amplification, large-scale sequencing, and bioinformatics. The technology was applied to pooled human respiratory tract samples. The first experiments detected seven human virus species without the use of any specific reagent. Among the detected viruses were one coronavirus and one parvovirus, both of which were at that time uncharacterized. The parvovirus, provisionally named human bocavirus, was in a retrospective clinical study detected in 17 additional patients and associated with lower respiratory tract infections in children. The molecular virus screening procedure provides a general culture-independent solution to the problem of detecting unknown virus species in single or pooled samples. We suggest that a systematic exploration of the viruses that infect humans, "the human virome," can be initiated.
                Bookmark

                Author and article information

                Contributors
                Role: Academic Editor
                Journal
                Viruses
                Viruses
                viruses
                Viruses
                MDPI
                1999-4915
                19 February 2016
                February 2016
                : 8
                : 2
                : 53
                Affiliations
                [1 ]Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, DK-2800 Kgs. Lyngby, Denmark; jef@ 123456cbs.dtu.dk (J.F.-N.); thomas@ 123456cbs.dtu.dk (T.S.-P.); lund@ 123456cbs.dtu.dk (O.L.); txema@ 123456cbs.dtu.dk (J.M.G.I.)
                [2 ]Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark; kristin.kjartansdottir@ 123456snm.ku.dk (K.R.K.); sarah.mollerup@ 123456snm.ku.dk (S.M.); amasplund@ 123456snm.ku.dk (M.A.); tmourier@ 123456snm.ku.dk (T.M.); randi.jensen@ 123456snm.ku.dk (R.H.J.); thomas.hansen@ 123456snm.ku.dk (T.A.H.); zld305@ 123456alumni.ku.dk (A.R.-I.); srichter@ 123456snm.ku.dk (S.R.R.); ida.nielsen@ 123456snm.ku.dk (I.B.N.); d.e.alquezar@ 123456gmail.com (D.E.A.-P.); pvsolsen@ 123456snm.ku.dk (P.V.S.O.); lasse.vinner@ 123456snm.ku.dk (L.V.); helena.fridholm@ 123456gmail.com (H.F.); ewillerslev@ 123456snm.ku.dk (E.W.); AJHansen@ 123456snm.ku.dk (A.J.H.)
                [3 ]Department of Autoimmunology and Biomarkers, Statens Serum Institut, DK-2300 Copenhagen S, Denmark; lpn@ 123456ssi.dk
                [4 ]NNF Center for Protein Research, University of Copenhagen, Blegdamsvej 3B, DK-2200 Copenhagen, Denmark
                Author notes
                [* ]Correspondence: soren.brunak@ 123456cpr.ku.dk ; Tel.: +45-45252477
                [†]

                These authors contributed equally to this work.

                Article
                viruses-08-00053
                10.3390/v8020053
                4776208
                26907326
                7699d665-3758-4c22-b53b-c25e6ec306f9
                © 2016 by the authors; licensee MDPI, Basel, Switzerland.

                This article is an open access article distributed under the terms and conditions of the Creative Commons by Attribution (CC-BY) license ( http://creativecommons.org/licenses/by/4.0/).

                History
                : 30 October 2015
                : 05 February 2016
                Categories
                Article

                Microbiology & Virology
                sequence clustering,taxonomic characterisation,novel sequence identification,next generation sequencing,cancer causing viruses,oncoviruses,assay contamination

                Comments

                Comment on this article