23
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Compression of DNA sequence reads in FASTQ format.

      Bioinformatics
      Algorithms, Base Sequence, Computational Biology, methods, Data Compression, Genomics, Internet, Sequence Analysis, DNA

      Read this article at

      ScienceOpenPublisherPubMed
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Modern sequencing instruments are able to generate at least hundreds of millions short reads of genomic data. Those huge volumes of data require effective means to store them, provide quick access to any record and enable fast decompression. We present a specialized compression algorithm for genomic data in FASTQ format which dominates its competitor, G-SQZ, as is shown on a number of datasets from the 1000 Genomes Project (www.1000genomes.org). DSRC is freely available at http:/sun.aei.polsl.pl/dsrc.

          Related collections

          Author and article information

          Journal
          21252073
          10.1093/bioinformatics/btr014

          Chemistry
          Algorithms,Base Sequence,Computational Biology,methods,Data Compression,Genomics,Internet,Sequence Analysis, DNA

          Comments

          Comment on this article