      Search and clustering orders of magnitude faster than BLAST.

      Bioinformatics

      methods; Sequence Analysis, Protein; Sequence Alignment; chemistry; Proteins; Databases, Protein; Computational Biology; Cluster Analysis; Algorithms


          Abstract

          Biological sequence data is accumulating rapidly, motivating the development of improved high-throughput methods for sequence classification. UBLAST and USEARCH are new algorithms enabling sensitive local and global search of large sequence databases at exceptionally high speeds. They are often orders of magnitude faster than BLAST in practical applications, though sensitivity to distant protein relationships is lower. UCLUST is a new clustering method that exploits USEARCH to assign sequences to clusters. UCLUST offers several advantages over the widely used program CD-HIT, including higher speed, lower memory use, improved sensitivity, clustering at lower identities and classification of much larger datasets. Binaries are available at no charge for non-commercial use at http://www.drive5.com/usearch.
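          The abstract does not describe how UCLUST works internally. As a rough illustration of assigning sequences to clusters by searching them against cluster representatives, here is a minimal sketch of generic greedy centroid-based clustering. It is not the UCLUST implementation: `toy_identity` and `greedy_cluster` are hypothetical names, and the position-by-position identity is a toy stand-in for a real alignment-based search such as USEARCH.

```python
from typing import Callable, List, Tuple


def toy_identity(a: str, b: str) -> float:
    """Toy stand-in for an alignment-based identity (assumption, not USEARCH):
    fraction of matching positions, normalised by the longer sequence."""
    if not a or not b:
        return 0.0
    matches = sum(x == y for x, y in zip(a, b))
    return matches / max(len(a), len(b))


def greedy_cluster(
    seqs: List[str],
    threshold: float,
    identity: Callable[[str, str], float] = toy_identity,
) -> Tuple[List[str], List[int]]:
    """Process sequences in input order. Each sequence joins the best-matching
    existing centroid if its identity meets the threshold; otherwise it
    becomes the centroid of a new cluster."""
    centroids: List[str] = []
    assignments: List[int] = []
    for seq in seqs:
        best_idx, best_score = -1, 0.0
        # A real tool would use a fast database search here instead of
        # comparing against every centroid.
        for idx, centroid in enumerate(centroids):
            score = identity(seq, centroid)
            if score > best_score:
                best_idx, best_score = idx, score
        if best_idx >= 0 and best_score >= threshold:
            assignments.append(best_idx)
        else:
            centroids.append(seq)
            assignments.append(len(centroids) - 1)
    return centroids, assignments


if __name__ == "__main__":
    demo = ["ACGTACGT", "ACGTACGA", "TTTTGGGG", "TTTTGGGC", "ACGTACGT"]
    cents, assign = greedy_cluster(demo, threshold=0.8)
    print(cents)
    print(assign)
```

          In this kind of scheme the inner loop dominates the running time, so a faster search step speeds up the whole clustering.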

          Author and article information

          DOI: 10.1093/bioinformatics/btq461
          PMID: 20709691
