
      Finding Protein and Nucleotide Similarities with FASTA

      research-article


          Abstract

          The FASTA programs provide a comprehensive set of rapid similarity searching tools (fasta36, fastx36, tfastx36, fasty36, tfasty36), similar to those provided by the BLAST package, as well as programs for slower, optimal, local and global similarity searches (ssearch36, ggsearch36) and for searching with short peptides and oligonucleotides (fasts36, fastm36). The FASTA programs use an empirical strategy for estimating statistical significance that accommodates a range of similarity scoring matrices and gap penalties, improving alignment boundary accuracy and search sensitivity (Unit 3.5). The FASTA programs can produce “BLAST-like” alignment and tabular output, for ease of integration into existing analysis pipelines. They can also search small, representative databases and then report results for a larger set of sequences, using links from the smaller dataset. The FASTA programs work with a wide variety of database formats, including MySQL and PostgreSQL databases (Unit 9.4). The programs also provide a strategy for integrating domain and active site annotations into alignments and highlighting the mutational state of functionally critical residues. These protocols describe how to use the FASTA programs to characterize protein and DNA sequences, using protein:protein, protein:DNA, and DNA:DNA comparisons.
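As an illustration of the pipeline integration mentioned in the abstract, the following is a minimal sketch of reading BLAST-style tabular search output in Python. It assumes the output follows the standard 12-column, tab-separated BLAST tabular layout; the abstract only says "BLAST-like", so check this against the actual program output. The names `Hit`, `parse_tabular`, `significant_hits`, and the `SAMPLE` rows are hypothetical and invented for this example; they are not part of the FASTA package or real search results.

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    """One row of 12-column BLAST-style tabular output (assumed layout)."""
    query: str
    subject: str
    percent_identity: float
    alignment_length: int
    mismatches: int
    gap_opens: int
    q_start: int
    q_end: int
    s_start: int
    s_end: int
    evalue: float
    bit_score: float


def parse_tabular(text: str) -> List[Hit]:
    """Parse tab-separated hit lines, skipping blank lines and '#' comments."""
    hits = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) < 12:
            raise ValueError(f"expected 12 columns, got {len(fields)}: {line!r}")
        hits.append(
            Hit(
                fields[0],
                fields[1],
                float(fields[2]),
                *(int(x) for x in fields[3:10]),
                float(fields[10]),
                float(fields[11]),
            )
        )
    return hits


def significant_hits(hits: List[Hit], max_evalue: float = 1e-3) -> List[Hit]:
    """Keep hits whose expectation value is at or below the threshold."""
    return [h for h in hits if h.evalue <= max_evalue]


# Made-up rows for illustration only; not real search results.
SAMPLE = (
    "# hypothetical comment line\n"
    "query1\tsubjA\t45.2\t210\t105\t4\t1\t208\t3\t211\t2.1e-30\t120.5\n"
    "query1\tsubjB\t24.0\t75\t50\t3\t10\t82\t40\t113\t3.7\t22.1\n"
)

if __name__ == "__main__":
    for hit in significant_hits(parse_tabular(SAMPLE)):
        print(hit.subject, hit.evalue)
```

The threshold of 1e-3 is only a placeholder; an appropriate E()-value cutoff depends on the search and the database.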


          Author and article information

          Contributors
          Journal
          101157830
          34237
          Curr Protoc Bioinformatics
          Current protocols in bioinformatics
          1934-3396
          1934-340X
          19 July 2016
          24 March 2016
          24 March 2016
          24 March 2017
          Volume: 53
          Pages: 3.9.1-3.9.25
          Affiliations
          University of Virginia School of Medicine, Charlottesville, Virginia
          Article
          PMC5072362 nihpa799150
          10.1002/0471250953.bi0309s53
          5072362
          27010337
          609a3408-0934-433d-ba3a-cefd48dfae66
          History
          Categories
          Article

          scoring matrices, alignment annotation, E()-value, expectation, homology, similarity
