
      Classification and identification of proteins by means of common and specific amino acid n-tuples in unaligned sequences

      Computer Methods and Programs in Biomedicine
      Elsevier BV


          Abstract

          Unaligned amino acid sequences can be characterized by their composition of amino acid n-tuples (i.e. doublets, triplets, quadruplets, etc.). In this study we investigated the performance of two statistics, termed commonality and specificity, that are derived from n-tuple counts using a set of G-protein coupled receptor (GPCR) sequences. The commonality of a tuple is defined as its relative occurrence in the sequences that belong to a given GPCR subtype. The specificity of a tuple is derived from its relative occurrence in the sequences of a given GPCR subtype and from its relative non-occurrence in the sequences that do not belong to this subtype. A graphical presentation, termed 'polygram', is described for the visualization of common and specific tuples. The method can be applied to the classification of unknown GPCR sequences. It can also be applied to the identification of fragments of GPCRs, such as may occur in chimeric receptors. The method is generally applicable to other protein families and other types of coding.
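          The two statistics described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formulas, so the following are assumptions: "relative occurrence" is taken to mean the fraction of sequences that contain the tuple at least once, and specificity is taken to be the product of in-group occurrence and out-group non-occurrence. The function names and the toy sequences are hypothetical and are not real GPCR data.

```python
def tuples_in(seq, n):
    """Return the set of distinct overlapping n-tuples in an unaligned sequence."""
    return {seq[i:i + n] for i in range(len(seq) - n + 1)}


def commonality(tup, seqs):
    """Fraction of sequences in `seqs` that contain `tup` at least once.

    ASSUMPTION: 'relative occurrence' is read as presence/absence per sequence.
    """
    n = len(tup)
    return sum(tup in tuples_in(s, n) for s in seqs) / len(seqs)


def specificity(tup, in_group, out_group):
    """Combine occurrence inside a subtype with non-occurrence outside it.

    ASSUMPTION: the product form is illustrative only; the abstract states
    which two quantities are combined but not how.
    """
    return commonality(tup, in_group) * (1.0 - commonality(tup, out_group))


# Hypothetical toy sequences (not real receptor sequences).
in_group = ["MKTLLV", "AKTLQV", "MKTAAV"]
out_group = ["MKGGGV", "PPTLPP"]

for tup in ("KT", "TL", "MK"):
    print(tup, commonality(tup, in_group), specificity(tup, in_group, out_group))
```

          In this toy data the doublet "KT" occurs in every in-group sequence and in no out-group sequence, so it is both fully common and fully specific under these assumptions, whereas "TL" and "MK" also occur outside the group and score lower. An unknown sequence could then be assigned to the subtype whose high-scoring tuples it shares most, which is one plausible reading of the classification use mentioned in the abstract.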


          Author and article information

          Journal
          Computer Methods and Programs in Biomedicine
          Elsevier BV
          ISSN: 0169-2607
          June 1998
          Volume: 56
          Issue: 3
          Pages: 221-233
          Article
          10.1016/S0169-2607(98)00031-5
          PMID: 9725648
          1fc2ff54-9ff4-416e-8a6a-d4035c65e936
          © 1998

          https://www.elsevier.com/tdm/userlicense/1.0/
