• Record: found
  • Abstract: found
  • Article: not found

Markov entropy backbone electrostatic descriptors for predicting proteins biological activity.

Bioorganic & Medicinal Chemistry Letters

pharmacology, chemistry, Tetrahydrofolate Dehydrogenase, Static Electricity, Quantitative Structure-Activity Relationship, Proteins, Probability, Muramidase, Markov Chains, Linear Models, Alcohol Dehydrogenase

Read this article at

      There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.


      The spherical truncation of electrostatic interactions between aminoacids makes it possible to break down long-range spatial electrostatic interactions, resulting in short-range interactions. As a result, a Markov Chain model may be used to calculate the probabilities with which the effect of a given interaction reaches aminoacids at different distances within the backbone. The entropies of a Markov Chain model of this type may then be used to codify information about the spatial distribution of charges in the protein used in this study exploring the structure-activity relationship. In this paper, a linear discriminant analysis is reported, which correctly classified 92.3% of 26 under investigation in training and leave-one-out cross validation, purely for illustrative purposes. Classification was carried out for three possible activities: lysozymes, dihydrofolate reductases, and alcohol dehydrogenases. The discriminant analysis equations were contracted into two canonical roots. These simple canonical roots have high regression coefficients (R(c1)=0.903 and R(c2)=0.70). Root1 explains the biological activity of alcohol dehydrogenases while Root2 discriminates between lysozymes and dihydrofolate reductases. It was possible to profile the effect of core, middle, and surface aminoacids on biological activity. In contrast, a model considering classic physicochemical parameters such as: polarizability, refractivity, and partition coefficient classify correctly only the 80.8% of the proteins.

      Related collections

      Author and article information



      Comment on this article