36
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A structural-alphabet-based strategy for finding structural motifs across protein families

      research-article
      1 , 2 , 1 , 2 , *
      Nucleic Acids Research
      Oxford University Press

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Proteins with insignificant sequence and overall structure similarity may still share locally conserved contiguous structural segments; i.e. structural/3D motifs. Most methods for finding 3D motifs require a known motif to search for other similar structures or functionally/structurally crucial residues. Here, without requiring a query motif or essential residues, a fully automated method for discovering 3D motifs of various sizes across protein families with different folds based on a 16-letter structural alphabet is presented. It was applied to structurally non-redundant proteins bound to DNA, RNA, obligate/non-obligate proteins as well as free DNA-binding proteins (DBPs) and proteins with known structures but unknown function. Its usefulness was illustrated by analyzing the 3D motifs found in DBPs. A non-specific motif was found with a ‘corner’ architecture that confers a stable scaffold and enables diverse interactions, making it suitable for binding not only DNA but also RNA and proteins. Furthermore, DNA-specific motifs present ‘only’ in DBPs were discovered . The motifs found can provide useful guidelines in detecting binding sites and computational protein redesign.

          Related collections

          Most cited references40

          • Record: found
          • Abstract: found
          • Article: not found

          Satisfying hydrogen bonding potential in proteins.

          We have analysed the frequency with which potential hydrogen bond donors and acceptors are satisfied in protein molecules. There are a small percentage of nitrogen or oxygen atoms that do not form hydrogen bonds with either solvent or protein atoms, when standard criteria are used. For high resolution structures 9.5% and 5.1% of buried main-chain nitrogen and oxygen atoms, respectively, fail to hydrogen bond under our standard criteria, representing 5.8% and 2.1% of all main-chain nitrogen and oxygen atoms. We find that as the resolution of the data improves, the percentages fall. If the hydrogen bond criteria are relaxed many of these unsatisfied atoms form weak hydrogen bonds. However, there remain some buried atoms (1.3% NH and 1.8% CO) that fail to hydrogen bond without any immediately obvious compensating interactions.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            A method for simultaneous alignment of multiple protein structures.

            Here, we present MultiProt, a fully automated highly efficient technique to detect multiple structural alignments of protein structures. MultiProt finds the common geometrical cores between input molecules. To date, most methods for multiple alignment start from the pairwise alignment solutions. This may lead to a small overall alignment. In contrast, our method derives multiple alignments from simultaneous superpositions of input molecules. Further, our method does not require that all input molecules participate in the alignment. Actually, it efficiently detects high scoring partial multiple alignments for all possible number of molecules in the input. To demonstrate the power of MultiProt, we provide a number of case studies. First, we demonstrate known multiple alignments of protein structures to illustrate the performance of MultiProt. Next, we present various biological applications. These include: (1) a partial alignment of hinge-bent domains; (2) identification of functional groups of G-proteins; (3) analysis of binding sites; and (4) protein-protein interface alignment. Some applications preserve the sequence order of the residues in the alignment, whereas others are order-independent. It is their residue sequence order-independence that allows application of MultiProt to derive multiple alignments of binding sites and of protein-protein interfaces, making MultiProt an extremely useful structural tool. Copyright 2004 Wiley-Liss, Inc.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Dog eat dogma

              It is human nature to inflate one's ideas and contributions. It is also human nature to hang onto one's ideas long after they have outlived their usefulness, in much the same way that a parent will still support a child who has grown up to be a menace to society. Both traits are at work whenever a scientist makes sweeping statements. The more general one's insight or discovery can be claimed to be, the greater its seeming importance. And having gone out on that proverbial limb, a scientist will do much to avoid conceding that it may be less than it was thought to be, which is why so many outdated concepts have more lives than a cat. Few statements in biology have been as sweeping as the 'Central Dogma of Molecular Biology': DNA makes RNA makes protein. Its name is always capitalized, like the Constitution of the United States or the Magna Carta. It is usually stated without qualification. It was referred to, from its inception, as a dogma rather than a theory. (Even Darwin had the modesty to call evolution a theory.) Scientists don't usually produce dogmas; that is nominally the province of religions, and even the briefest study of history will suggest that, in addition to admitting of no contradiction, dogmas tend to be accompanied by lots of other fun things, such as inquisitions and wars. The Central Dogma was beloved of students because it was easy to remember and had no stated exceptions, like any good dogma. Sadly, it has fallen on hard times of late. The discovery of reverse transcriptase provided an inconvenient example of the synthesis of DNA from RNA. An attempt was made to re-establish dogma status by explaining that the phrase 'DNA makes RNA makes protein' really referred to the flow of genetic information, not the actual steps of synthesis. Then along came RNA editing, in which guide RNAs or enzyme action modify some messenger RNAs such that the final protein sequence cannot be deduced from the gene sequence alone. Alternative splicing didn't help either: it could be argued that it represents a case of RNA making itself, then making a bunch of different proteins. And then there was that inconvenient stuff about RNA catalysis, which suggested that there was once an RNA world in which RNA made protein without DNA getting into the act at all. To account for all this, the Central Dogma now would have to go something like this: 'DNA makes RNA makes protein, but sometimes RNA can make DNA and other times RNA makes RNA, which makes proteins different from what they would be if only DNA made the RNA, and once upon a time RNA made protein, probably, but no-one knows for certain'. Or, if you prefer your dogmas pithy: 'DNA makes RNA makes protein, except when it doesn't'. Perhaps it is best to retire the Central Dogma, and before suggesting a replacement remind ourselves that, because it was a dogma, all of the exceptions - from reverse transcription to RNA catalysis to editing of the message - were initially dismissed as artifacts and had more trouble gaining acceptance than perhaps they should have from the quality of the experimental work. Skepticism in science is a good thing, but dogmas breed cynicism (which is not) and lead to reactionary thinking. Just ask Galileo. Still, dogmas have their uses. Students, as stated earlier, find them very helpful. They provide a convenient encapsulation of the perceived wisdom of the moment. They are usually easier for lay people to understand than laboriously qualified statements. And they provide a clearly visible target for that most interesting breed of scientist, the iconoclast, to shoot at - rather like policemen's hats. So, I might as well suggest that genomics has a dogma that is more profound, I think, than the Central Dogma, and more robust to boot. The Central Dogma of Genomics derives from structural biology. Concisely stated, it is: 'sequence determines structure determines function'. Chaperone-mediated protein folding does not violate this dogma, because chaperones do not induce in proteins a fold that is different from one adopted when the proteins are allowed to fold on their own in dilute solution - chaperones just expedite the folding or prevent unwanted aggregation. Post-translational modification of the structure by limited proteolysis, phosphorylation, glycosylation and the like also does not violate the dogma, because the sequence and structure of the protein determine the nature of such modifications, and the sequence and structure of the protein after modification determines what the consequences of that modification will be. The dogma is vague about which sequence is referred to, which is useful because it forces us to think about it. Upon reflection, it is clear that the relevant sequence must be that of the protein, not the DNA or RNA, and specifically that of the protein following any modifications that may be made on the pathway to full expression of function. That is a very important conclusion, because it means the task of genome sequencing cannot be considered complete until genome-wide methods for detecting and characterizing changes in the protein sequence have been developed and applied. The dogma asserts that it should be possible, ultimately, to deduce the function of a protein from its structure. Belief in the truth of this statement lies at the heart of structural genomics, which endeavors to determine structures for all of the gene products in a given organism. Yet here the dogma can be accused of oversimplification. Protein-protein interactions and protein localization within the cell can have profound influence on protein function. But I believe it is sound to argue in rebuttal that these things also depend directly on the sequence and structure of the protein in question. The Central Dogma of Genomics is a concise summary of the basic assumptions that underlie this field. Though they are cast as a dogma, we would do well to bear in mind that they are only assumptions, albeit ones with good legs to stand on. If the seeds of eventual overthrow are sown anywhere, they may be in the word 'function'. For, unlike most important words in science, function is a word whose meaning is highly situational. But that's another column.
                Bookmark

                Author and article information

                Journal
                Nucleic Acids Res
                nar
                nar
                Nucleic Acids Research
                Oxford University Press
                0305-1048
                1362-4962
                August 2010
                August 2010
                4 June 2010
                4 June 2010
                : 38
                : 14
                : e150
                Affiliations
                1Department of Chemistry, National Tsing Hua University, Hsinchu 300 and 2Institute of Biomedical Sciences, Academia Sinica, Taipei 115, Taiwan
                Author notes
                *To whom correspondence should be addressed. Tel: 886 2 2652 3031; Fax: 886 2 2788 7641; Email: carmay@ 123456gate.sinica.edu.tw
                Article
                gkq478
                10.1093/nar/gkq478
                2919736
                20525797
                ab3f4952-062c-48a2-9956-31c266f4b95c
                © The Author(s) 2010. Published by Oxford University Press.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License ( http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 12 February 2010
                : 2 April 2010
                : 12 May 2010
                Categories
                Methods Online

                Genetics
                Genetics

                Comments

                Comment on this article