33
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Classification of the Adenylation and Acyl-Transferase Activity of NRPS and PKS Systems Using Ensembles of Substrate Specific Hidden Markov Models

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          There is a growing interest in the Non-ribosomal peptide synthetases (NRPSs) and polyketide synthases (PKSs) of microbes, fungi and plants because they can produce bioactive peptides such as antibiotics. The ability to identify the substrate specificity of the enzyme's adenylation (A) and acyl-transferase (AT) domains is essential to rationally deduce or engineer new products. We here report on a Hidden Markov Model (HMM)-based ensemble method to predict the substrate specificity at high quality. We collected a new reference set of experimentally validated sequences. An initial classification based on alignment and Neighbor Joining was performed in line with most of the previously published prediction methods. We then created and tested single substrate specific HMMs and found that their use improved the correct identification significantly for A as well as for AT domains. A major advantage of the use of HMMs is that it abolishes the dependency on multiple sequence alignment and residue selection that is hampering the alignment-based clustering methods. Using our models we obtained a high prediction quality for the substrate specificity of the A domains similar to two recently published tools that make use of HMMs or Support Vector Machines (NRPSsp and NRPS predictor2, respectively). Moreover, replacement of the single substrate specific HMMs by ensembles of models caused a clear increase in prediction quality. We argue that the superiority of the ensemble over the single model is caused by the way substrate specificity evolves for the studied systems. It is likely that this also holds true for other protein domains. The ensemble predictor has been implemented in a simple web-based tool that is available at http://www.cmbi.ru.nl/NRPS-PKS-substrate-predictor/.

          Related collections

          Most cited references39

          • Record: found
          • Abstract: found
          • Article: not found

          Profile hidden Markov models.

          S. Eddy (1998)
          The recent literature on profile hidden Markov model (profile HMM) methods and software is reviewed. Profile HMMs turn a multiple sequence alignment into a position-specific scoring system suitable for searching databases for remotely homologous sequences. Profile HMM analyses complement standard pairwise comparison methods for large-scale sequence analysis. Several software implementations and two large libraries of profile HMMs of common protein domains are available. HMM methods performed comparably to threading methods in the CASP2 structure prediction exercise.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Reorganizing the protein space at the Universal Protein Resource (UniProt)

            The mission of UniProt is to support biological research by providing a freely accessible, stable, comprehensive, fully classified, richly and accurately annotated protein sequence knowledgebase, with extensive cross-references and querying interfaces. UniProt is comprised of four major components, each optimized for different uses: the UniProt Archive, the UniProt Knowledgebase, the UniProt Reference Clusters and the UniProt Metagenomic and Environmental Sequence Database. A key development at UniProt is the provision of complete, reference and representative proteomes. UniProt is updated and distributed every 4 weeks and can be accessed online for searches or download at http://www.uniprot.org.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Intimate bacterial-fungal interaction triggers biosynthesis of archetypal polyketides in Aspergillus nidulans.

              Fungi produce numerous low molecular weight molecules endowed with a multitude of biological activities. However, mining the full-genome sequences of fungi indicates that their potential to produce secondary metabolites is greatly underestimated. Because most of the biosynthesis gene clusters are silent under laboratory conditions, one of the major challenges is to understand the physiological conditions under which these genes are activated. Thus, we cocultivated the important model fungus Aspergillus nidulans with a collection of 58 soil-dwelling actinomycetes. By microarray analyses of both Aspergillus secondary metabolism and full-genome arrays and Northern blot and quantitative RT-PCR analyses, we demonstrate at the molecular level that a distinct fungal-bacterial interaction leads to the specific activation of fungal secondary metabolism genes. Most surprisingly, dialysis experiments and electron microscopy indicated that an intimate physical interaction of the bacterial and fungal mycelia is required to elicit the specific response. Gene knockout experiments provided evidence that one induced gene cluster codes for the long-sought after polyketide synthase (PKS) required for the biosynthesis of the archetypal polyketide orsellinic acid, the typical lichen metabolite lecanoric acid, and the cathepsin K inhibitors F-9775A and F-9775B. A phylogenetic analysis demonstrates that orthologs of this PKS are widespread in nature in all major fungal groups, including mycobionts of lichens. These results provide evidence of specific interaction among microorganisms belonging to different domains and support the hypothesis that not only diffusible signals but intimate physical interactions contribute to the communication among microorganisms and induction of otherwise silent biosynthesis genes.
                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                PLoS One
                PLoS ONE
                plos
                plosone
                PLoS ONE
                Public Library of Science (San Francisco, USA )
                1932-6203
                2013
                18 April 2013
                : 8
                : 4
                : e62136
                Affiliations
                [1 ]Center for Molecular and Biomolecular Informatics, Nijmegen Center for Molecular Life Sciences, Radboud University Nijmegen Medical Centre, Nijmegen, The Netherlands
                [2 ]Netherlands Bioinformatics Center, Nijmegen, The Netherlands
                [3 ]Kluyver Center for Genomics of Industrial Fermentation, Delft, The Netherlands
                [4 ]TI Food and Nutrition, Wageningen, The Netherlands
                The University of Hong Kong, China
                Author notes

                Competing Interests: The authors have declared that no competing interests exist.

                Coordination and advice: RS CF. Conceived and designed the experiments: BK CF. Performed the experiments: BK. Analyzed the data: BK CF. Contributed reagents/materials/analysis tools: LO. Wrote the paper: BK LO RS CF.

                [¤]

                Current address: Department of Soil and Water Sciences, Faculty of Agricultural Sciences, University of Sulaimani, Sulaimani, Kurdistan, Iraq

                Article
                PONE-D-12-27711
                10.1371/journal.pone.0062136
                3630128
                23637983
                50118475-356f-4526-90b3-970d6c2d16d6
                Copyright @ 2013

                This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 12 September 2012
                : 19 March 2013
                Page count
                Pages: 10
                Funding
                This work was supported by the City Nijmegen-The Netherlands (Gemeente Nijmegen). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Biology
                Computational Biology
                Sequence Analysis
                Synthetic Biology
                Genomics
                Comparative Genomics
                Functional Genomics
                Genome Analysis Tools
                Genome Sequencing
                Synthetic Biology
                Chemistry
                Chemical Biology
                Synthetic Chemistry
                Medicine
                Infectious Diseases
                Bacterial Diseases
                Infectious Disease Control

                Uncategorized
                Uncategorized

                Comments

                Comment on this article