69
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A Network Approach to Analyzing Highly Recombinant Malaria Parasite Genes

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The var genes of the human malaria parasite Plasmodium falciparum present a challenge to population geneticists due to their extreme diversity, which is generated by high rates of recombination. These genes encode a primary antigen protein called PfEMP1, which is expressed on the surface of infected red blood cells and elicits protective immune responses. Var gene sequences are characterized by pronounced mosaicism, precluding the use of traditional phylogenetic tools that require bifurcating tree-like evolutionary relationships. We present a new method that identifies highly variable regions (HVRs), and then maps each HVR to a complex network in which each sequence is a node and two nodes are linked if they share an exact match of significant length. Here, networks of var genes that recombine freely are expected to have a uniformly random structure, but constraints on recombination will produce network communities that we identify using a stochastic block model. We validate this method on synthetic data, showing that it correctly recovers populations of constrained recombination, before applying it to the Duffy Binding Like-α (DBLα) domain of var genes. We find nine HVRs whose network communities map in distinctive ways to known DBLα classifications and clinical phenotypes. We show that the recombinational constraints of some HVRs are correlated, while others are independent. These findings suggest that this micromodular structuring facilitates independent evolutionary trajectories of neighboring mosaic regions, allowing the parasite to retain protein function while generating enormous sequence diversity. Our approach therefore offers a rigorous method for analyzing evolutionary constraints in var genes, and is also flexible enough to be easily applied more generally to any highly recombinant sequences.

          Author Summary

          The human malaria parasite kills nearly 1 million people each year globally. Frequent genetic exchange between malaria parasites creates enormous genetic diversity that largely explains the lack of an effective vaccine for the disease. Traditional phylogenetic tools cannot accommodate this type of diversity, however, and rigorous analytical tools capable of making sense of gene sequences that recombine frequently are still lacking. Here, we use network techniques that have been developed by the physics and network science communities to analyze malaria parasite gene sequences, allowing us to automatically identify highly variable mosaic regions in sequence data and to derive the network of recombination events. We apply our method to seven fully-sequenced parasite genomes, and show that our method provides new insights into the complex evolutionary patterns of the parasite. Our results suggest that the structure of these sequences allows the parasite to rapidly diversify to evade immune responses while maintaining antigen structure and function.

          Related collections

          Most cited references29

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Assortative mixing in networks

          M. Newman (2002)
          A network is said to show assortative mixing if the nodes in the network that have many connections tend to be connected to other nodes with many connections. We define a measure of assortative mixing for networks and use it to show that social networks are often assortatively mixed, but that technological and biological networks tend to be disassortative. We propose a model of an assortative network, which we study both analytically and numerically. Within the framework of this model we find that assortative networks tend to percolate more easily than their disassortative counterparts and that they are also more robust to vertex removal.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Community Structure in Time-Dependent, Multiscale, and Multiplex Networks

            Network science is an interdisciplinary endeavor, with methods and applications drawn from across the natural, social, and information sciences. A prominent problem in network science is the algorithmic detection of tightly-connected groups of nodes known as communities. We developed a generalized framework of network quality functions that allowed us to study the community structure of arbitrary multislice networks, which are combinations of individual networks coupled through links that connect each node in one network slice to itself in other slices. This framework allows one to study community structure in a very general setting encompassing networks that evolve over time, have multiple types of links (multiplexity), and have multiple scales.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              The large diverse gene family var encodes proteins involved in cytoadherence and antigenic variation of Plasmodium falciparum-infected erythrocytes.

              The human malaria parasite Plasmodium falciparum evades host immunity by varying the antigenic and adhesive character of infected erythrocytes. We describe a large and extremely diverse family of P. falciparum genes (var) that encode 200-350 kDa proteins having the expected properties of antigenically variant adhesion molecules. Predicted amino acid sequences of var genes show a variable extracellular segment with domains having receptor-binding features, a transmembrane sequence, and a terminal segment that is a probable submembrane anchor. There are 50-150 var genes on multiple parasite chromosomes, and some are in clustered arrangements. var probes detect two classes of transcripts in steady-state RNA: 7-9 kb var transcripts, and an unusual family of 1.8-2.4 kb transcripts that may be involved in expression or rearrangements of var genes.
                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                PLoS Comput Biol
                PLoS Comput. Biol
                plos
                ploscomp
                PLoS Computational Biology
                Public Library of Science (San Francisco, USA )
                1553-734X
                1553-7358
                October 2013
                October 2013
                10 October 2013
                : 9
                : 10
                : e1003268
                Affiliations
                [1 ]Department of Epidemiology, Harvard School of Public Health, Boston, Massachusetts, United States of America
                [2 ]Center for Communicable Disease Dynamics, Harvard School of Public Health, Boston, Massachusetts, United States of America
                [3 ]Department of Computer Science, University of Colorado, Boulder, Colorado, United States of America
                [4 ]BioFrontiers Institute, University of Colorado, Boulder, Colorado, United States of America
                [5 ]Santa Fe Institute, Santa Fe, New Mexico, United States of America
                Emory University, United States of America
                Author notes

                The authors have declared that no competing interests exist.

                Conceived and designed the experiments: DBL AC COB. Performed the experiments: DBL. Analyzed the data: DBL COB. Wrote the paper: DBL AC COB. Designed the software used in analysis: DBL.

                Article
                PCOMPBIOL-D-13-00735
                10.1371/journal.pcbi.1003268
                3794903
                24130474
                105c15b9-f05d-477d-b9b9-58fa09aca51b
                Copyright @ 2013

                This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 29 April 2013
                : 23 August 2013
                Page count
                Pages: 12
                Funding
                The project described was supported by Award Numbers R21GM100207 (DBL, AC, COB) and U54GM088558 (DBL, COB) from the National Institute Of General Medical Sciences. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute Of General Medical Sciences or the National Institutes of Health. The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article

                Quantitative & Systems biology
                Quantitative & Systems biology

                Comments

                Comment on this article