1
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Analysis of HubP-dependent cell pole protein targeting in  Vibrio cholerae uncovers novel motility regulators

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          In rod-shaped bacteria, the emergence and maintenance of long-axis cell polarity is involved in key cellular processes such as cell cycle, division, environmental sensing and flagellar motility among others. Many bacteria achieve cell pole differentiation through the use of polar landmark proteins acting as scaffolds for the recruitment of functional macromolecular assemblies. In Vibrio cholerae a large membrane-tethered protein, HubP, specifically interacts with proteins involved in chromosome segregation, chemotaxis and flagellar biosynthesis. Here we used comparative proteomics, genetic and imaging approaches to identify additional HubP partners and demonstrate that at least six more proteins are subject to HubP-dependent polar localization. These include a cell-wall remodeling enzyme (DacB), a likely chemotaxis sensory protein (HlyB), two presumably cytosolic proteins of unknown function (VC1210 and VC1380) and two membrane-bound proteins, named here MotV and MotW, that exhibit distinct effects on chemotactic motility. We show that while both Δ motW and Δ motV mutants retain monotrichous flagellation, they present significant to severe motility defects when grown in soft agar. Video-tracking experiments further reveal that Δ motV cells can swim in liquid environments but are unable to tumble or penetrate a semisolid matrix, whereas a motW deletion affects both tumbling frequency and swimming speed. Motility suppressors and gene co-occurrence analyses reveal co-evolutionary linkages between MotV, a subset of non-canonical CheV proteins and flagellar C-ring components FliG and FliM, whereas MotW regulatory inputs appear to intersect with specific c-di-GMP signaling pathways. Together, these results reveal an ever more versatile role for the landmark cell pole organizer HubP and identify novel mechanisms of motility regulation.

          Author summary

          Cell polarity is the result of controlled asymmetric distribution of protein macrocomplexes, genetic material, membrane lipids and cellular metabolites, and can play crucial physiological roles not only in multicellular organisms but also in unicellular bacteria. In the opportunistic cholera pathogen Vibrio cholerae, the polar landmark protein HubP tethers key actors in chromosome segregation, chemotaxis and flagellar biosynthesis and thus converts the cell pole into an important functional microdomain for cell proliferation, environmental sensing and adaptation between free-living and pathogenic life-styles. Using a comparative proteomics approach, we here-in present a comprehensive analysis of HubP-dependent cell pole protein sorting and identify novel HubP partners including ones likely involved in cell wall remodeling (DacB), chemotaxis (HlyB) and motility regulation (MotV and MotW). Unlike previous studies which have identified early roles for HubP in flagellar assembly, functional, genetic and phylogenetic analyses of its MotV and MotW partners suggest a direct role in flagellar rotary mechanics and provide new insights into the coevolution and functional interdependence of chemotactic signaling, bacterial motility and biofilm formation.

          Related collections

          Most cited references76

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Fast and accurate short read alignment with Burrows–Wheeler transform

          Motivation: The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals. Results: We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ∼10–20× faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package. Availability: http://maq.sourceforge.net Contact: rd@sanger.ac.uk
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Highly accurate protein structure prediction with AlphaFold

            Proteins are essential to life, and understanding their structure can facilitate a mechanistic understanding of their function. Through an enormous experimental effort 1 – 4 , the structures of around 100,000 unique proteins have been determined 5 , but this represents a small fraction of the billions of known protein sequences 6 , 7 . Structural coverage is bottlenecked by the months to years of painstaking effort required to determine a single protein structure. Accurate computational approaches are needed to address this gap and to enable large-scale structural bioinformatics. Predicting the three-dimensional structure that a protein will adopt based solely on its amino acid sequence—the structure prediction component of the ‘protein folding problem’ 8 —has been an important open research problem for more than 50 years 9 . Despite recent progress 10 – 14 , existing methods fall far short of atomic accuracy, especially when no homologous structure is available. Here we provide the first computational method that can regularly predict protein structures with atomic accuracy even in cases in which no similar structure is known. We validated an entirely redesigned version of our neural network-based model, AlphaFold, in the challenging 14th Critical Assessment of protein Structure Prediction (CASP14) 15 , demonstrating accuracy competitive with experimental structures in a majority of cases and greatly outperforming other methods. Underpinning the latest version of AlphaFold is a novel machine learning approach that incorporates physical and biological knowledge about protein structure, leveraging multi-sequence alignments, into the design of the deep learning algorithm. AlphaFold predicts protein structures with an accuracy competitive with experimental structures in the majority of cases using a novel deep learning architecture.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Integrative Genomics Viewer

              To the Editor Rapid improvements in sequencing and array-based platforms are resulting in a flood of diverse genome-wide data, including data from exome and whole genome sequencing, epigenetic surveys, expression profiling of coding and non-coding RNAs, SNP and copy number profiling, and functional assays. Analysis of these large, diverse datasets holds the promise of a more comprehensive understanding of the genome and its relation to human disease. Experienced and knowledgeable human review is an essential component of this process, complementing computational approaches. This calls for efficient and intuitive visualization tools able to scale to very large datasets and to flexibly integrate multiple data types, including clinical data. However, the sheer volume and scope of data poses a significant challenge to the development of such tools. To address this challenge we developed the Integrative Genomics Viewer (IGV), a lightweight visualization tool that enables intuitive real-time exploration of diverse, large-scale genomic datasets on standard desktop computers. It supports flexible integration of a wide range of genomic data types including aligned sequence reads, mutations, copy number, RNAi screens, gene expression, methylation, and genomic annotations (Figure S1). The IGV makes use of efficient, multi-resolution file formats to enable real-time exploration of arbitrarily large datasets over all resolution scales, while consuming minimal resources on the client computer (see Supplementary Text). Navigation through a dataset is similar to Google Maps, allowing the user to zoom and pan seamlessly across the genome at any level of detail from whole-genome to base pair (Figure S2). Datasets can be loaded from local or remote sources, including cloud-based resources, enabling investigators to view their own genomic datasets alongside publicly available data from, for example, The Cancer Genome Atlas (TCGA) 1 , 1000 Genomes (www.1000genomes.org/), and ENCODE 2 (www.genome.gov/10005107) projects. In addition, IGV allows collaborators to load and share data locally or remotely over the Web. IGV supports concurrent visualization of diverse data types across hundreds, and up to thousands of samples, and correlation of these integrated datasets with clinical and phenotypic variables. A researcher can define arbitrary sample annotations and associate them with data tracks using a simple tab-delimited file format (see Supplementary Text). These might include, for example, sample identifier (used to link different types of data for the same patient or tissue sample), phenotype, outcome, cluster membership, or any other clinical or experimental label. Annotations are displayed as a heatmap but more importantly are used for grouping, sorting, filtering, and overlaying diverse data types to yield a comprehensive picture of the integrated dataset. This is illustrated in Figure 1, a view of copy number, expression, mutation, and clinical data from 202 glioblastoma samples from the TCGA project in a 3 kb region around the EGFR locus 1, 3 . The investigator first grouped samples by tumor subtype, then by data type (copy number and expression), and finally sorted them by median copy number over the EGFR locus. A shared sample identifier links the copy number and expression tracks, maintaining their relative sort order within the subtypes. Mutation data is overlaid on corresponding copy number and expression tracks, based on shared participant identifier annotations. Several trends in the data stand out, such as a strong correlation between copy number and expression and an overrepresentation of EGFR amplified samples in the Classical subtype. IGV’s scalable architecture makes it well suited for genome-wide exploration of next-generation sequencing (NGS) datasets, including both basic aligned read data as well as derived results, such as read coverage. NGS datasets can approach terabytes in size, so careful management of data is necessary to conserve compute resources and to prevent information overload. IGV varies the displayed level of detail according to resolution scale. At very wide views, such as the whole genome, IGV represents NGS data by a simple coverage plot. Coverage data is often useful for assessing overall quality and diagnosing technical issues in sequencing runs (Figure S3), as well as analysis of ChIP-Seq 4 and RNA-Seq 5 experiments (Figures S4 and S5). As the user zooms below the ~50 kb range, individual aligned reads become visible (Figure 2) and putative SNPs are highlighted as allele counts in the coverage plot. Alignment details for each read are available in popup windows (Figures S6 and S7). Zooming further, individual base mismatches become visible, highlighted by color and intensity according to base call and quality. At this level, the investigator may sort reads by base, quality, strand, sample and other attributes to assess the evidence of a variant. This type of visual inspection can be an efficient and powerful tool for variant call validation, eliminating many false positives and aiding in confirmation of true findings (Figures S6 and S7). Many sequencing protocols produce reads from both ends (“paired ends”) of genomic fragments of known size distribution. IGV uses this information to color-code paired ends if their insert sizes are larger than expected, fall on different chromosomes, or have unexpected pair orientations. Such pairs, when consistent across multiple reads, can be indicative of a genomic rearrangement. When coloring aberrant paired ends, each chromosome is assigned a unique color, so that intra- (same color) and inter- (different color) chromosomal events are readily distinguished (Figures 2 and S8). We note that misalignments, particularly in repeat regions, can also yield unexpected insert sizes, and can be diagnosed with the IGV (Figure S9). There are a number of stand-alone, desktop genome browsers available today 6 including Artemis 7 , EagleView 8 , MapView 9 , Tablet 10 , Savant 11 , Apollo 12 , and the Integrated Genome Browser 13 . Many of them have features that overlap with IGV, particularly for NGS sequence alignment and genome annotation viewing. The Integrated Genome Browser also supports viewing array-based data. See Supplementary Table 1 and Supplementary Text for more detail. IGV focuses on the emerging integrative nature of genomic studies, placing equal emphasis on array-based platforms, such as expression and copy-number arrays, next-generation sequencing, as well as clinical and other sample metadata. Indeed, an important and unique feature of IGV is the ability to view all these different data types together and to use the sample metadata to dynamically group, sort, and filter datasets (Figure 1 above). Another important characteristic of IGV is fast data loading and real-time pan and zoom – at all scales of genome resolution and all dataset sizes, including datasets comprising hundreds of samples. Finally, we have placed great emphasis on the ease of installation and use of IGV, with the goal of making both the viewing and sharing of their data accessible to non-informatics end users. IGV is open source software and freely available at http://www.broadinstitute.org/igv/, including full documentation on use of the software. Supplementary Material 1
                Bookmark

                Author and article information

                Contributors
                Role: Formal analysisRole: InvestigationRole: MethodologyRole: Writing – original draftRole: Writing – review & editing
                Role: Formal analysisRole: InvestigationRole: MethodologyRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: InvestigationRole: Writing – review & editing
                Role: Formal analysisRole: InvestigationRole: Writing – review & editing
                Role: InvestigationRole: Writing – review & editing
                Role: Formal analysisRole: Funding acquisitionRole: InvestigationRole: SupervisionRole: ValidationRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: ConceptualizationRole: Formal analysisRole: Funding acquisitionRole: InvestigationRole: MethodologyRole: Project administrationRole: SupervisionRole: ValidationRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: Editor
                Journal
                PLoS Genet
                PLoS Genet
                plos
                PLoS Genetics
                Public Library of Science (San Francisco, CA USA )
                1553-7390
                1553-7404
                12 January 2022
                January 2022
                : 18
                : 1
                : e1009991
                Affiliations
                [1 ] Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Gif-sur-Yvette, France
                [2 ] Graduate School of Structure and Dynamics of Living Systems, Université Paris-Saclay, Orsay, France
                [3 ] Doctoral School of Therapeutic Innovation ITFA. Université Paris-Saclay, Orsay, France
                [4 ] ‘Structural Biology of Biofilms’ Group, European Institute of Chemistry and Biology (IECB), Pessac, France
                [5 ] Université de Bordeaux, CNRS, Bordeaux INP, CBMN, UMR 5248, Pessac, France
                Max-Planck-Institute for Marine Microbiology: Max-Planck-Institut fur marine Mikrobiologie, GERMANY
                Author notes

                The authors have declared that no competing interests exist.

                [¤a]

                Current address: Abbelight, Cachan, France

                [¤b]

                Current address: Micalis Institute, INRAE, AgroParisTech, Université Paris-Saclay, Jouy-en-Josas, France

                Author information
                https://orcid.org/0000-0002-9822-3012
                https://orcid.org/0000-0002-1597-7557
                https://orcid.org/0000-0003-3689-3069
                https://orcid.org/0000-0001-8486-0005
                https://orcid.org/0000-0002-1858-8127
                https://orcid.org/0000-0003-0835-5407
                Article
                PGENETICS-D-21-01611
                10.1371/journal.pgen.1009991
                8789113
                35020734
                db18fde1-9ccb-45b1-88f1-f5028eb4efce
                © 2022 Altinoglu et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 12 December 2021
                : 14 December 2021
                Page count
                Figures: 6, Tables: 1, Pages: 28
                Funding
                Funded by: IDEX Biologie Intégrative des Génomes
                Award Recipient :
                Funded by: IDEX Biologie Intégrative des Génomes
                Award Recipient :
                Funded by: I2BC
                Award Recipient :
                Funded by: I2BC
                Award Recipient :
                Funded by: European Research Council
                Award ID: 757507
                Award Recipient :
                Funded by: IDEX Junior Chair
                Award Recipient :
                This project was supported by IDEX “Biologie Intégrative des Génomes” (to I.A. and Y.Y), I2BC intramural funding (VPS2, to P.V.K and Y.Y), an ERC Starting grant (BioMatrix #757507 to P.V.K) and an IDEX Junior Chair (Université de Bordeaux, to P.V.K.). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Medicine and Health Sciences
                Pathology and Laboratory Medicine
                Pathogens
                Virulence Factors
                Pathogen Motility
                Biology and Life Sciences
                Cell Biology
                Cell Motility
                Flagellar Motility
                Biology and Life Sciences
                Microbiology
                Medical Microbiology
                Microbial Pathogens
                Bacterial Pathogens
                Vibrio Cholerae
                Medicine and Health Sciences
                Pathology and Laboratory Medicine
                Pathogens
                Microbial Pathogens
                Bacterial Pathogens
                Vibrio Cholerae
                Biology and Life Sciences
                Organisms
                Bacteria
                Vibrio
                Vibrio Cholerae
                Biology and Life Sciences
                Cell Biology
                Cell Physiology
                Cell Polarity
                Biology and Life Sciences
                Biochemistry
                Proteins
                Protein Domains
                Biology and Life Sciences
                Cell Biology
                Cell Motility
                Chemotaxis
                Biology and Life Sciences
                Cell Biology
                Cell Motility
                Research and Analysis Methods
                Imaging Techniques
                Fluorescence Imaging
                Custom metadata
                vor-update-to-uncorrected-proof
                2022-01-25
                The authors confirm that all data underlying the findings are fully available without restriction. WGS data is available from the ArrayExpress database ( https://www.ebi.ac.uk/arrayexpress/), with the accession number E-MTAB-10691. Other underlying numerical data is provided in Supporting Information file S1 Data.

                Genetics
                Genetics

                Comments

                Comment on this article