Bacteria often cope with environmental stress by inducing alternative sigma (σ) factors, which direct RNA polymerase to specific promoters, thereby inducing a set of genes called a regulon to combat the stress. To understand the conserved and organism-specific functions of each σ, it is necessary to be able to predict their promoters, so that their regulons can be followed across species. However, the variability of promoter sequences and motif spacing makes their prediction difficult. We developed and validated an accurate promoter prediction model for Escherichia coli σ E, which enabled us to predict a total of 89 unique σ E-controlled transcription units in E. coli K-12 and eight related genomes. σ E controls the envelope stress response in E. coli K-12. The portion of the regulon conserved across genomes is functionally coherent, ensuring the synthesis, assembly, and homeostasis of lipopolysaccharide and outer membrane porins, the key constituents of the outer membrane of Gram-negative bacteria. The larger variable portion is predicted to perform pathogenesis-associated functions, suggesting that σ E provides organism-specific functions necessary for optimal host interaction. The success of our promoter prediction model for σ E suggests that it will be applicable for the prediction of promoter elements for many alternative σ factors.
A model for predicting the variable promoter sequences associated with the bacterial stress response is developed and used to identify constituents of the transcriptional response to σ E.