Whole-genome duplication has played a central role in the genome evolution of many organisms, including humans. Most duplicated genes are eliminated, and the factors that influence the retention of persisting duplicates remain poorly understood. Here, we describe a systematic complex genetic interaction analysis with yeast paralogs derived from the whole-genome duplication event. Mapping digenic interactions for a deletion mutant of each paralog and trigenic interactions for the double mutant provides insight into their roles and a quantitative measure of their functional redundancy. Trigenic interaction analysis distinguishes two classes of paralogs, a more functionally divergent subset and another that retained more functional overlap. Gene feature analysis and modeling suggest that evolutionary trajectories of duplicated genes are dictated by combined functional and structural entanglement factors.
Whole-genome duplication (WGD) events are pervasive in eukaryotes, shaping the genomes of simple single-celled organisms, such as yeast, and more complex metazoans, including humans. Most duplicated genes are eliminated after WGD because one copy accumulates deleterious mutations, leading to its loss. However, a significant proportion of duplicates persists, and the factors that result in duplicate gene retention are poorly understood but critical for understanding the evolutionary forces that shape genomes.
Quantifying the functional divergence of paralog pairs is of particular interest because of the strong selection against functional redundancy. Negative genetic interactions identify functional relationships between genes and provide a means to directly capture the functional relationship between duplicated genes. Genetic interactions occur when the phenotype associated with a combination of mutations in two or more different genes deviates from the expected combined effect of the individual mutations. A negative genetic interaction refers to a combination of mutations that generates a stronger fitness defect than expected, such as synthetic lethality. Here, we use systematic analysis of digenic and trigenic interaction profiles to assess the functional relationship of retained duplicated genes.
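The definition of a genetic interaction given above can be made concrete with a small sketch. It assumes a multiplicative null model, in which the expected double-mutant fitness is the product of the two single-mutant fitnesses. The text here does not name the model used, so that choice, the function names, and the threshold value are illustrative assumptions rather than the scoring pipeline used in this study.

```python
def digenic_interaction_score(f_a: float, f_b: float, f_ab: float) -> float:
    """Observed double-mutant fitness minus the expected combined effect.

    Assumption: the expected fitness follows a multiplicative model
    (f_a * f_b). Fitness values are relative to wild type (1.0).
    """
    expected = f_a * f_b
    return f_ab - expected


def is_negative_interaction(score: float, threshold: float = -0.08) -> bool:
    """Flag a stronger-than-expected fitness defect.

    The default threshold is a hypothetical cutoff chosen for illustration.
    """
    return score < threshold


# Synthetic lethality: each single mutant is viable, the double mutant is dead.
lethal_score = digenic_interaction_score(f_a=0.8, f_b=0.9, f_ab=0.0)
print(is_negative_interaction(lethal_score))
```

A score near zero means the double mutant behaves as expected from the single mutants, so the sketch reports no interaction in that case.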
To map both digenic and trigenic interactions of duplicated genes, we profiled query strains carrying single deletion mutations and the corresponding double deletion mutations for 240 different dispensable paralog pairs originating from the yeast WGD event. In total, we tested ~550,000 double and ~260,000 triple mutants for genetic interactions and identified ~4,700 negative digenic interactions and ~2,500 negative trigenic interactions. We quantified the trigenic interaction fraction, defined as the ratio of negative trigenic interactions to the total number of interactions associated with the paralog pair. The distribution of the resulting trigenic interaction fractions was distinctly bimodal, with two-thirds of paralogs exhibiting a low trigenic interaction fraction (diverged paralogs) and one-third showing a high trigenic interaction fraction (functionally redundant paralogs). High trigenic interaction fraction paralogs showed a relatively low asymmetry in their number of digenic interactions, low rates of protein sequence divergence, and a negative digenic interaction within the gene pair.
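The trigenic interaction fraction defined above can be sketched as a simple ratio. This sketch assumes that the "total number of interactions associated with the paralog pair" is the sum of the negative digenic interactions of each single mutant and the negative trigenic interactions of the double mutant. That reading, the function name, and the example counts are assumptions made for illustration.

```python
from typing import Optional


def trigenic_interaction_fraction(
    n_digenic_a: int, n_digenic_b: int, n_trigenic: int
) -> Optional[float]:
    """Negative trigenic interactions divided by all interactions of the pair.

    Assumption: the denominator is the digenic interactions of paralog A,
    the digenic interactions of paralog B, and the trigenic interactions
    of the double mutant. Returns None when the pair has no interactions.
    """
    if min(n_digenic_a, n_digenic_b, n_trigenic) < 0:
        raise ValueError("interaction counts must be non-negative")
    total = n_digenic_a + n_digenic_b + n_trigenic
    if total == 0:
        return None
    return n_trigenic / total


# Hypothetical counts: a diverged pair and a functionally redundant pair.
diverged = trigenic_interaction_fraction(40, 25, 5)
redundant = trigenic_interaction_fraction(10, 20, 30)
print(diverged, redundant)
```

Under this reading, a pair whose interactions appear mostly when both copies are deleted scores close to 1, while a pair whose single mutants already show most of the interactions scores close to 0.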
We correlated position-specific evolutionary rate patterns between paralogs to assess constraints acting on their evolutionary trajectories. Paralogs with a high trigenic interaction fraction showed more correlated evolutionary rate patterns and thus were more evolutionarily constrained than paralogs with a low trigenic interaction fraction. Computational simulations that modeled duplicate gene evolution revealed that as the extent of the initial entanglement (overlap of functions) of paralogs increased, so did the range of functional redundancy at steady state. Thus, the bimodal distribution of the trigenic interaction fraction may reflect that some paralogs diverged, primarily evolving distinct functions without redundancy, while others converged to an evolutionary steady state with substantial redundancy due to their structural and functional entanglement.
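The comparison of position-specific evolutionary rate patterns can be illustrated with a minimal sketch. It assumes that each paralog has one rate estimate per aligned position, with gapped positions already removed, and that the patterns are compared with a Pearson correlation. The correlation measure, the function name, and the rate values are assumptions for illustration; the text does not specify them.

```python
import math
from typing import Sequence


def pearson_correlation(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation between two equally long rate profiles."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("profiles must have the same length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant profile")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical per-position rates for two aligned paralogs.
rates_paralog_1 = [0.1, 0.5, 0.9, 0.3, 0.2]
rates_paralog_2 = [0.2, 0.6, 1.1, 0.2, 0.3]
print(pearson_correlation(rates_paralog_1, rates_paralog_2))
```

A higher correlation indicates that the same positions evolve slowly or quickly in both copies, which is the sense in which a paralog pair is described here as more constrained.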
We propose that the evolutionary fate of a duplicated gene is dictated by an interplay of structural and functional entanglement. Paralog pairs with high levels of entanglement are more likely to revert to a singleton state. In contrast, unconstrained paralogs will tend to partition their functions and adopt divergent roles. Intermediately entangled paralog pairs may partition or expand non-overlapping functions while also retaining some common, overlapping functions, such that they can both adopt paralog-specific roles and maintain functional redundancy at an evolutionary steady state.
Trigenic interaction fraction, which incorporates digenic and trigenic interactions, captures the functional relationship of duplicated genes and follows a bimodal distribution. High trigenic interaction fraction paralogs are under evolutionary constraints reflecting their structural and functional entanglement.
Exploring evolutionary trajectories of duplicated genes with complex genetic interaction analysis