Languages evolve rapidly due to an interaction between sociocultural factors and underlying phonological processes that are influenced by genetic factors. DCDC2 has been strongly associated with core components of the phonological processing system in animal models and multiple independent studies of populations and languages. To characterize subtle language differences arising from genetic variants associated with phonological processes, we examined the relationship between READ1, a regulatory element in DCDC2, and phonemes in languages of 43 populations across five continents. Variation in READ1 was significantly correlated with the number of consonants. Our results suggest that subtle cognitive biases conferred by different READ1 alleles are amplified through cultural transmission that shape consonant use by populations over time.
DCDC2 is a gene strongly associated with components of the phonological processing system in animal models and in multiple independent studies of populations and languages. We propose that it may also influence population-level variation in language component usage. To test this hypothesis, we investigated the evolution and worldwide distribution of the READ1 regulatory element within DCDC2, and compared its distribution with variation in different language properties. The mutational history of READ1 was estimated by examining primate and archaic hominin sequences. This identified duplication and expansion events, which created a large number of polymorphic alleles based on internal repeat units (RU1 and RU2). Association of READ1 alleles was studied with respect to the numbers of consonants and vowels for languages in 43 human populations distributed across five continents. Using population-based approaches with multivariate ANCOVA and linear mixed effects analyses, we found that the RU1-1 allele group of READ1 is significantly associated with the number of consonants within languages independent of genetic relatedness, geographic proximity, and language family. We propose that allelic variation in READ1 helped create a subtle cognitive bias that was amplified by cultural transmission, and ultimately shaped consonant use by different populations over time.