Comprehending emotional prosodic cues in speech is critical to human social life. In everyday communication, however, emotional prosody and semantic content often convey conflicting information. Here, we sought to identify the brain regions involved in monitoring conflict between these interfering communication channels. Using functional magnetic resonance imaging, we observed signal increases in the right dorsal anterior cingulate cortex, right superior temporal gyrus (STG), and superior temporal sulcus when participants listened to incongruous compared with congruous sentences. Moreover, valence-specific effects were found in the left inferior frontal gyrus and left STG for happily intoned sentences expressing negative content. The left caudate nucleus, along with the thalamus, was active when angrily intoned sentences were coupled with positive semantic content. Our results suggest a brain network that monitors conflict between emotional prosody and emotional semantic content, comprising medial prefrontal areas that have previously been associated with cognitive conflict processing. Furthermore, our study extends the knowledge of these processes by suggesting valence-specific differences in emotional conflict processing.