In recent years, many studies have drawn attention to the important role of collective awareness and human behaviour during epidemic outbreaks. A number of modelling efforts have investigated the interaction between the disease transmission dynamics and human behaviour change mediated by news coverage and by information spreading in the population. Yet, given the scarcity of data on public awareness during an epidemic, few studies have relied on empirical data. Here, we use fine-grained, geo-referenced data from three online sources—Wikipedia, the GDELT Project and the Internet Archive—to quantify population-scale information seeking about the 2016 Zika virus epidemic in the U.S., explicitly linking such behavioural signal to epidemiological data. Geo-localized Wikipedia pageview data reveal that visiting patterns of Zika-related pages in Wikipedia were highly synchronized across the United States and largely explained by exposure to national television broadcast. Contrary to the assumption of some theoretical epidemic models, news volume and Wikipedia visiting patterns were not significantly correlated with the magnitude or the extent of the epidemic. Attention to Zika, in terms of Zika-related Wikipedia pageviews, was high at the beginning of the outbreak, when public health agencies raised an international alert and triggered media coverage, but subsequently exhibited an activity profile that suggests nonlinear dependencies and memory effects in the relation between information seeking, media pressure, and disease dynamics. This calls for a new and more general modelling framework to describe the interaction between media exposure, public awareness and disease dynamics during epidemic outbreaks.
Despite its importance for public health policy-makers, understanding the impact of media coverage on collective attention during disease outbreaks remains an elusive research task, due to the lack of available data, especially at high spatial granularity. In this paper, we study the dynamics of collective attention received by the 2016 Zika epidemic in the USA and its interplay with the media coverage of the outbreak, at level of US states and cities. We measure the attention to Zika through geo-localized Wikipedia page view data, and we compare it with mentions of Zika in US news outlets and TV shows. We also compare the collective attention received by the outbreak with the incidence of Zika reported by the US Centers for Disease Control and Prevention in each state. We find that the attention dynamics was highly synchronized across states, irrespective of the local risk of transmission of the virus. By building a linear regression model, we show that the dynamics of collective attention is highly predictable, even at state level, only based on the national media coverage received by the outbreak.