London, Jan 25: Even before the first cases of Covid-19 in Europe were announced at the end of January 2020, signals that something strange was happening were already circulating on Twitter, said a new study.
The study, published in the journal Scientific Reports, identified signs of growing concern about pneumonia cases in posts published on Twitter in seven countries between the end of 2019 and the beginning of 2020.
The analysis of the posts shows that the "whistleblowing" came precisely from the geographical regions where the primary outbreaks later developed.
"Our study adds on to the existing evidence that social media can be a useful tool of epidemiological surveillance," said Massimo Riccaboni, Professor at IMT School for Advanced Studies Lucca in Italy.
"They can help intercept the first signs of a new disease, before it proliferates undetected, and also track its spread."
To conduct the research, the authors first created a unique database of all the messages posted on Twitter containing the keyword "pneumonia" in the seven most widely spoken languages of the European Union -- English, German, French, Italian, Spanish, Polish, and Dutch -- from December 2014 until March 1, 2020.
The word "pneumonia" was chosen because pneumonia is the most severe condition induced by SARS-CoV-2, and also because the 2020 flu season was milder than previous ones, so there was no reason to think the flu was responsible for all the mentions and worries.
The researchers then made a number of adjustments and corrections to the posts in the database to avoid overestimating the number of tweets mentioning pneumonia between December 2019 and January 2020.
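The database construction described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' pipeline: the translated search terms, the post fields ("text", "lang", "date") and the December 1 start date are all hypothetical choices made for the example, and the study's own corrections to the data are not reproduced.

```python
from datetime import date

# Assumed translations of "pneumonia"; the study's exact search terms
# are not given in this article.
PNEUMONIA_TERMS = {
    "en": "pneumonia",
    "de": "lungenentzündung",
    "fr": "pneumonie",
    "it": "polmonite",
    "es": "neumonía",
    "pl": "zapalenie płuc",
    "nl": "longontsteking",
}

# Collection window; the exact start day in December 2014 is an assumption.
START = date(2014, 12, 1)
END = date(2020, 3, 1)


def mentions_pneumonia(post):
    """Return True if a post (a dict with hypothetical keys 'text',
    'lang' and 'date') falls in the window and contains the keyword
    for its language."""
    term = PNEUMONIA_TERMS.get(post["lang"])
    if term is None:
        return False  # language not covered by the study
    in_window = START <= post["date"] <= END
    # casefold() gives a case-insensitive match that also handles
    # non-ASCII letters such as "ü" and "í".
    return in_window and term in post["text"].casefold()


def build_database(posts):
    """Keep only the posts that mention the keyword."""
    return [post for post in posts if mentions_pneumonia(post)]
```

A plain substring match like this would miss inflected forms (common in Polish, for instance) and would count retweets and news-driven chatter, which is the kind of overestimation the researchers' adjustments were meant to avoid.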
The World Health Organization (WHO) announced the first "cases of pneumonia of unknown etiology" on December 31, 2019, and Covid-19 was officially recognized as a serious transmissible disease on January 21, 2020.
The authors' analysis shows an increase in tweets mentioning the keyword "pneumonia" in most of the European countries included in the study as early as January 2020, indicating ongoing concern and public interest in pneumonia cases.
In Italy, for example, where the first lockdown measures to contain Covid-19 infections were introduced on February 22, 2020, the rate of increase in mentions of pneumonia during the first few weeks of 2020 differs substantially from the rate observed in the same weeks of 2019.
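The year-on-year comparison described for Italy can be illustrated with a short Python sketch. It is a simplified stand-in, not the statistical method used in the study: it compares the average week-over-week growth in mentions for the same weeks of two years, and the function names are hypothetical.

```python
def growth_rates(weekly_counts):
    """Week-over-week relative change for a list of weekly mention counts."""
    rates = []
    for prev, curr in zip(weekly_counts, weekly_counts[1:]):
        if prev <= 0:
            raise ValueError("weekly counts must be positive")
        rates.append((curr - prev) / prev)
    return rates


def mean(values):
    """Arithmetic mean of a non-empty list."""
    return sum(values) / len(values)


def excess_growth(counts_this_year, counts_last_year):
    """Difference in average weekly growth between the same weeks of
    two consecutive years; a positive value means mentions grew
    faster this year than last."""
    return mean(growth_rates(counts_this_year)) - mean(growth_rates(counts_last_year))
```

Called with the weekly pneumonia-mention counts for the first weeks of 2020 and of 2019, a clearly positive result would correspond to the kind of divergence the authors report. Deciding whether such a gap is statistically significant requires a proper test, which this sketch does not attempt.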
In other words, the tweets pointed to potentially hidden infection hotspots several weeks before the announcement of the first local source of a Covid-19 infection, in Codogno, Italy, on February 20.
France exhibited a similar pattern, whereas Spain, Poland, and the UK showed it with a delay of two weeks.
The authors also geolocated over 13,000 pneumonia-related tweets from this same period and discovered that they came exactly from the regions where the first cases of infection were later reported, such as Lombardy in Italy, Madrid in Spain, and Île-de-France in France.
Following the same procedure used for the keyword "pneumonia", the researchers also built a second dataset of posts containing the keyword "dry cough", one of the other symptoms later associated with Covid-19.
Here too, they observed the same pattern, namely an abnormal and statistically significant increase in the number of mentions of the phrase during the weeks leading up to the surge of infections in February 2020.