Just a few hours before the ballots open for the 57th presidential election, the key question for us data scientists is: which data set could really show some special information, that would not be easily available through a classic poll. We have already seen some interesting correlations of Wikipedia usage with the ongoing campaign – just looking on how many people would search for the page on the candidates would provide a time series with many fascinating details.
Today we focused on the question, if the Democrats had been successful to rally Hispanic voters for Obama’s support. We took the Spanish Wikipedia, checkt the daily views of the Obama’s Spanish language Wikipedia-page and compared this with 2008 and also with the time series of his Republican competitors.
This table shows the monthly average for the daily views in 2008 and 2012:
McCain | Romney | Obama | Obama | |||||
---|---|---|---|---|---|---|---|---|
2008 | 2012 | 2008 | 2012 | |||||
Feb | 549 | 674 | 23% | 2297 | 3154 | 37% | ||
Mar | 265 | 532 | 101% | 949 | 3009 | 217% | ||
Apr | 181 | 399 | 120% | 574 | 2748 | 379% | ||
May | 240 | 466 | 94% | 817 | 2759 | 238% | ||
Jun | 435 | 331 | -24% | 2052 | 2477 | 21% | ||
Jul | 423 | 448 | 6% | 668 | 2161 | 224% | ||
Aug | 501 | 918 | 83% | 1289 | 2226 | 73% | ||
Sep | 1155 | 1285 | 11% | 1757 | 2915 | 66% | ||
Oct | 1252 | 2064 | 65% | 3005 | 3502 | 17% | ||
Nov | 2458 | 19110 | ||||||
506 | 841 | 66% | 1385 | 2801 | 102% |
Obama clearly leads – not only in absolute numbers but in particular regarding the increase of this year’s views with four years ago. While Romney would have gained 66% in views compared with McCain, Obama’s views would have more than doubled.
However, if we look just on the last week prior election day, we can see something strange happen: the view’s of Obama’s es.wikipedia-page have droped from the daily average of 5065 in 2008 to merely 3752 in 2012. The same is true for Romney versus McCain: 1795 average views in this year’s 44th week compared to 1986 in 2008.
This decreasing interest in the candidates is not reflected in the numbers that we see on other election-related search-terms. If we e.g. take ‘US Presidential Election’, we count 2672 daily views during the week before election day in 2008 and 3812 views in 2012 – the same rise in interest that we found in the English Wikipedia, too. (See the last post “Why the 2012 US elections are more exciting than 2008”). While the general interest in the elections is huge, the candidates no longer draw that much attention of the Spanish speaking community.
Maybe “Sandy” would work as an explanation since the campaing was halted during the Hurican – nevertheless it would not be plausible why only the candidates but not the election in general would suffer in awareness from this.
So we cannot draw a clear conclusion from our findings. There is evidence that Obama would have succeded to some extend to activate the interest of Hispanic people but regarding the unexpected drop we will have to drill further down. The real work, though, will anyway start right after the vote: to learn what would have been a signal and how we can seperate these from the noise next time.