data science, digital politics, smart cities...|

#indyref on Wikipedia

My colleague Taha Yasseri and I are currently working on a Fell Fund project on social media data and election prediction, looking especially at data from Google and Wikipedia (first paper out soon; will also be presenting on that at IPP 2014 which should be great). As part of that we thought we’d have a bit of fun looking at Scotland’s independence referendum on Wikipedia.

For election prediction the method is relatively straightforward: examine readership stats on the party Wikipedia pages of the country in question, and see which page is read the most (of course that doesn’t correspond straight away to election results – would that life were so simple – and the idea of the project is to see what corrections and biases need to be accounted for to make it work). It isn’t quite so clear how to do that for Scotland, but (just for fun really) we compared the following pages:


First we look at the UK and Scotland -> interesting how Scotland has leapfrogged the UK in the last days of the independence campaign. Points to a yes victory?


In terms of flags, though, the Union Jack is well ahead of the Saltire, peaking in the last few days. Is it a last minute outbreak of unionism?


In terms of national dishes, meanwhile, Haggis has been dominating Fish and Chips for the full period of the campaign, with interest in Haggis especially spiking in the last couple of days.

Well, one of these graphs will predict the winner of the referendum: we just don’t know which one đŸ˜‰ More seriously, I think its interesting how most of these terms are spiking in the days before the vote, showing again how the social web really responds to political events.

UPDATE: Taha has passed me the comparison of the Yes and No campaign pages, as below. Yes for a narrow win following months of No dominance – you heard it here first.


By | 2014-09-18T13:39:53+00:00 September 18th, 2014|Social Web|0 Comments

Ideology and Social Structure on Twitter

Last week I was at the VOXPOL conference @ King’s College London. Vast majority of researchers were talking about terrorists and extremists, so I was a bit out of my field; though interestingly they were also all talking about big data and computational social science, which seems to be a staple in every social science conference these days. Ongoing debate about whether we need more teams of social scientists + computer scientists, or whether social scientists need to up their computing skills. I think both approaches are fine in the short term but in the long run social scientists need to skill up, as computer scientists won’t always be interested in our questions (we will want to use automatic content analysis in social science long after it becomes a boring topic in computer science, in the same way as we are still using the t test).


I gave a presentation on the relationship between ideology and social structure on twitter, arguing that political groups at the ideological extremes are more likely to exhibit closed and centralising communication patterns than those in the middle, which is an early result from a join project between myself, Diego Garzia and Alex Trechsel. The main point of the presentation was to discuss different ways of measuring closure and centralisation, which I’m still not sure about. Luckily most of our measures point in a similar direction, so I’m pretty sure there’s an interesting result in there somewhere.