This paper is the product of a workshop that brought together practitioners, researchers, and data experts to discuss how big data is becoming a resource for positive social change in low- and middle-income countries (LMICs). We include in our definition of big data sources such as social media data, mobile phone use records, digitally mediated transactions, online news media sources, and administrative records. We argue that there are four main areas where big data has potential for promoting positive social change: advocacy; analysis and prediction; facilitating information exchange; and promoting accountability and transparency. These areas all have particular challenges and possibilities, but there are also issues shared across them, such as open data and privacy concerns. Big data is shaping up to be one of the key battlefields of our time, and the paper argues that this is therefore an opportune moment for civil society groups in particular to become a larger part of the conversation about the use of big data, since questions about the asymmetries of power involved are especially urgent in these uses in LMICs. Civil society groups are also currently underrepresented in debates about privacy and the rights of technology users, which are dominated by corporations, governments and nongovernmental organizations in the Global North. We conclude by offering some lessons drawn from a number of case studies that represent the current state-of-the-art.
Category: technology
What the Thornberry Affair tells us about politicians online
Published on the the LSE British Politics and Policy blog.
Many commentators have speculated what was going through the mind of Emily Thornberry, the shadow Attorney General, when she tweeted a picture of a terraced Rochester house draped in three England flags and a white van parked in the driveway, with the simple caption ‘Image from Rochester’. Continue reading “What the Thornberry Affair tells us about politicians online”
Who does the web think you are?
Who does the web think you are?‘ Public talk at ‘Being Human: a festival of the humanities’, London, November 2014. (With Ralph Schroeder)
Big Data in the Humanities: lessons from papyrus and Instagram
I’m currently in Washington DC to attend the IEEE International Conference on Big Data. The first day is set aside for workshops, and I’ve just attended a really insightful one on ‘Big Humanities Data’. The diversity of work presented was immense, covering a huge sweep of history: from fragments of ancient Greek text to Instagram photos taken during the Ukraine revolution, via the Irish Rebellion of 1641 and the Spanish flu outbreak of a century ago. Nonetheless, certain patterns stuck out from many of most of the talks given. Continue reading “Big Data in the Humanities: lessons from papyrus and Instagram”
The Crowd in the Cloud? Three challenges for gauging public opinion online
Social media: the good, the bad and the ugly
Social media: the good, the bad, and the ugly.‘ Public talk at BCS, The Chartered Institute for IT, Oxfordshire Branch, Oxford, UK, September 2014.
Big Data, Ethics, and the Social Implications of Knowledge Production
This position paper addresses current debates about data in general, and big data specifically, by examining the ethical issues arising from advances in knowledge production. Typically ethical issues such as privacy and data protection are discussed in the context of regulatory and policy debates. Here we argue that this overlooks a larger picture whereby human autonomy is undermined by the growth of scientific knowledge. To make this argument, we first offer definitions of data and big data, and then examine why the uses of data-driven analyses of human behaviour in particular have recently experienced rapid growth. Next, we distinguish between the contexts in which big data research is used, and argue that this research has quite different implications in the context of scientific as opposed to applied research. We conclude by pointing to the fact that big data analyses are both enabled and constrained by the nature of data sources available. Big data research will nevertheless inevitably become more pervasive, and this will require more awareness on the part of data scientists, policymakers and a wider public about its contexts and often unintended consequences.
Murder in the time of virality

That the beheading of journalist James Foley is ‘media’ is horrific. Whether it is ‘social’ falls on all of us.
I, like millions of others, learned about the death of journalist James Foley on social media. But it just so happened that the news was delivered to me in as sensitive and sombre a way as possible. Continue reading “Murder in the time of virality”
Social media and public opinion: what’s new?
I’m currently writing up a paper for submission to the Internet, Politics and Policy 2014 conference to be held by the OII in September. My paper – which draws substantially on interviews conducted as part of the Sloan Foundation-funded project of which I’m part – asks whether and to what extent the measurement of public opinion has been transformed by the new availability of socially-generated sources of big data, such as social media postings and search queries, and the tools which allow us to analyse them. Continue reading “Social media and public opinion: what’s new?”
Streisandfreude: how the right to be forgotten may become an excuse to be remembered

The past fortnight saw the first ripples of reaction to the European Court of Justice’s assertion of a citizen’s ‘right to be forgotten’ online. Following the court’s ruling, Google began the implementation of a process whereby individuals can petition for the removal of links in search results to pages deemed objectionable.

