Blog
Nationalism and the Scottish Genius
It’s the smell that hits you first. Stepping into Waverley is to step into a wave of malty musk which suffuses your sinuses. Off the platform and into the car, it’s what you feel that gets you next: the juddery drive over improbably cobbly streets. And finally, what you see: Castle Rock and Arthur’s Seat, peaks that, if it’s misty, you might only be able to peek at.
Social media: the good, the bad and the ugly
‘Social media: the good, the bad, and the ugly.’ Public talk at BCS, The Chartered Institute for IT, Oxfordshire Branch, Oxford, UK, September 2014.
Big Data, Ethics, and the Social Implications of Knowledge Production
This position paper addresses current debates about data in general, and big data specifically, by examining the ethical issues arising from advances in knowledge production. Typically ethical issues such as privacy and data protection are discussed in the context of regulatory and policy debates. Here we argue that this overlooks a larger picture whereby human autonomy is undermined by the growth of scientific knowledge. To make this argument, we first offer definitions of data and big data, and then examine why the uses of data-driven analyses of human behaviour in particular have recently experienced rapid growth. Next, we distinguish between the contexts in which big data research is used, and argue that this research has quite different implications in the context of scientific as opposed to applied research. We conclude by pointing to the fact that big data analyses are both enabled and constrained by the nature of data sources available. Big data research will nevertheless inevitably become more pervasive, and this will require more awareness on the part of data scientists, policymakers and a wider public about its contexts and often unintended consequences.
Murder in the time of virality

That the beheading of journalist James Foley is ‘media’ is horrific. Whether it is ‘social’ falls on all of us.
I, like millions of others, learned about the death of journalist James Foley on social media. But it just so happened that the news was delivered to me in as sensitive and sombre a way as possible.
Social media and public opinion: what’s new?
I’m currently writing up a paper for submission to the Internet, Politics and Policy 2014 conference to be held by the OII in September. My paper – which draws substantially on interviews conducted as part of the Sloan Foundation-funded project of which I’m part – asks whether and to what extent the measurement of public opinion has been transformed by the new availability of socially-generated sources of big data, such as social media postings and search queries, and the tools which allow us to analyse them.
Streisandfreude: how the right to be forgotten may become an excuse to be remembered

The past fortnight saw the first ripples of reaction to the European Court of Justice’s assertion of a citizen’s ‘right to be forgotten’ online. Following the court’s ruling, Google began the implementation of a process whereby individuals can petition for the removal of links in search results to pages deemed objectionable.
Mapping the UK webspace: fifteen years of British universities on the web
This paper maps the national UK web presence on the basis of an analysis of the .uk domain from 1996 to 2010. It reviews previous attempts to use web archives to understand national web domains and describes the dataset. Next, it presents an analysis of the .uk domain, including the overall number of links in the archive and changes in the link density of different second-level domains over time. We then explore changes over time within a particular second-level domain, the academic subdomain .ac.uk, and compare linking practices with variables, including institutional affiliation, league table ranking, and geographic location. We do not detect institutional affiliation affecting linking practices and find only partial evidence of league table ranking affecting network centrality, but find a clear inverse relationship between the density of links and the geographical distance between universities. This echoes prior findings regarding offline academic activity, which allows us to argue that real-world factors like geography continue to shape academic relationships even in the Internet age. We conclude with directions for future uses of web archive resources in this emerging area of research.
Recreational bugs: the limits of representing the past through web archives
I am in Aarhus this week for the ‘Web Archiving and Archived Web’ seminar organised by Netlab at Aarhus University. Before the seminar got underway, I had time to walk around ‘The Old Town’ (Den Gamle By), a vibrant, open-air reconstruction of historic Danish buildings from the eighteenth century to the present. The Old Town is described as an open-air museum, but in many ways it’s much more than that: it’s filled with actors who walk around impersonating townsfolk from across history, interacting with guests to bring the old town more vividly to life.
Big Data in Bellagio: who counts, what counts, and how do we count?

One of the early discussions emerging at our ‘Big Data for Social Change’ at the Rockefeller Center in Bellagio surrounds how the act of capturing big data impinges on our understanding of it. Three strands in particular have been flagged up. Firstly, who does the counting? As Marc Ventresca has shown, the shift from ecclesiastical to secular authority in the collection of data affected perceptions of society, for example shifting the focus to the individual from the collective. The national census is not an impassive, aloof process but rather a culturally and politically significant object, reflecting and reinforcing societal debate and conflict. This significance is reflected in the 1918 observation that “the science of statistics is the chief instrumentality through which the progress of civilization is now measured, and by which its development hereafter will be largely controlled”.


