Mapping the UK webspace: fifteen years of british universities on the web

Hale, SA, Yasseri, T, Cowls, J, Meyer, ET, Schroeder, R and H Margetts (2015) Mapping the UK webspace: fifteen years of British universities on the web. Proceedings of the 2014 ACM conference on Web science, 62-70.

This paper maps the national UK web presence on the basis of an analysis of the .uk domain from 1996 to 2010. It reviews previous attempts to use web archives to understand national web domains and describes the dataset. Next, it presents an analysis of the .uk domain, including the overall number of links in the archive and changes in the link density of different second-level domains over time. We then explore changes over time within a particular second-level domain, the academic subdomain .ac.uk, and compare linking practices with variables, including institutional affiliation, league table ranking, and geographic location. We do not detect institutional affiliation affecting linking practices and find only partial evidence of league table ranking affecting network centrality, but find a clear inverse relationship between the density of links and the geographical distance between universities. This echoes prior findings regarding offline academic activity, which allows us to argue that real-world factors like geography continue to shape academic relationships even in the Internet age. We conclude with directions for future uses of web archive resources in this emerging area of research.

Recreational bugs: the limits of representing the past through web archives

I am in Aarhus this week for the ‘Web Archiving and Archived Web’ seminar organised by Netlab at Aarhus University. Before the seminar got underway, I had time to walk around ‘The Old Town’ (Den Gamle By), a vibrant, open-air reconstruction of historic Danish buildings from the eighteenth century to the present. The Old Town is described as an open-air museum, but in many ways it’s much more than that: it’s filled with actors who walk around impersonating townsfolk from across history, interacting with guests to bring the old town more vividly to life. Continue reading “Recreational bugs: the limits of representing the past through web archives”

Big Data in Bellagio: who counts, what counts, and how do we count?

photo-2

One of the early discussions emerging at our ‘Big Data for Social Change’ at the Rockefeller Center in Bellagio surrounds how the act of capturing of big data impinges on our understanding of it. There are three strands in particular which have been flagged up. Firstly, who does the counting? As Marc Ventresca has showed, the shift from ecclesiastical to secular authority in the collection of data affected perceptions of society, for example shifting the focus to the individual from the collective. The national census is not an impassive, aloof process but rather a culturally and politically significant object, reflecting and reinforcing societal debate and conflict. This significance is reflected in the 1918 observation that, “the science of statistics is the chief instrumentality through which the progress of civilization is now measured, and by which its development hereafter will be largely controlled”. Continue reading “Big Data in Bellagio: who counts, what counts, and how do we count?”

Preserving the present: the unique challenges of archiving the web

In March 2012, as Mitt Romney was seeking to win over conservative voters in his bid to become the Republican Party’s presidential nominee, his adviser Eric Fehrnstrom discussed concerns over his appeal to moderate voters later in the campaign, telling a CNN interviewer, “For the fall campaign … everything changes. It’s almost like an Etch A Sketch. You can kind of shake it up, and we start all over again.” Fehrnstrom’s unfortunate response provided a memorable metaphor for the existing perception of Romney as a ‘flip-flopper’. Fehrnstrom’s opposite number in the Obama campaign, David Axelrod, would later jibe that “it’s hard to Etch-A-Sketch the truth away”, and indeed, tying Romney to his less appetising positions and comments formed a core component of the President’s successful re-election strategy. Continue reading “Preserving the present: the unique challenges of archiving the web”

Piecing Together the Value of Big Data

Image
The multi-tiled Kuggen building, part of Chalmers’ Lindholmen campus

During the construction of a jigsaw or model, there is invariably a moment in which one’s perception shifts from the level of ‘parts’ to the level of ‘whole’ – when, as it were, the bigger picture becomes clear. (Presumably the German language offers an elegant compound noun for this, but I am yet to come across it.) Since its ascension from first appearance to its current perch at the peak of inflated expectations, big data as a phenomenon has seemed to operate primarily on the level of parts or pieces. These usually take the form of noteworthy findings from or utilisations of big data that are eye-opening for one reason or another. Continue reading “Piecing Together the Value of Big Data”

Why social data isn’t always a reliable indicator

From XKCD

“What social data can tell you: pretty much everything” proclaimed Azeem Azhar, founder of PeerIndex, in a popular post on LinkedIn earlier this week. We can perhaps forgive Azhar the hyperbolic lead-in, but hisarticle as a whole indulges in untrammeled evangelism for social data which obscures much of the nuance and uncertainty regarding what exactly this new source of data can actually tell us about society. Continue reading “Why social data isn’t always a reliable indicator”

“Twitter says…” – Can big social data tell us about public opinion?

coffee house

“Like Noah’s ark, (there was) every kind of creature in every walk of life. They included a town wit, a grave citizen, a worthy lawyer, a worship justice, a reverend nonconformist, and a voluble sailor.”

The above description comes from a history of English coffee houses in the seventeenth century¹, but might just as well apply to the twenty first century’s sites of caffeinated conversation: online social networks. With the rapid uptake of the Internet and the more recent rise to prominence of social network sites like Facebook and Twitter, hundreds of millions of ordinary people – the witty, the worthy, and the decidedly neither – are now connected not only to the web, a source of news, but also to social networks, a source of views. Continue reading ““Twitter says…” – Can big social data tell us about public opinion?”

The new ‘power of now’ and the perils of the hyper-present

With modern technology, living life ‘in the moment’ has never been easier. But this new nowness is far from what earlier advocates had in mind, and might only be distracting us from the planet’s ever more pressing challenges. Continue reading “The new ‘power of now’ and the perils of the hyper-present”

Cyber-revisionism: Parties’ Attitudes to Web Archiving Are a Worrying Sign for Digital Democracy

ImageFollowing an earlier, somewhat rantier post on this blog when the news originally broke, I’ve written a more academically-oriented piece on British political parties’ cyber-revisionism with Mor Rubenstein, a current MSc student here at the OII. This was published yesterday on the LSE Politics and Policy blog, and you can read it here. Mor’s undergraduate work on political parties on Facebook provided a useful counterpoint to party websites, and unearthed some deep ironies regarding the (dis)integration between the two platforms and some of the unintended consequences.

(The eagle-eyed may notice that the ‘Fahrenheit 401’ hook in the original post became ‘Fahrenheit 404’ – a less phonaesthetically evocative contrast with Bradbury’s 451 but more accurate, technically speaking…)

Big Data’s People Problem

Big Data Debate

To Google Campus in east London to hear what a number of practitioners thought were the most controversial questions surrounding the use and abuse of big data. After a couple of lightning pitches from big data startups (if you dream of using augmented reality to make your exercise regime more exciting, you’ll be in luck when Google Glass is released) the event moved into a wide ranging panel discussion with participants including journalist Paul Bradshaw, Daniel Hulme, founder of Satalia and Duncan Ross, Director of Data Science at Teradata. The event was billed as tackling the controversial questions over big data, and the panelists got right down to business, eschewing talk of big data’s big potential in favour of honest reflection about the darker side of the data revolution. Continue reading “Big Data’s People Problem”