Basile Simon (blog) Data, hackery, stories

Building a datastringer

##WTF data-stringer? Who knows what a data-stringer is? A quick Google search reveals that only a couple of people actually use this word.

I was introduced to this esoteric thing in November, during a Hacks/Hackers(1) meetup. OpenNews fellow Annabel Church was running a workshop about… well, here’s her definition of a data-stringer:

Stringer journalists are freelance reporters that create news stories from remote countries or crisis zones. Data stringers, in a similar way, are small computer programmes that live in a remote data feed and ask routine journalistic questions of that data. Keeping track of what is happening in government or business, they can trigger notifications whenever events of journalistic significance occur.

Computer-assisted journalism level > 9000.

##Getting the idea Anyway, Iain, Michael, Massimo, and I pitched an idea for a stringer which would gather housing data, cost of living indexes, and energy prices, to notify a journalist about a certain raise - the topic is very popular in London, where prices are going bananas these days.

The idea stayed in my mind for some time, but I’m at it again, joined by Clément, a software engineer friend.

Annabel actually published a tool to create data-stringers, called However, I find the requirements too high for the layman journo, who is no sysadmin. Keeping that in mind, Clément and I will try to design something easier to grasp and use.

##Proof of concept I won’t have much time, as I would like to showcase the project to OpenNews mid-August. Hopefully, it will be enough to produce something at least decent.

So, we are going to take three different data sources:

  • Three ways to query them
  • And three types of results

And… try to build a proof of concept prototype which would be rather simple to scale and adapt to other types of sources. So far, we went for Rest API, which sends us nice and simple JSON files with street-level crimes and news in your neighbourhood, and also a set of PR releases, powered by an Atom RSS feed. The third source is likely to be a numbers-only CSV dataset.

##Wanna help? The project is open-source, so if you want to get your hands dirty or simply have a look at how we’re doing, head to the Datastringer repo on Github.

It will be much appreciated!

(1) If you don’t know Hacks/Hackers, head to the website to find a local chapter in your city. The goal of these meetups is to put together in a room journalists and hackers, to “reboot journalism”, and “rethink the future of news and information.” It’s a vibrant initiative, and the London chapter is taking us to incredible offices for talks and drinks. Do join!