Revealing the Value of Open Data
17 Jun 2015



The big data and data analytics capabilities of the Hartree Centre and its partner IBM are enabling Democrata to help construction firms predict the presence of archaeological remains.






Construction and archaeology are both, literally, ground-breaking businesses. But they can come into conflict when building projects unexpectedly hit hidden objects or structures that must be identified, recorded and excavated – resulting in frustrating delays, spiralling costs, expensive overruns and perhaps complete cancellation of a project. Data analytics start-up Democrata came up with an ambitious and appropriately ground-breaking answer: to develop a model that predicted where in the UK ancient remains were likely to be located, based on the swathes of open data about previous finds, settlement patterns and landscape characteristics held by government, universities and organisations such as English Heritage, Ordnance Survey and the Land Registry.​


Turning concept into reality demanded world-class capabilities in bringing together and analysing big, highly diverse datasets, and in producing an agile but robust model with real commercial potential. After winning a Hartree Centre/Big Innovation Centre competition aiming to stimulate commercial exploitation of open data, Democrata invested its £25,000 proof of concept prize from STFC in the use of advanced supercomputing and visualisation capabilities provided by the Hartree and IBM (including IBM's cutting-edge software and its BigInsights infrastructure at the Centre). This involved standardising, combining and mining data, digging deep into historic documents for relevant information, and using the data to create sophisticated predictive algorithms. A UK-wide predictive 'layer' was produced comprising 200 million individual points – one for every 50m across the country. 


The resulting model is a customisable tool that reveals where construction firms are likely to encounter archaeological disruption, enabling alternative construction sites to be found or disruption to be factored into plans, schedules and budgets. With the average cost of archaeological work relating to major infrastructure projects amounting to 1-3% of expenditure – equivalent to tens of millions of pounds in some cases – the potential benefits in terms of better margins and sharpened competitive edge are substantial. The model could also help archaeologists decide where to conduct excavations, while wider environmental impact assessment processes will benefit from improved ability to predict risk when planning major projects.​​



“In the context of our mission to democratise data and make it usable, our ability to harness Hartree's computing platform, plus IBM software and skills, equipped us to achieve proof of concept in the shortest possible time and to produce a prototype we can use to enthuse and inspire people with our vision."


Geoff Roberts