Without looking, you can’t know what’s there. That was our experience locating maps amongst the one million British Library images released to the public domain. We had not guessed that 50,000 images of maps were lurking there. So how were they singled out?
Answer: with the help of our friends (the crowd!) using several methods.
Semi-manually: A dedicated team of volunteers looked at individual images and applied the tag “map” on Flickr. The work was organised using a synoptic index in Wikimedia Commons, providing a systematic method of looking at each volume and tracking shared progress. Over 29,000 map images were identified in this way.
The British Library hosted a one-day event, in concert with Wikimedia UK, to which volunteers were invited to kick-start the effort. In between working, the 30 participants enjoyed tours and talks from speakers representing online mapping efforts, including OpenStreetMap and Stroly. The day’s activities were captured in Gregory Marler’s engaging description, Lost in Piles of Maps, and a series of photographs from ATR Creative.
One corner of the room - detail of photo by Machi Takahashi of ATR Creative, who joined the event from Tokyo and was one of the speakers. CC BY-SA 2.0
Ongoing crowd activity
The bulk of the work took place online over the next two months. With the wiki tools built by J.heald to guide and coordinate contributions, 51 volunteers approached the work book by book, often focussing on geographic areas of interest. Together, they made short work of what was a huge task: 28% of the books were completed after the first 72 hours; 60% were reviewed in the first 20 days; and after five weeks over 20,000 new maps had been found in 93% of the source volumes.
But surely maps can be identified automatically? It’s true that well before the organised effort just described, one user produced algorithm-guided tags for this image set, which resulted in the addition of well over 15,000 map tags.
By the end of December 2014, every image in every book had been reviewed, and between the manual and automatic tagging, over 50,000 maps had been found. Since then, we have been working to clean up the data, including reviewing rogue tags, rotating images, splitting maps, and removing duplicates, to derive a final set of data. Next step: georeferencing.
The tagging project was presented on 12 February 2015 at the EuropeanaTech 2015 conference as a short talk and poster, Case Study: Mapping the Maps.
This achievement represents the work of many. Special thanks go to Maurice Nicholson, BL Georeferencer participant; James Heald, Wikimedia volunteer; and Ben O’Steen of BL Labs.