Collecting Data To Improve Tools · 2014-11-20 · 402 words · Mining-Web-Archives, Web Archives, Webarchive-Discovery, Data Mining
Web Archiving In The JavaScript Age · 2014-08-11 · 458 words · Mining-Web-Archives, Web Archives, Digital Preservation, Data Mining
How much of the UK's HTML is valid? · 2014-07-02 · 820 words · Mining-Web-Archives, Web Archives, Digital Preservation, Data Mining
BUDDAH · 147 words · Data Mining, Digital Preservation · Big UK Domain Data for the Arts and Humanities project to build a prototype historical search engine.
OPF Blog: Analysing the formats in the UK Web Archive · 2012-08-17 · 581 words · Mining-Web-Archives, Data Mining, Digital Preservation, Web Archives, Webarchive-Discovery
AADDA · 224 words · Data Mining, Digital Preservation · The Analytical Access to the Dark Domain Archive (AADDA) Project.
Experimenting with Hadoop · 2010-12-14 · 403 words · Mining-Web-Archives, Data Mining, Web Archives, Digital Preservation