
BUDDAH

Data Mining, Digital Preservation

The Big UK Domain Data for the Arts and Humanities project was funded by the AHRC to build on the work we started in AADDA, but to work much more closely with individual historians by providing bursaries to support their direct engagement with us and with our search service.

Click here to explore UK Web History

During this process, I expanded our indexing capabilities, using the lessons learned and feedback from the AADDA project to further develop our indexing software and to scale the index itself up to 3.5 billion resources (1996-2013).

More importantly, during this process I started to understand how this historical research use case differs from traditional search and information retrieval models. This has significant consequences not only for how we index our archives, but all the way back through the life-cycle of the content, changing the way we crawl the web.

