I met with some Alexandra Eveleigh and Sarah Shooter from the West Yorkshire Archive Service (WYAS) this morning to talk about web archiving and in particular, ArchivePress. Though ArchivePress was initially conceived as a tool to support archiving of academic blog content, we’ve long thought that it could be used more widely. This meeting confirmed that and gave us the chance to explore some of the archival implications of using ArchivePress to collect and archive blog content.
We discussed a number of scenarios where ArchivePress could be used – not just by archival institutions (as per scenario 2 in my post below) but also by local groups who want to develop an archival collection along a particular theme and include in that collection blogged content. This is an interesting example and one which I hadn’t really thought of before – ArchivePress as a mechanism to support ‘community archiving’, of a sort. It was encouraging to get feedback from an archival perspective about our underlying premise too – that blog archiving can have a different set of requirements to other types of websites, and that blogs are one content type where the core content (ie blog posts) are easily and discernibly more important than the presentation. Sarah was quick to point out that she rarely consumes blog content via websites, receiving it straight to her iPhone instead. She rarely sees what the blog website actually looks like, let alone uses any of the content from the sidebar widgets.
Does extracting just the posts have any impact on the integrity, reliability, and authenticity of the resource? Traditional archival practices would mostly have it that an object has to be preserved in its entirity, including the preservation of context. It could be argued that by removing and preserving only blog posts rather than the whole website, a lot of important context is being lost. But then again, archival appraisal – as Alex pointed out – tends to focus on the content. Archivists have been transferring that content around and altering the objects for decades, for example by microfilming colour resources in black and white, and that has still been accceptable. So, it would seem that as long as the ArchivePress process and the original context of blog posts is sufficiently documented, our proposal should still result in resources of sufficiently high quality, integrity, and reliability to be afforded archival status.
We’ll be posting more on documenting context in due course.

July 31st, 2009 at 9:03 pm
Although (just to contradict myself!) for a user’s perspective on some of the problems caused by microfilming, and the consequent separation of content and carrier see http://bit.ly/I57rl (in French, sorry).
Not sure ‘context’ is quite the right term here. Context is a process – the chain of activities in which records are created, managed and used. ‘Look and feel’ might help establish this context (a typescript letter would look out of place in a bundle of C18th correspondence) but it isn’t impossible for this look and feel to change and yet the original context of creation and the content are still accepted as authentic. In the physical world, of course, you usually have the ‘original’ carrier to go back to verify authenticity against, but not always – the contents of the first city charter for Leeds is only known from a copy…
August 4th, 2009 at 11:57 pm
[...] a nut’, and we’ll certainly be keeping an eye out at West Yorkshire Archive Service for potential new use cases for ArchivePress’s feed-focused methodology and tools. ArchivePress should really appeal [...]
December 25th, 2010 at 8:13 pm
Hey man! I really enjoyed that comment http://www.testthisblogtoseeifitworks.com/ . Keep me informed if you submit any more.
April 11th, 2011 at 4:56 pm
[...] blog’s framing elements, and (given the much smaller scope of that project) established an acceptable rationale for doing so. The challenge for BlogForever is to find a solution to precisely these [...]