Hi Tim
On 27/07/2012 10:42, Tim Brody wrote:
Of course the most successful format, available on by far the most
platforms and most vendors, is HTML. As the Semantic Web/schema.org gain
traction the amount of information stored in HTML will dwarf that in
dead-tree formats like Word and PDF (if it doesn't already).
Not only that, but e-book formats are essentially HTML too, EPub
particularly. I'm optimistic that, as tablets and e-book readers
continue to gain traction, it will be successful, flexible and hopefully
semantically rich rendering on those devices, that will become the
benchmark for most publications, rather than pseudo-A4. Much like what
we've been striving for on the Web for 20 years, but this time as a real
substitute for print, not an adjunct.
Not that preserving *everything* someone might choose to package up in
an EPUB3 file is necessarily going to be a picnic, but at least we're in
familiar web archiving territory ;)
Best
Richard
--
Richard M. Davis
Manager, Research Technologies Group
University of London Computer Centre (ULCC)
Senate House, Malet Street, London WC1E 7HU
t: +44 (0) 20 7863 1350
m: +44 (0) 79 3040 6197
e:
[log in to unmask]w:
http://www.ulcc.ac.uk/b:
http://dablog.ulcc.ac.uk/c:
http://tinyurl.com/richardscalendar*Save electrons* "When replying to a message, include enough original
material to be understood but no more." (RFC 1855)
The University of London is an exempt charity in England and Wales
and a charity registered in Scotland (reg. no. SC041194)