Hi Steve
Thanks for the reply. I think, though, I was looking for a
specific answer rather than having to go on an online course!
Perhaps it would help if I gave a specific example - in the past
I have wanted to merge together a series of papers into a larger
document, and repaginate them accordingly. This was easy to do if I had
access to the MS Word master, as I could insert a section break and
restart page numbering and regenerate a table of contents. Having to
do this with PDFs is a pain, even if, as Jan Velterop has pointed out,
tools are available.
So to give an explicit example, why would you not make the master
format, which will typically but not always be MS Word, available, in
conjunction with a PDF, in a repository of project reports to make reuse
of the content easier? As a depositor I have just an file upload box,
initially to ePrints and now to Pure.
There may be valid reasons in some cases (e.g. as Jan Velterop
suggested authors may wish to protect their content from tampering) and
there may havce been reasons which are possibly no longer valid
(avoidance of MS Word macro viruses, perhaps) .
Thanks
Brian
On 25/07/2012 12:55, Steve Hitchcock wrote:
> Brian, For repositories, digital preservation isn't about Word or PDF. We are in phase 2 of a full-scale JISC programme on managing research data, we have arts repositories, and repositories with educational materials, as well as open access repositories. Can I point you to what these repositories look like in terms of file format profiles:
>
> Characterising and Preserving Digital Repositories: File Format Profiles
> http://www.ariadne.ac.uk/issue66/hitchcock-tarrant
>
> If you are right about the prevalence of policies on pdf, then these are probably the same policies we saw when embarking on a preservation project 5 years ago. Today such policies may be a convenience for some repositories, but have little to do with preservation. Digital preservation has moved on enormously in that time, and that progress is embedded in tools that are available to all. If people are aware of some or all of these tools, perhaps they are less familiar with how they can be joined up in a structured repository preservation strategy. In which case, they may also be interested in a short, but fully documented, 5-part course we ran in 2011
> http://blogs.ecs.soton.ac.uk/keepit/tag/keepit-course/
>
> File formats were covered in parts 3 and 4 of this course, i.e. they are not the starting point for digital preservation.
>
> Steve
>
> On 25 Jul 2012, at 11:49, Brian Kelly wrote:
>
>> I've always deposited an MS Word copy of my papers in my local repository, together with a PDF copy. I've done this because I've been told of the importance of preserving the master copy of a resource, rather than a lossy derivative version, such as PDF. As I've experience in having to recreate an MS Word file from a PDF copy I know this can be a cumbersome process. I assume some authors may prefer to deposit a PDF copy as this may be regarded as providing a form of DRM by making it slightly more difficult to process the file.
>>
>> What policies and practices do people have in place related to this? A Google search for "Policies on depositing MS Word files" suggests that PDFs are the norm. Since the MS Office format has been an ISO standard since 2007 I assume the proprietary versus open standard format for deposits argument is not as strong as it was (subject to caveats about support for ISO/IEC 29500 Strict
>> and the arguments about the validity of the standardisation process which I don't want to go into).
>>
>> Thanks
>>
>> Brian
>>
>> --
>> --------------------------------------------------------
>> Brian Kelly
>> Innovation Support Centre, UKOLN, University of Bath, Bath, UK, BA2 7AY
>> Phone: 01225 383943
>> Email:
>> [log in to unmask]
>>
>> Blog:
>> http://ukwebfocus.wordpress.com/
>>
>> Twitter:
>> http://twitter.com/briankelly
>>
>> Web:
>> http://isc.ukoln.ac.uk/
--
--------------------------------------------------------
Brian Kelly
Innovation Support Centre, UKOLN, University of Bath, Bath, UK, BA2 7AY
Phone: 01225 383943
Email: [log in to unmask]
Blog: http://ukwebfocus.wordpress.com/
Twitter: http://twitter.com/briankelly
Web: http://isc.ukoln.ac.uk/
|