> Can anyone recommend a tool to convert batches of pdf-format files to html
> and txt?
>
> I realise that Acrobat can batch convert to rdf, but I am looking for a
> stand-alone tool.
There are tools in existence (from both freeware and commercial sources)
for, amongst other tasks, migrating pdf into open formats. Most noted are
Ghostscript, Prescript and Pstotext, which I believe are all freely
available.
Note, however, that even with these tools it is still difficult to get
accurate single representations of pdf files in open formats. It often
depends on how the original pdf file was encoded, and this will influence
whether it is possible to capture both the text and the appearance in the
open format in the same document. NOF projects might have to experiment to
come up with the most suitable solution for delivering such resources in
open formats.
There is an excellent article on migrating pdf files on the RLG DigiNews
website. The URL is
http://www.rlg.org/preserv/diginews/diginews5-1.html#feature2. This page
also features links to the tools mentioned above.
Alastair
Alastair Dunning
Information, Training and Research
Arts and Humanities Data Service
King's College London
020 7928 7848
> -----Original Message-----
> From: This list is for people who are receiving New Opportunities Fund
> Digitisation funding. [mailto:[log in to unmask]]On Behalf Of David
> Nolan
> Sent: 21 May 2002 15:47
> To: [log in to unmask]
> Subject: Batch Conversion from pdf
>
>
> Can anyone recommend a tool to convert batches of pdf-format files to html
> and txt?
>
> I realise that Acrobat can batch convert to rdf, but I am looking for a
> stand-alone tool.
>
> Thanks.
>
> --------------
> David Nolan
> Web Editor
> National Council for Voluntary Organisations
> Tel: 020 7520 2491
|