On 16/07/11 21:41, Notis Toufexis • Νότης Τουφεξής wrote:
> You can convwert Word files automatically to TEI/XML here:
>
> http://oxgarage.oucs.ox.ac.uk:8080/ege-webclient/
>
> Once in TEI/XML it's quite easy to move them to EpiDoc.
I would, of course, second this recommendation. This uses the standard
TEI-C stylesheets to move between formats (e.g.docx to tei). I thought
that EpiDoc was moving to being a pure TEI P5 subset rather than an
extension. If that isn't the case, if the EpiDoc community produces
both a pure TEI P5 to EpiDoc and an EpiDoc to pure TEI P5 XSLT
transformation, we could include it in the OxGarage set of
transformations so you could go direct from docx (or format of choice)
to EpiDoc.
> I would only recommend using FileMaker Pro as an intermediate stage
> only if your data/files have a very simple structure.
When it comes to document-centric data with arbitrarily deep levels of
nesting, I tend to view relational databases as a lossy output form from
a richer XML format. So much like people generate lossy HTML from their
rich TEI, they can also generate lossy relational databases. (This
doesn't mean it isn't possible to do this non-lossy, but just that there
is nothing wrong with producing say CSV subset to import into a database
to examine one aspect or other.)
-James
--
Dr James Cummings, InfoDev,
OUCS, University of Oxford
|