hi Scot,
I think that this is the link: http://sourceforge.net/projects/epidoc/files/transcoder/
<I'm sitting less than a metre from the legend himself, so let me know if not.>
cheers
elton
________________________________________
From: Helma Dik
Sent: 04 March 2014 21:14
Subject: Re: [DIGITALCLASSICIST] more questions about Perseus TEI
Dear Scot,
You are dating yourself as a lucky puppy who never had to deal with what is called beta code before (here it is Perseus-style beta code, hence the lower case throughout) :-) Please take this as a compliment.
Hugh Cayless has written a fabulous transcoder, which is probably on the digital classics wiki. Full descriptions of beta code and Unicode Greek can be found, courtesy of Nick Nicholas, on the TLG site.
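To give a rough idea of what such a transcoder does, here is a minimal Python sketch. It is not the transcoder mentioned above, only an illustration: the function name beta_to_unicode is made up for this example, and it assumes plain beta code with the basic letters, the `*` capital marker, and the common diacritic signs. It ignores the numbered escape codes, punctuation conversion, and edge cases such as sigma before an elision mark. Each letter is mapped to its Greek base letter, each diacritic sign to a Unicode combining mark, and NFC normalization then composes them into precomposed characters.

```python
import unicodedata

# Beta code letters mapped to Greek base letters (q = theta, c = xi, etc.).
LETTERS = dict(zip("abgdezhqiklmncoprstufxyw", "αβγδεζηθικλμνξοπρστυφχψω"))

# Beta code diacritic signs mapped to Unicode combining marks.
DIACRITICS = {
    ")": "\u0313",   # smooth breathing
    "(": "\u0314",   # rough breathing
    "/": "\u0301",   # acute
    "\\": "\u0300",  # grave
    "=": "\u0342",   # circumflex (perispomeni)
    "|": "\u0345",   # iota subscript
    "+": "\u0308",   # diaeresis
}


def beta_to_unicode(text):
    """Convert basic beta code to NFC-normalized Unicode Greek (sketch only)."""
    out = []
    capital = False  # True after '*' until the next letter is emitted
    pending = []     # diacritics written between '*' and its letter
    for i, ch in enumerate(text):
        low = ch.lower()
        if ch == "*":
            capital = True
        elif ch in DIACRITICS:
            # After '*', diacritics precede the letter; otherwise they follow it.
            (pending if capital else out).append(DIACRITICS[ch])
        elif low in LETTERS:
            greek = LETTERS[low]
            nxt = text[i + 1:i + 2].lower()
            # Simplification: a sigma not followed by another letter is final.
            if low == "s" and not capital and nxt not in LETTERS:
                greek = "ς"
            if capital:
                greek = greek.upper()
            out.append(greek)
            out.extend(pending)
            pending = []
            capital = False
        else:
            out.append(ch)  # spaces, punctuation, digits pass through unchanged
    return unicodedata.normalize("NFC", "".join(out))


sample = r"*ta\s tw=n i(erw=n a)posto/lwn diadoxa\s su\n kai\ toi=s a)po\ tou= swth=ros"
print(beta_to_unicode(sample))
```

Note that NFC turns an acute accent into the "tonos" form of the precomposed character, so a comparison against text from another source is safest after normalizing both sides the same way.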
off to a meeting,
Helma
On Tue, Mar 4, 2014 at 3:08 PM, Scot Mcphee wrote:
I’ve got a simple question about the Perseus TEI data, for which I’ll apologise in advance.
In the PerseusDL canonical repository on GitHub, the Greek texts, at least all the ones that I looked at, contain the Greek text in that ASCII-Greek format you can use to search for Greek in the Perseus search box, or at least something similar to it, to wit:
*ta\s tw=n i(erw=n a)posto/lwn diadoxa\s su\n kai\ toi=s a)po\ tou= swth=ros
(I don’t know what this format is called)
But on the Perseus site, that’s clearly translated into Unicode Greek:
Τὰς τῶν ἱερῶν ἀποστόλων διαδοχὰς σὺν καὶ τοῖς ἀπὸ τοῦ σωτῆρος
Even in the XML (from the orange “XML” button):
Τὰς τῶν ἱερῶν ἀποστόλων διαδοχὰς σὺν καὶ τοῖς ἀπὸ τοῦ σωτῆρος
My question is: are there code libraries to translate the first format into Unicode? In Python? Or any programming language?
regards
Scot Mcphee
--
Scot Mcphee
Computer Programmer, Classics PhD.
p +61 412 957414
http://autonomous.org/
--
Helma Dik
Department of Classics
University of Chicago
'And how many hours a day did you do lessons?' said Alice, in a hurry to change the subject. 'Ten hours the first day,' said the Mock Turtle: 'nine the next, and so on.' 'What a curious plan!' exclaimed Alice. 'That's the reason they're called lessons,' the Gryphon remarked: 'because they lessen from day to day.'