Print

Print


I just learned about the CLTK toolkit, and it made me wonder whether there are classicists out there who would have a use for the linguistically and metrically data of Early Greek epic in WordHoard.  Below is the example of the opening line of the Iliad. The tagging is not TEI, but could be converted into it easily enough.  The morphosyntactic tagging is derived from Perseus, but has gone through a lot of manual correction, and it is, as these things go, pretty good. The metrical tagging uses a machine-friendly ad hoc notation, where the first number identifies the foot, the second the position in the foot, and the third tells you whether the second part consists of one or two syllables.  The hyphen and space tell you whether metrical transitions occur within or between words.

 

Everything in Wordhoard is in the public domain, and I’ll be happy to put the data on github if there is a demand for them.

 

 

<wordHoardTaggedLine id="IL.1.1" n="1">
                                                                               
<w id="ege-101000101"
                                                                                  lemma="μῆνις (n)"
                                                                                  pos="4201"
                                                                                  metricalShape="110-121"              >μῆνιν</w>
                                                                               
<punc                                   > </punc>
                                                                               
<w id="ege-101000102"
                                                                                  lemma="ἀείδω (v)"
                                                                                  pos="1410021"
                                                                                  metricalShape="122-210-221"          >ἄειδε</w>
                                                                               
<punc                                   > </punc>
                                                                               
<w id="ege-101000103"
                                                                                  lemma="θεά (n)"
                                                                                  pos="5201"
                                                                                  metricalShape="222-310"              >θεὰ</w>
                                                                               
<punc                                   > </punc>
                                                                               
<w id="ege-101000104"
                                                                                  lemma="Πηληϊάδης (np)"
                                                                                  pos="2101"
                                                                                  metricalShape="320-410-421-422-510"  >Πηληϊάδεω</w>
                                                                               
<punc                                   > </punc>
                                                                               
<w id="ege-101000105"
                                                                                  lemma="Ἀχιλλεύς (np)"
                                                                                  pos="2101"
                                                                                  metricalShape="521-522-610-620"      >Ἀχιλῆος</w>



To unsubscribe from the DIGITALCLASSICIST list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/webadmin?SUBED1=DIGITALCLASSICIST&A=1