Dear Marco,

I played around with Lemlat a while ago and I was very impressed. Thank you for distributing the code and databases under open access licenses. I have a few questions that might be of general interest:

1. The GitHub repo contains the database and binaries of the command-line tool for Linux, Mac and Windows, but no source code. Is the source available somewhere?

2. When you use "all lexical bases", which is the default option, there are lots of duplicate analyses. I think this may be because Lemlat does not prune the results when the same lemma is found in more than one lexicon. Is that right? If so, is that a feature you plan on adding? It would be nice to be able to say: "give me all the lemmata in the *OLD* and only those in Du Cange which are not in the *OLD*".

3. Is there any indication in the output of which lexicon a given lemma was found in, so that the user can trace it back to the relevant entry for that word?

Thanks again for making this fantastic resource available.

Peter

On Mon, 3 Sep 2018 at 15:33, Passarotti Marco Carlo <[log in to unmask]> wrote:

> Dear Members of the List,
>
> we are proud to announce the recent enhancement of the lexical basis
> of Lemlat with the Du Cange Glossary.
>
> Lemlat is a morphological analyser and lemmatiser of Latin provided with a
> large lexical basis including:
> - the collation of *three Latin dictionaries* (Georges and Georges,
> 1913-1918; Glare, 1982; Gradenwitz, 1904): 43,432 lemmas [including also
> relations between lemmas based on derivational morphology];
> - *Onomasticon* by Forcellini (1940): 26,250 lemmas;
> - *Glossarium Mediae et Infimae Latinitatis* by Du Cange (1883-1887):
> 82,556 lemmas.
>
> Enlarging the lexical basis of Lemlat with the Du Cange Glossary
> significantly increases its coverage of a wide span of Latin texts from
> different eras.
>
> Information about Lemlat can be found at www.lemlat3.eu.
> The database and binaries of Lemlat are available at
> https://github.com/CIRCSE/LEMLAT3
>
> Enjoy Lemlat!
> All best,
>
> Prof. Marco C. Passarotti
> Computational Linguistics
> *Index Thomisticus* Treebank https://itreebank.marginalia.it/
> ERC Grantee, P.I. LiLa
> CIRCSE Research Centre (https://centridiricerca.unicatt.it/circse_index.html)
> ***********************************************************
> Università Cattolica del Sacro Cuore
> Largo Gemelli, 1
> 20123 Milan, Italy
> [log in to unmask]
> tel. +39-02-72342380
>
> ------------------------------
>
> To unsubscribe from the DIGITALCLASSICIST list, click the following link:
> https://www.jiscmail.ac.uk/cgi-bin/webadmin?SUBED1=DIGITALCLASSICIST&A=1
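
To make the request in question 2 concrete, here is a minimal Python sketch of the kind of pruning I have in mind. It is only an illustration under stated assumptions: the `Analysis` record, the lexicon tags "OLD" and "DUCANGE", and the function `prune_by_priority` are all hypothetical, and nothing here reflects Lemlat's actual output format or any existing option of the tool. It also presupposes that each analysis carries a source tag, which is exactly what question 3 asks about.

```python
from typing import Iterable, List, NamedTuple, Sequence


class Analysis(NamedTuple):
    """Hypothetical record; Lemlat's real output format is not assumed."""
    lemma: str
    pos: str
    lexicon: str  # hypothetical source tag, e.g. "OLD" or "DUCANGE"


def prune_by_priority(analyses: Iterable[Analysis],
                      priority: Sequence[str]) -> List[Analysis]:
    """Keep one analysis per (lemma, pos), preferring earlier lexica.

    Analyses from lexica not named in `priority` are dropped.
    """
    rank = {name: i for i, name in enumerate(priority)}
    best = {}
    for a in analyses:
        if a.lexicon not in rank:
            continue  # lexicon was not requested at all
        key = (a.lemma, a.pos)
        kept = best.get(key)
        # Replace the kept analysis only if this one has higher priority.
        if kept is None or rank[a.lexicon] < rank[kept.lexicon]:
            best[key] = a
    return list(best.values())


# Invented sample data, not real Lemlat output.
sample = [
    Analysis("lemma_a", "N", "DUCANGE"),
    Analysis("lemma_a", "N", "OLD"),
    Analysis("lemma_b", "N", "DUCANGE"),
]
pruned = prune_by_priority(sample, priority=["OLD", "DUCANGE"])
for a in pruned:
    print(a.lemma, a.pos, a.lexicon)
```

With the priority list `["OLD", "DUCANGE"]`, a lemma found in both lexica is reported once, from the *OLD*, while a lemma found only in Du Cange is still reported. Keying on lemma plus part of speech is a design choice for the sketch; a real implementation might need a finer key to keep homographs apart.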