Dear all,
Speaking of digitising historic texts, list members might be mildly
entertained by this item in my blog. It concerns f and s in Google Books.
http://chronographics.blogspot.com/2010/12/words-over-time.html
Stephen
On 10/08/2011 10:23, "Ed I Bremner" <[log in to unmask]> wrote:
> Dear All,
>
> MCG members interested in the cutting edge of OCR and the digitisation of
> historic text (including magazines), may well want to consider coming to the
> IMPACT Conference at the British Library on the 24-25th of October 2011.
>
> This event will showcase the results from the IMPACT project and launch the
> IMPACT Centre of Competence.
>
> IMPACT is a European project that has been developing new tools to improve
> the mass digitisation and OCR of historic text -
> See: http://www.impact-project.eu/
>
> Details of the conference are below, with a full programme at:
> http://www.impact-project.eu/news/ic2011/conference-programme/
>
>
>
> *********************************************************
>
> With this email we would like to invite you to the final conference of the
> IMPACT project, "Digitisation & OCR: Better, faster, cheaper. Solutions of
> the IMPACT Centre of Competence and future challenges" that will take place
> on 24-25 October 2011 at the British Library in London. At this conference
> IMPACT will present the final project results, along with related research
> in the field of OCR and language technology.
>
> This event will also mark the official launch of the IMPACT Centre of
> Competence. This Centre is focused on making digitisation of historical
> printed text in Europe better, faster, cheaper by sharing expertise and
> providing access to tools for all parts of the digitisation workflow, as
> well as tools, services and facilities for further advancement of the State
> of the Art in this field.
>
> The programme for the conference is now online on the conference webpage;
> highlights include:
>
> • Khalil Rouhana (European Commission - Director for digital content
> and cognitive systems in DG Information Society and Media): "The EC Digital
> Agenda and official launch of the IMPACT Centre of Competence"
> • Michael Fuchs (ABBYY Europe): "ABBYY FineReader: IMPACT
> improvements"
> • Paul Fogel (California Digital Library): "Experiences in mass
> digitisation: examining OCR quality"
> • Clemens Neudecker (National library of the Netherlands): "The IMPACT
> Framework and what you can do with it"
> • Asaf Tzadok (IBM Haifa Research Lab): "IBM Adaptive OCR engine and
> CONCERT Cooperative Correction"
> • Majlis Bremer-Laamanen (National Library of Finland): "Crowdsourcing
> for OCR correction: Experiences with Digitalkoot"
> • Katrien Depuydt (INL) and Klaus Schulz (University of Munich):
> "Language work in IMPACT"
> • Stephen Krauwer (CLARIN coordinator, University of Utrecht):
> "Related language work in CLARIN"
> • Parallel sessions on State of the art research tools for document
> analysis and OCR, IMPACT language tools & resources and Digitisation tips
> (Meet the expert).
>
> More programme updates will be announced through
> http://www.impact-project.eu/news/ic2011/conference-programme/ and Twitter
> (hashtag: #impactconf2011). Registration is now possible at the regular fee
> of 120 GBP. To register, please go to this BL ticket website and click
> October. More information is also available from the attached flyer.
>
>
> *********************************************************************
>
> Best Wishes
>
> Ed Bremner - IMPACT Project
> UKOLN
> [log in to unmask]
> SKYPE: ed.bremner
>
> ******************************
> Ed I Bremner
> Consultant and Trainer in Digital Media
> BremWeb Imaging
> www.bremweb.co.uk
> [log in to unmask]
> 07973 335509
> ******************************
>
>
> -----Original Message-----
> From: Museums Computer Group [mailto:[log in to unmask]] On Behalf Of Adam
> Waterton
> Sent: 10 August 2011 09:41
> To: [log in to unmask]
> Subject: Re: Software for digitising magazines
>
> Hi Trevor,
> We recently undertook a project to digitise and create machine readable
> versions of a series of Royal Academy of Arts exhibition catalogues
> (1870-1913). We tried a few OCR packages and also found that Abbyy
> Finereader http://finereader.abbyy.com/ gave good results. However, the
> resulting text files were still very inaccurate and required an enormous
> amount of manual tidying up to make them accurate enough for consistent
> searching. Also, Abbyy is not cheap and the costs will mount up if you need
> a separate Abbyy licence for each of your volunteers.
>
> The results of our digitisation project can be seen here:
> http://www.racollection.org.uk/ixbin/indexplus?_IXACTION_=file&_IXFILE_=templates/pages/exhibition_list.html
>
> Regards,
> Adam.
>
> Adam Waterton
> Head of Library Services
> Royal Academy of Arts
> Burlington House
> Piccadilly
> London
> W1V 0DS
>
> T: 020 7300 5740 | F: 020 7300 5765 | E: [log in to unmask]
>
> The Royal Academy of Arts Collection Online: www.racollection.org.uk
>
> -----Original Message-----
> From: Museums Computer Group [mailto:[log in to unmask]] On Behalf Of
> Howell, Alan
> Sent: 09 August 2011 10:09
> To: [log in to unmask]
> Subject: Re: Software for digitising magazines
>
> Hi Trevor
>
> I have used ABBYY FineReader for some projects at home and found it to be
> very effective at this sort of thing.
>
> Kind regards
>
> Alan Howell
> Guernsey Museums & Galleries
> SSDDI +44 (0) 1481 709736
>
>
> -----Original Message-----
> From: Museums Computer Group [mailto:[log in to unmask]] On Behalf Of
> REYNOLDS, Trevor
> Sent: 06 August 2011 09:49
> To: [log in to unmask]
> Subject: Software for digitising magazines
>
> Dear all
>
> A volunteer run charity I'm involved with wants to digitise the back issues
> of its periodicals.
>
> What they want to end up with is PDF/A format documents with a scanned image
> of each page with searchable text underneath the image. Many of the early
> issues have poor quality text and any OCRed text will probably need heavy
> editing.
>
> Can you recommend software which will enable this to be done? They are
> intending to split the work between a number of volunteers who will be
> working at home on their own computers so low cost, easy to use solutions
> would be welcome!
>
> Trevor Reynolds
> Collections Registrar, English Heritage
> 37 Tanner Row, York, YO1 6WP tel: 01904 601905
>
> Portico: your gateway to information on sites in the National Heritage
> Collection; have a look and tell us what you think.
> http://www.english-heritage.org.uk/professional/archives-and-collections/portico/
>
> ****************************************************************
> website: http://museumscomputergroup.org.uk/
> Twitter: http://www.twitter.com/ukmcg
> Facebook: http://www.facebook.com/museumscomputergroup
> [un]subscribe: http://museumscomputergroup.org.uk/email-list/
> ****************************************************************
> The Royal Academy of Arts is a registered charity under Registered Charity
> Number 1125383 and is also registered as a company limited by guarantee in
> England and Wales under Company Number 6298947. Registered office:
> Burlington House, Piccadilly, London, W1J 0BD.
>
--
_____________________________________________________________
Stephen Boyd Davis
Reader in Interactive Media
Head, Art and Design Research Institute
Head, Lansdown Centre for Electronic Arts
Middlesex University, Cat Hill, Barnet, Herts EN4 8HT
United Kingdom
Tel 44 (0)20 8411 5072
.............................................................
The Lansdown Centre's Web Pages are at http://www.cea.mdx.ac.uk/