Hey Seb
I like this very much. There's some obvious IA issues, which you'll be
well aware of: the length of page is an issue (some IA work on the huge
list of "similar objects" and so on) - also I might have put the
auto-generated tags up the top near the user-generated ones..but in
general I think this adds lots to the equation. As usual you're doing
stuff, and seeing how it goes rather than waiting for the committee to
sit on it for a couple of years. And I like that, too :-)
Down here (up here?) at Eduserv, Stephen Pope has been tinkering in his
spare time with OpenCalais. I've heard him do lots of swearing about the
horriblenesses of RDF and its apparent incompatibility with an object
oriented language. He's been working on a set of tools to enable people
to work effectively with the OpenCalais output. As I understand it (not
much), the stuff he's doing will allow people to plug OpenCalais into a
.Net app and extract the data with ease. If you or anyone else is
interested, I can put you in touch or he's going to be blogging about it
at http://ultramagnus.org/ when he's got a spare second.
Did you (or anyone else on list) ever try the Yahoo! Content Analysis
API? It's obviously different to OpenCalais, but I've always found it
pretty cool - see
http://developer.yahoo.com/search/content/V1/termExtraction.html
Anyway. Nice one. Look forward to asking you about it next week
Cheers
Mike
Mike Ellis
Solutions Architect
Eduserv
[log in to unmask]
tel: 01225 470522
mob: 07017 031522
fax: 01225 474301
www.eduserv.org.uk
-----Original Message-----
From: Museums Computer Group [mailto:[log in to unmask]] On Behalf Of
Chan, Sebastian
Sent: 31 March 2008 11:39
To: [log in to unmask]
Subject: Auto-generated tagging - a museum Open Calais implementation
Folks
Down here at the Powerhouse Museum we have just switched on an
implementation of Reuters' OpenCalais service on our OPAC/collection
database.
This allows for auto-tagging of content into a variety of categories.
I've blogged about it over at Fresh & New
(http://www.powerhousemuseum.com/dmsblog/index.php/2008/03/31/opac20-ope
ncalais-meets-our-museum-collection-auto-tagging-and-semantic-parsing-of
-collection-data/).
Whilst there are some immediate problems with the way some phrases are
picked up by the parsing, the overall result is incredibly positive and
powerful, extracting significant extra value from collection records. We
are now able to extract people, place, companies and much more from
plain text and auto-create structured metadata to enhance search and
discovery.
For those interested in a real-world museum example of the potential of
the 'semantic web' this may be of particular interest . . . .
Seb
Sebastian Chan
Manager, Web Services
Powerhouse Museum
street - 500 Harris St Ultimo, NSW Australia
postal - PO Box K346, Haymarket, NSW 1238
tel - 61 2 9217 0109
fax - 61 2 9217 0689
e - [log in to unmask]
w - www.powerhousemuseum.com
b - www.powerhousemuseum.com/dmsblog
=========================Important
Notice=====================================
This email and attachments are for the use of the intended recipient(s)
only and may contain confidential or legally privileged information or
material that is copyright of Powerhouse Museum or a third party. If you
have received this email in error, please notify the sender immediately
and then delete it. If you are not the intended recipient, you must not
use, disclose or distribute this e-mail without the author's prior
permission. Any views expressed in this message and attachments are
those of the individual sender and the Powerhouse Museum accepts no
liability for the content of this message. Whilst every care has been
taken, the Powerhouse Museum cannot guarantee that the integrity of this
email has been maintained nor that the email is free of errors or
viruses. The Powerhouse Museum advises all organisations and individuals
to undertake their own virus scanning and security measures.
========================================================================
======
**************************************************
For mcg information and to manage your subscription to the list, visit
the website at http://www.museumscomputergroup.org.uk
**************************************************
Unless otherwise agreed expressly in writing by a senior manager of
Eduserv, this communication is to be treated as confidential and the
information in it may not be used or disclosed except for the purpose
for which it has been sent.
If you have reason to believe that you are not the intended recipient
of this communication, please contact the sender immediately.
No employee or agent is authorised to enter into any binding agreement
or contract on behalf of Eduserv or Eduserv Technologies Ltd., unless
that agreement is subsequently confirmed by the conclusion of a written
contract or the issue of a purchase order.
Eduserv (Limited by Guarantee) – company number 3763109 - and
Eduserv Technologies Ltd – company number – 4256630 - are both
companies incorporated in England and Wales and have their registered
offices at Queen Anne House, 11 Charlotte Street, Bath, BA1 2NE.
**************************************************
For mcg information and to manage your subscription to the list, visit the website at http://www.museumscomputergroup.org.uk
**************************************************
|