jenny
There is nothing more practical than a good theory.
~ James C. Maxwell (1831-1879)
-----Original Message-----
From: General DCMI discussion list [mailto:[log in to unmask]] On
Behalf Of Automatic digest processor
Sent: Sunday, November 14, 2004 5:01 PM
To: Recipients of DC-GENERAL digests
Subject: DC-GENERAL Digest - 12 Nov 2004 to 14 Nov 2004 (#2004-90)
There is one message totalling 107 lines in this issue.
Topics of the day:
1. A new version of Dublin Core Services.
----------------------------------------------------------------------
Date: Sun, 14 Nov 2004 18:52:15 +0100
From: Ernesto Giralt <[log in to unmask]>
Subject: A new version of Dublin Core Services.
Hi all, and sorry for the crossposting.
A new version - 0.2(beta)- of Dublin Core Services/Describethis
(http://www.describethis.com) has been published. This new version, as =
main
feature, brings us an automatic generator of keywords: DCS incorporates =
now
a dictionary of 5300 words in 11 different languages, included Catalan,
Portuguese, Russian, Arabic, Italian, among others, that permits to
recognize and generate keywords automatically. The system applies =
analitic
algorithms to find the best terms that better describe a given resource. =
The
new terms generated are added to the ones already included in the =
document,
although these are marked visually to avoid confusions with the terms
proposed by the own authors. In the case of the HTML documents that do =
not
have included these type of metadata, the list of generated keywords can =
be
used as a guide and a valid proposal for the publishers and authors of =
these
contents. =20
=20
The current version delivers some corrected and improved features. =
Among
these, can be emphasized the following ones:=20
- The new list of metadata types and variants found in the documents =
HTML
now includes more than 70 elements in 3 different languages
- The service has incorporated a new parser to recognize and extract the
metadata for the Creative Commons licenses (see =
http://creativecommons.org).
- The RDF converter and generator has been improved to produce a valid =
and
more complete document.
- Now DCS has applied Web Standards to all the documents generated (XML,
XHTML and RDF) to include the elements that indicate the type of =
document
(DOCTYPE) and language marks. =20
- The HTML documents parser now is capable of recognizing metadata =
placed in
other tags than traditional tags like META, concretely the LINK tag and =
the
comments embedded in the body of the text. =20
=20
In addition, due to the successful application in the blogs network, our
development team has dedicated special attention to the metadata and
particular =20
characteristics of this "type" of online content. With these changes and
improvements the already existing references to DescribeThis and the =
future
ones =20
will have a metadata extraction results more extensive and detailed. =20
=20
For the following version, DescribeThis will include:=20
- An editor for Dublin Core registers and collections.=20
- A multilingual and improved interface for DescribeThis - at present =
are
almost ready the first versions in Spanish and Catalan -=20
- Selected dictionaries and thesauri to be applied to improve the
automatically generated/extracted results=20
- Features for user subscription and register, so that results can be
closest to the needs and personal profile of each one. =20
=20
We wish to thank to the specialists and users in general that have sent =
us
valuable messages with recommendations, critics and advice. In special =
to
Daniel O'Connor, Paula A Markes and Eva M=E9ndez, to whom we must thank =
for
many of the changes and improvements.
=20
Again, thanks to all. We will continue working to improve our services =
and
products. =20
=20
----------------------------------------------=20
Dublin Core Services is a set of web services that offers tools for the
description and automated analysis of online resources. Through the
interface that =20
provides DescribeThis (http://www.describethis.com) it allows the =
management
and individual processing of the metadata collections that have been =20
extracted or generated from the resources. The site offers an =
easy-to-use
interface to indicate the resource to analyze and simple options to =
download
the results like XML, XHTML or RDF files. =20
Send your messages to [log in to unmask]
--
! Ernesto Giralt=20
Team of Development of Dublin Core Services. =20
=20
------------------------------
End of DC-GENERAL Digest - 12 Nov 2004 to 14 Nov 2004 (#2004-90)
****************************************************************
|