At the Library of Congress we would like to begin instructing staff to put
meta information in the Web documents they put up. Of course we would like
to support the Dublin Core, but current search engines aren't programmed
to use it. Certainly the META tag can be used, as has been discussed on
this list. However, AltaVista and Infoseek can currently use only
two meta tags: "description" and "keywords". In the DC those map to Subject
with role=abstract and to Subject without a qualifier. If we encode them
the DC way, the search engines won't be able to use them now. We all need something
NOW to help us find what we want on the Web. All this discussion
brings up the following questions.
1. Why are we lumping an abstract into the Subject field? Aren't keywords
and abstracts different enough that they warrant their own fields?
Abstracts are very useable in search and retrieval, and one could imagine
wanting to limit a search to an abstract. They are also sufficiently
different from keywords in that their stop words shouldn't be indexed. Also,
don't we want to stay consistent with what the search engines are
already doing, so that once the guidelines are developed enough for
everyone to start using metadata, we can grandfather in what has
already been done? Having to use Subject and role= for the abstract makes
it harder to create metadata; don't we want to keep it simple for anyone
off the street to use? Can we consider having two different elements for
what is now "Subject" and make them consistent with AltaVista (Description
and Keywords)? Those who want to go further could still
qualify Keywords by scheme=LCSH or whatever.
2. I can't remember when the "DC" part of the META NAME was added (e.g.
DC.subject). Using it implies there is some other scheme out there. Is
there really any other attempt to standardize meta information that
requires us to include "DC"? Isn't the LINK REL enough to identify that Dublin Core
is being used? Again, can't we use it as it has already been used, without
specifying "DC"? If we want our scheme to be the standard, then we
wouldn't want to be forever having to put in "DC", since it adds
complexity to adding metadata for the average person.
3. Does anyone know of any progress with getting the Web search engines to
use DC meta elements? Why haven't they jumped at the chance to make some
order out of chaos?
We have a bit of a dilemma here in deciding what meta information to put
in our documents, because we want to support the Dublin Core but need to
have something that can be used by search engines right now. We considered
putting it in both ways (the form that AltaVista can use now, repeated
in the Dublin Core form), but keying it in twice seems too much to ask
of people.
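
To make the "both ways" option concrete, here is a rough sketch, not a
proposal for the syntax. It assumes a small Python helper (the names
meta_tag and build_meta are hypothetical) that takes the abstract and
keywords once and writes out both the form the search engines read today
and a DC form. The "(role=abstract)" spelling of the qualifier is only an
illustration of the idea, not settled Dublin Core syntax.

```python
from html import escape


def meta_tag(name, content):
    """Return one HTML META element with attribute values escaped."""
    return '<META NAME="{}" CONTENT="{}">'.format(
        escape(name, quote=True), escape(content, quote=True)
    )


def build_meta(abstract, keywords):
    """Build both tag forms from a single abstract and keyword list."""
    joined = ", ".join(keywords)
    return [
        # Form that AltaVista and Infoseek can use now.
        meta_tag("description", abstract),
        meta_tag("keywords", joined),
        # Dublin Core form. Assumption: the qualifier spelling below is
        # illustrative only; the agreed DC syntax may differ.
        meta_tag("DC.subject", "(role=abstract) " + abstract),
        meta_tag("DC.subject", joined),
    ]


if __name__ == "__main__":
    tags = build_meta(
        "Guidelines for adding metadata to Web documents.",
        ["metadata", "Dublin Core", "cataloging"],
    )
    for tag in tags:
        print(tag)
```

With something like this the staff member keys the information in once,
and the duplication happens in the generated HTML rather than at the
keyboard.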
Rebecca
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
^^ Rebecca S. Guenther ^^
^^ Senior MARC Standards Specialist ^^
^^ Network Development and MARC Standards Office ^^
^^ Library of Congress ^^
^^ Washington, DC 20540-4020 ^^
^^ (202) 707-5092 (voice) (202) 707-0115 (FAX) ^^
^^ ^^
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^