Hi all,
I have encountered a few problems with META tags embedded in the HEAD
section of HTML documents and harvesting those with a search engines.
We implemented DC in another 2500 pages on our web site. Some of the
content was more than 256 characters long and some of the DC elements
where repeated (e.g. DC.Description) .
We are using Verity Search97 (Information Server 3.01). Unfortunately
the repeated elements aren't supported at all. Meaning if I have the
Subject element repeated 3 times I can only search over the content
of the first occurrence.
Also we pre-process all documents returned from a search before they
are send to the user (include navigation bars etc.) . The processor
has some problems with content longer than 256 characters as well as
repeated elements.
Luckily their are work arounds due to Search97's extreme flexibility
(e.g. embedded XML in the HTML document). But since the Dublin Core User
Guide doesn't mention anything regarding a character limit or
potential problems with repeating META tags I was wondering what the
recommendations where based on?
* Is there a standard (content length, repeatability) outlined in the
HTML 4.0 specification?
* Has anyone else had similar problems?
* Has the implementers group ever taken a look at this issue?
Thanks,
Thomas
--
Thomas Hofmann
[technical producer]
************************************
email: [log in to unmask]
www: http://amol.org.au/
phone: + 61 (2) 92170 - 400
fax: + 61 (2) 92170 - 616
snailmail: AMOL Coordination Unit
500 Harris Street
2006 Ultimo, NSW
Australia
************************************
we are all foreigners at some point
rage against racism...
************************************
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
|