Tony Gill <[log in to unmask]> wrote:
>
> Their homepage contains the following:
>
> <META name="keywords" content="
> art, british, art, british, art, british, art, british, art, british,
> art, british, art, british, art, british, art, british, art, british,
> [this goes on for another 5 lines]
> lateral arts, painting, sculpture, photography, installation, public art, mixed media,
> lateral arts, painting, sculpture, photography, installation, public art, mixed media,
> [this goes on for another 6 lines]
> etc>
>
> These people have obviously cottoned on to robot relevance ranking by counting the
> occurrence of keyword hits; surely this is a dishonest waste of bandwidth?
Yes, very 'dishonest'... but indexers should not use the
same algorithm to index metadata and full text (HTML).
That is, when indexing metadata, don't rank by word frequency: the
metadata is a 'summary' of the document, so repeating a keyword there
should count for no more than listing it once.
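
As a rough illustration only (not any real robot's algorithm), here is a
minimal Python sketch of the idea. The names tokenize, score and
meta_weight are made up for this example, and it assumes single-word
queries: body text is scored by term frequency, while the META keywords
are treated as a set, so a term scores once however often it is repeated.

```python
import re


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def score(query, keywords_meta, body_text, meta_weight=2.0):
    """Hypothetical relevance score for a single-word query.

    Body text contributes its term frequency; metadata contributes a
    fixed bonus (meta_weight, an assumed value) if the term is present.
    """
    term = query.lower()

    # Full text: counting occurrences is reasonable here.
    body_hits = tokenize(body_text).count(term)

    # Metadata: a summary, so only presence matters, not repetition.
    meta_terms = set(tokenize(keywords_meta))
    meta_hit = 1 if term in meta_terms else 0

    return body_hits + meta_weight * meta_hit


if __name__ == "__main__":
    stuffed = "art, british, " * 50   # keyword-stuffed META content
    honest = "art, british"           # each keyword listed once
    print(score("art", stuffed, ""), score("art", honest, ""))
```

With this scheme the stuffed and the honest keyword lists get the same
score, so the extra lines buy the page nothing.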
Cheers... Renato
_______________________________________________________________________
Dr Renato Iannella http://www.dstc.edu.au/RDU/staff/ri
Research Data Network CRC urn:inet:dstc.edu.au:renato:home
DSTC Pty Ltd, Gehrmann Laboratories phone/fax: +61 7 3365 4310/11
University of Queensland, 4072, AUSTRALIA email: [log in to unmask]