Hi
I think that ther is a small mistake in the example 2 of "Expressing Simple
Dublin Core in RDF / XML":
<?xml version="1.0"?>
<!DOCTYPE rdf:RDF PUBLIC "-//DUBLIN CORE//DCMES DTD 2001 11 28//EN"
"http://dublincore.org/documents/2001/11/28/dcmes-xml/dcmes-xml-dtd.dtd">
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:dc ="http://purl.org/dc/elements/1.1/">
<rdf:Description rdf:about="http://dublincore.org/">
<dc:title>Dublin Core Metadata Initiative - Home Page</dc:title>
<dc:description>The Dublin Core Metadata Initiative Web
site.</dc:description>
<dc:date>1998-10-10</dc:date>
<dc:format>text/html</dc:format>
<dc:language>en</dc:language>
<dc:contributor>The Dublin Core Metadata Initiative</dc:contributor>
<!-- guesses for the translation of the above titles -->
<dc:title xml:lang="fr">L'Initiative de métadonnées du Dublin
Core</dc:title>
when a document is in XML, UTF-8 is considered the default encoding (even if
it is not mentionned on the first line) So the accented characters "é" in
the word "métadonnées" should not appear in clear as in the example 2. I
checked in the source of the code and the XHTML page is encoded using iso
8859-1 so the é is valid in the page but the XML coding written here is
wrong. Maybe you should add a paragraph to re-explain the necessity to
encode the characters greater than 127 using UTF-8 OR numeric entities (and
not character entities like "é")
in fact it is explained a couple of paragraphs before the example 2:
"All other characters outside the core US-ASCII range of 32-126 should not
be encoded with the HTML entities such as é since these are not
defined in XML. Numeric entities for the characters should be used which are
written as &#ddd; in decimal or ઼ in hexadecimal. Alternatively they
can be encoded as Unicode in one of the formats such as UTF-8 which is
widely supported."
P.S. This question is important in the non-english part of the world and
should be clarified in this normative document.
Thanks
|