I am looking at the current working draft of the multilingual working
group:
http://purl.org/DC/documents/working_drafts/wd-multilingual-current.htm
Under item "8", the following fragment of XML appear:
<rdfs:comment xml:lang="fr">
Le nom donné à la ressource par le créateur ou l'auteur.
</rdfs:comment>
Please note that because XML uses the UTF-8 (or UTF-16) character set
encoding as default, this fragment will be interpreted roughly as
follows:
<rdfs:comment xml:lang="fr">
Le nom donn? la ressource par le cr?eur ou l'auteur.
</rdfs:comment>
(where "?" will be some Chinese or Japanse ideograph).
This is probably not what was intended.
I propose that in the draft, examples incorporating non-ASCII glyphs
without expressing them in UTF-8 need to declare their character
set encoding explicitly. For instance, the above fragment becomes:
<rdfs:comment xml:lang="fr" xml:encoding="ISO-8859-1">
Le nom donné à la ressource par le créateur ou l'auteur.
</rdfs:comment>
Any comments?
--
- gisle hannemyr ( [log in to unmask] - http://home.sol.no/home/gisle/ )
------------------------------------------------------------------------
"Use the Source, Luke. Use the Source." -- apologies to Obi-Wan Kenobi
------------------------------------------------------------------------
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
|