JiscMail Logo
Email discussion lists for the UK Education and Research communities

Help for DC-ARCHITECTURE Archives


DC-ARCHITECTURE Archives

DC-ARCHITECTURE Archives


DC-ARCHITECTURE@JISCMAIL.AC.UK


View:

Message:

[

First

|

Previous

|

Next

|

Last

]

By Topic:

[

First

|

Previous

|

Next

|

Last

]

By Author:

[

First

|

Previous

|

Next

|

Last

]

Font:

Proportional Font

LISTSERV Archives

LISTSERV Archives

DC-ARCHITECTURE Home

DC-ARCHITECTURE Home

DC-ARCHITECTURE  March 2002

DC-ARCHITECTURE March 2002

Options

Subscribe or Unsubscribe

Subscribe or Unsubscribe

Log In

Log In

Get Password

Get Password

Subject:

Re: DC-ARCHITECTURE Digest - 10 Mar 2002 to 11 Mar 2002 (#2002-35 )

From:

Simon Cox <[log in to unmask]>

Reply-To:

This list, which supersedes dc-datamodel, dc-schema, and dc-implementors, i" <[log in to unmask]>

Date:

Tue, 12 Mar 2002 10:03:18 +0800

Content-Type:

text/plain

Parts/Attachments:

Parts/Attachments

text/plain (1192 lines)

Jane -
I've been down this track a couple of times so have one comment to add:

There is one way only in W3C XML Schema of defining
an element from which other elements can be derived
which can have either simpleContent or complexContent -
that is to declare an element with anyType that acts
as the head of a substitution group:
either

<element name="DCField" type="anyType"/>
or
<element name="DCField"/>

where in the latter construction the type is implied.
Then any other element of any type can be declared
to be in its substitution group, e.g.

<element name="DCDate" type="date" substitutionGroup="DCField"/>

where "date" is a built-in simpleType from XML Schema, or

<element name="DCCoverage" type="your:favoriteCoverageEncodingType"
substitutionGroup="DCField"/>

where favoriteCoverageEncodingType is a complexType in "your" namespace.
But don't try deriving simpleContent types by restriction from
an anyType parent - there is "magic" involved in deriving the
simpleTypes side of the type hierarchy which is not to be
exposed in real schema documents!

Also, do not make the mistake of defining an empty complexType
and then trying to add simpleContent - it doesn't work.

And you also cannot derive specific simpleContent types
from string, unless using lexical patterns is acceptable.

And even the correct XML Schema syntax for deriving types
by restriction from anySimpleType is muddy - different
schema validators seem to have different interpretations
of the spec (there is a bug in XML Spy, for example -
I can forward my correspondence with Altova if anyone is interested!)

_____
[log in to unmask]  CSIRO Exploration & Mining
26 Dick Perry Avenue, Kensington WA 6151
PO Box 1130, Bentley WA 6102  AUSTRALIA
T: +61 (8) 6436 8639  F: +61 (8) 6436 8555  C: +61 (4) 0330 2672
http://www.csiro.au/page.asp?type=resume&id=CoxSimon

> -----Original Message-----
> From: Automatic digest processor [mailto:[log in to unmask]]
> Sent: Tuesday, 12 March 2002 8:01 AM
> To: Recipients of DC-ARCHITECTURE digests
> Subject: DC-ARCHITECTURE Digest - 10 Mar 2002 to 11 Mar 2002
> (#2002-35)
>
>
> There are 6 messages totalling 1037 lines in this issue.
>
> Topics of the day:
>
>   1. Public Comment on DC-simple XML Schema declaration within OAI (5)
>   2. Public Comment on DC-simple XML Schema
> declaration within OAI
>
> ----------------------------------------------------------------------
>
> Date:    Mon, 11 Mar 2002 10:15:20 +1000
> From:    Jane Hunter <[log in to unmask]>
> Subject: Re: Public Comment on DC-simple XML Schema
> declaration within OAI
>
> Dear all,
>
> My original concern with using the 'string' datatype was not just with
> the need for more complex structured derivations but also with the
> inability to derive other simple and commonly-required datatypes such
> as URIs (for dc:identity) or dates (for dc:date) from 'string'.
>
> I'm by no means an XML Schema expert but, one suggestion is that you
> consider using 'anySimpleType' rather than 'string':
>
>    http://www.w3.org/TR/xmlschema-2/#built-in-datatypes
>
> At least then users can derive some other useful datatypes.
>
> And just to clarify, the current proposal is to have 3 XML Schemas?
>
> 1. An XML Schema for 'simple DC' - using either 'string' or
> 'anySimpleType' for the element types
>
> 2. An XML Schema from which 'complex DC' and application profiles can
> be derived - using 'anyType' for the element types
>
> 3. An XML Schema for 'qualified DC'.
>
> jane
>
> > I'll add my pat on the back to this.  I think that Andy has
> successfully put the cap on this discussion.
> >
> > I raised the xHTML possibility to test the waters on
> whether HTML falls into the realm of lingua franca that serve
> as 'appropriate literals' for DCMI. This was inspired by the
> fact that xs:string in xml land is not the good old 7-bit
> ASCII that we know and love but unicode, which is a big step
> up the complexity ladder. I wasn't sure that adding html
> tagging to that stepped out of bounds, but figured it was
> worth asking.  I withdraw that shredded trial balloon.
> >
> > Moral of the story: you've got to draw the line somewhere
> and I think that Andy's argument is as good as it gets.
> >
> > Carl
> >
> > > -----Original Message-----
> > > From: Dan Brickley [mailto:[log in to unmask]]
> > > Sent: Saturday, March 09, 2002 4:55 AM
> > > To: [log in to unmask]
> > > Subject: Re: Public Comment on DC-simple XML Schema
> declaration within
> > > OAI
> > >
> > >
> > > This is one of the clearest articulations of the problem I've
> > > heard in a long while.
> > >
> > > We know how to deal with simple string values, and how to
> qualify them
> > > (subproperties and datatypes, in RDF-ese). We don't know
> yet how to
> > > preserve widearea interoperability and data re-use as we move
> > > from this
> > > realtively well understood realm towards more complex, highly
> > > strucured
> > > multi-namespace data exchange. It's OK, that's a
> difficult problem for
> > > everyone...
> > >
> > > Dan
> > >
> > > On Sat, 9 Mar 2002, Andy Powell wrote:
> > >
> > > > On Fri, 8 Mar 2002, Carl Lagoze wrote:
> > > >
> > > > > Andy, is there any reason in your mind why we should NOT
> > > allow xHTML
> > > > > in the values of the simple schema.  My first
> reaction is that it
> > > > > opens the door sufficiently to allow things like MathML
> > > but does not
> > > > > push it wide open into unrestricted land.
> > > >
> > > > I'm not really sure... but my feeling is that 'simple DC'
> > > means simple
> > > > literal string values and *nothing* else.
> > > >
> > > > Here's my thinking...
> > > >
> > > > We need to have a shared understanding of what we mean by a
> > > 'simple DC'
> > > > application.
> > > >
> > > > Currently, that understanding has to work across
> > > applications that are not
> > > > based on XML.  I'm thinking here about applications that
> > > carry metadata
> > > > using non-XML encodings such as HTML4/meta, Z39.50/GRS-1,
> > > Z39.50/MARC, ...
> > > >
> > > > Any two 'simple DC' applications should be able to exchange
> > > all their
> > > > metadata.  Anything encoded in one 'simple DC' application,
> > > should be able
> > > > to be encoded in another with no loss of data.
> > > >
> > > > 'Simple DC' is our most dumbed-down form of metadata.  It
> > > provides our
> > > > base level of interoperability between different services.
> > > (Simple DC is
> > > > the equivalent of the plain text rendered by, say, lynx -
> > > not the (X)HTML
> > > > page on which that rendering is based).
> > > >
> > > > XHTML carried in DC element values does *not* feel like
> > > 'simple DC' to me.
> > > > I completely agree that it would be a useful thing to be
> > > able to do (and
> > > > at least one of the services I'm involved in would like to
> > > be able to do
> > > > it!) - but I wouldn't call it 'simple DC' and I don't think
> > > support for it
> > > > should appear in a 'simple DC' XML schema.
> > > >
> > > > Finally, as an aside...
> > > >
> > > > It seems to me that the problems associated with trying to embed
> > > > structured content within DC element values is the aspect
> > > of DC that we
> > > > understand least currently.  I think we now have a good
> > > understanding of
> > > > qualified DC in terms of element refinement and encoding
> > > schemes.  But we
> > > > still don't know how to handle structured content very
> > > well.  For example,
> > > > we have the rather messy situation in which it is not clear
> > > if the DCMI
> > > > Box encoding scheme is a 'formatted string' or an 'XML
> > > application' or
> > > > both. I think this stems from an acknowledgement that
> we can do some
> > > > things in XML-based applications that we can't do in text-based
> > > > applications but without being quite sure how to handle
> > > that properly in
> > > > practice.
> > > >
> > > > Andy.
> > > >
> > > > > > -----Original Message-----
> > > > > > From: Roland Schwaenzl
> > > > > > [mailto:[log in to unmask]]
> > > > > > Sent: Friday, March 08, 2002 1:17 PM
> > > > > > To: [log in to unmask]
> > > > > > Subject: Re: Public Comment on DC-simple XML Schema
> > > declaration within
> > > > > > OAI
> > > > > >
> > > > > >
> > > > > > > From [log in to unmask] Fri Mar  8
> > > 16:30 MET 2002
> > > > > > > content-class: urn:content-classes:message
> > > > > > > MIME-Version: 1.0
> > > > > > > X-MimeOLE: Produced By Microsoft Exchange V6.0.5762.3
> > > > > > > Thread-Topic:      Re: Public Comment on DC-simple XML
> > > > > > Schema declaration
> > > > > > >                    within OAI
> > > > > > > Thread-Index: AcHGrDpz/EG80hNpQAmDGuSf4riG1QACWQMg
> > > > > > > Date:         Fri, 8 Mar 2002 10:30:05 -0500
> > > > > > > From: Carl Lagoze <[log in to unmask]>
> > > > > > > Subject:      Re: Public Comment on DC-simple XML Schema
> > > > > > declaration within OAI
> > > > > > > Comments: cc: Herbert Van de Sompel
> <[log in to unmask]>
> > > > > > > To: [log in to unmask]
> > > > > > > Content-Transfer-Encoding: 8bit
> > > > > > > X-MIME-Autoconverted: from quoted-printable to 8bit by
> > > > > > scarlett.mathematik.Uni-Osnabrueck.DE id QAA20116
> > > > > > >
> > > > > > > Roland,
> > > > > > >
> > > > > > > Regarding point 1:  The prohibition as you state it sound
> > > > > > pretty draconian; seems like some of the people originally
> > > > > > motivated OAI (eprints folks) would want mathML.  Remind me
> > > > > > again, is there a solution that allows things like
> mathML but
> > > > > > forbid arbitrary other XML?
> > > > > >
> > > > > > >
> > > > > >
> > > > > > Sure, you could (!) do that. W3C's xml-schema-primer has an
> > > > > > example, with content restricted to XHTML:
> > > > > > http://www.w3.org/TR/xml-schema-0   sec.5.5. It's
> the example
> > > > > > preceding the textType example.
> > > > > > In particular table 4 in that section is quite useful as
> > > > > > summary of built in facilities.
> > > > > > There's a similar technique with attributes.
> > > > > >
> > > > > > One could try: http://www.w3.org/1998/Math/MathML for a
> > > namespace URI.
> > > > > >
> > > > > > There's some use of MathML embedded into XHTML -
> > > > > > (cf. processContents="skip" in the example cited above)
> > > > > >
> > > > > >
> > > > > > rs
> > > > > >
> > > > > >
> > > > > >
> > > > > > > Carl
> > > > > > >
> > > > > > > I've copied to Herbert Van de Somple because the MathML
> > > > > > thing might be of concern to him also.
> > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > > -----Original Message-----
> > > > > > > > From: Roland Schwaenzl
> > > > > > > > [mailto:[log in to unmask]]
> > > > > > > > Sent: Friday, March 08, 2002 9:19 AM
> > > > > > > > To: [log in to unmask]
> > > > > > > > Subject: Re: Public Comment on DC-simple XML Schema
> > > > > > declaration within
> > > > > > > > OAI
> > > > > > > >
> > > > > > > >
> > > > > > > > Dears,
> > > > > > > >
> > > > > > > > with us it's now the time for reports rather than
> > > development.
> > > > > > > > I'll not be able to follow this discussion the
> coming week.
> > > > > > > >
> > > > > > > >
> > > > > > > > Let me try to summarize, what i understand
> currently from
> > > > > > the dc-xml
> > > > > > > > discussion.
> > > > > > > >
> > > > > > > >
> > > > > > > > 1. In OAi the use of dc:elements with the xml- simple
> > > > > > > >    dataType "string" will (continue to) be required in
> > > > > > > >    the mandatory part of OAi.
> > > > > > > >    It could be, that OAi allows a dataType extension by
> > > > > > > >    the xml:lang attribute.
> > > > > > > >
> > > > > > > > [In particular no mark-up from W3C's MathML or Ruby will
> > > > > > be allowed in
> > > > > > > >  oai-dc records].
> > > > > > > >
> > > > > > > > 2. There are mixed views on (details of)
> > > requirements, design
> > > > > > > > and coding
> > > > > > > >    for dcmi supported "plain-xml"-schemes.
> > > > > > > >
> > > > > > > >
> > > > > > > > Please correct me on mistaken points.
> > > > > > > >
> > > > > > > >
> > > > > > > > Best,
> > > > > > > > rs
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > > > Andy
> > > > --
> > > > Distributed Systems, UKOLN, University of Bath, Bath,
> BA2 7AY, UK
> > > > http://www.ukoln.ac.uk/ukoln/staff/a.powell       +44
> 1225 383933
> > > > Resource Discovery Network http://www.rdn.ac.uk/
> > > >
> > >
>
> ------------------------------
>
> Date:    Mon, 11 Mar 2002 10:08:53 +0000
> From:    Rachel Heery <[log in to unmask]>
> Subject: Re: Public Comment on DC-simple XML Schema
> declaration within OAI
>
> On Mon, 11 Mar 2002, Jane Hunter wrote:
>
> >
> > And just to clarify, the current proposal is to have 3 XML Schemas?
> >
> > 1. An XML Schema for 'simple DC' - using either 'string' or
> > 'anySimpleType' for the element types
> >
> > 2. An XML Schema from which 'complex DC' and application
> profiles can
> > be derived - using 'anyType' for the element types
> >
> > 3. An XML Schema for 'qualified DC'.
>
>
> If we are going this multipe schema route I think an
> additional schema for
> unqualified Dc i.e. v1.1 15 elements plus 'audience' should
> be a priority.
> With same element type as 'simple DC'.
>
> Rachel
>
> >
> > jane
> >
> > > I'll add my pat on the back to this.  I think that Andy
> has successfully put the cap on this discussion.
> > >
> > > I raised the xHTML possibility to test the waters on
> whether HTML falls into the realm of lingua franca that serve
> as 'appropriate literals' for DCMI. This was inspired by the
> fact that xs:string in xml land is not the good old 7-bit
> ASCII that we know and love but unicode, which is a big step
> up the complexity ladder. I wasn't sure that adding html
> tagging to that stepped out of bounds, but figured it was
> worth asking.  I withdraw that shredded trial balloon.
> > >
> > > Moral of the story: you've got to draw the line somewhere
> and I think that Andy's argument is as good as it gets.
> > >
> > > Carl
> > >
> > > > -----Original Message-----
> > > > From: Dan Brickley [mailto:[log in to unmask]]
> > > > Sent: Saturday, March 09, 2002 4:55 AM
> > > > To: [log in to unmask]
> > > > Subject: Re: Public Comment on DC-simple XML Schema
> declaration within
> > > > OAI
> > > >
> > > >
> > > > This is one of the clearest articulations of the problem I've
> > > > heard in a long while.
> > > >
> > > > We know how to deal with simple string values, and how
> to qualify them
> > > > (subproperties and datatypes, in RDF-ese). We don't
> know yet how to
> > > > preserve widearea interoperability and data re-use as we move
> > > > from this
> > > > realtively well understood realm towards more complex, highly
> > > > strucured
> > > > multi-namespace data exchange. It's OK, that's a
> difficult problem for
> > > > everyone...
> > > >
> > > > Dan
> > > >
> > > > On Sat, 9 Mar 2002, Andy Powell wrote:
> > > >
> > > > > On Fri, 8 Mar 2002, Carl Lagoze wrote:
> > > > >
> > > > > > Andy, is there any reason in your mind why we should NOT
> > > > allow xHTML
> > > > > > in the values of the simple schema.  My first
> reaction is that it
> > > > > > opens the door sufficiently to allow things like MathML
> > > > but does not
> > > > > > push it wide open into unrestricted land.
> > > > >
> > > > > I'm not really sure... but my feeling is that 'simple DC'
> > > > means simple
> > > > > literal string values and *nothing* else.
> > > > >
> > > > > Here's my thinking...
> > > > >
> > > > > We need to have a shared understanding of what we mean by a
> > > > 'simple DC'
> > > > > application.
> > > > >
> > > > > Currently, that understanding has to work across
> > > > applications that are not
> > > > > based on XML.  I'm thinking here about applications that
> > > > carry metadata
> > > > > using non-XML encodings such as HTML4/meta, Z39.50/GRS-1,
> > > > Z39.50/MARC, ...
> > > > >
> > > > > Any two 'simple DC' applications should be able to exchange
> > > > all their
> > > > > metadata.  Anything encoded in one 'simple DC' application,
> > > > should be able
> > > > > to be encoded in another with no loss of data.
> > > > >
> > > > > 'Simple DC' is our most dumbed-down form of metadata.  It
> > > > provides our
> > > > > base level of interoperability between different services.
> > > > (Simple DC is
> > > > > the equivalent of the plain text rendered by, say, lynx -
> > > > not the (X)HTML
> > > > > page on which that rendering is based).
> > > > >
> > > > > XHTML carried in DC element values does *not* feel like
> > > > 'simple DC' to me.
> > > > > I completely agree that it would be a useful thing to be
> > > > able to do (and
> > > > > at least one of the services I'm involved in would like to
> > > > be able to do
> > > > > it!) - but I wouldn't call it 'simple DC' and I don't think
> > > > support for it
> > > > > should appear in a 'simple DC' XML schema.
> > > > >
> > > > > Finally, as an aside...
> > > > >
> > > > > It seems to me that the problems associated with
> trying to embed
> > > > > structured content within DC element values is the aspect
> > > > of DC that we
> > > > > understand least currently.  I think we now have a good
> > > > understanding of
> > > > > qualified DC in terms of element refinement and encoding
> > > > schemes.  But we
> > > > > still don't know how to handle structured content very
> > > > well.  For example,
> > > > > we have the rather messy situation in which it is not clear
> > > > if the DCMI
> > > > > Box encoding scheme is a 'formatted string' or an 'XML
> > > > application' or
> > > > > both. I think this stems from an acknowledgement that
> we can do some
> > > > > things in XML-based applications that we can't do in
> text-based
> > > > > applications but without being quite sure how to handle
> > > > that properly in
> > > > > practice.
> > > > >
> > > > > Andy.
> > > > >
> > > > > > > -----Original Message-----
> > > > > > > From: Roland Schwaenzl
> > > > > > > [mailto:[log in to unmask]]
> > > > > > > Sent: Friday, March 08, 2002 1:17 PM
> > > > > > > To: [log in to unmask]
> > > > > > > Subject: Re: Public Comment on DC-simple XML Schema
> > > > declaration within
> > > > > > > OAI
> > > > > > >
> > > > > > >
> > > > > > > > From [log in to unmask] Fri Mar  8
> > > > 16:30 MET 2002
> > > > > > > > content-class: urn:content-classes:message
> > > > > > > > MIME-Version: 1.0
> > > > > > > > X-MimeOLE: Produced By Microsoft Exchange V6.0.5762.3
> > > > > > > > Thread-Topic:      Re: Public Comment on DC-simple XML
> > > > > > > Schema declaration
> > > > > > > >                    within OAI
> > > > > > > > Thread-Index: AcHGrDpz/EG80hNpQAmDGuSf4riG1QACWQMg
> > > > > > > > Date:         Fri, 8 Mar 2002 10:30:05 -0500
> > > > > > > > From: Carl Lagoze <[log in to unmask]>
> > > > > > > > Subject:      Re: Public Comment on DC-simple XML Schema
> > > > > > > declaration within OAI
> > > > > > > > Comments: cc: Herbert Van de Sompel
> <[log in to unmask]>
> > > > > > > > To: [log in to unmask]
> > > > > > > > Content-Transfer-Encoding: 8bit
> > > > > > > > X-MIME-Autoconverted: from quoted-printable to 8bit by
> > > > > > > scarlett.mathematik.Uni-Osnabrueck.DE id QAA20116
> > > > > > > >
> > > > > > > > Roland,
> > > > > > > >
> > > > > > > > Regarding point 1:  The prohibition as you
> state it sound
> > > > > > > pretty draconian; seems like some of the people originally
> > > > > > > motivated OAI (eprints folks) would want mathML.
> Remind me
> > > > > > > again, is there a solution that allows things
> like mathML but
> > > > > > > forbid arbitrary other XML?
> > > > > > >
> > > > > > > >
> > > > > > >
> > > > > > > Sure, you could (!) do that. W3C's
> xml-schema-primer has an
> > > > > > > example, with content restricted to XHTML:
> > > > > > > http://www.w3.org/TR/xml-schema-0   sec.5.5. It's
> the example
> > > > > > > preceding the textType example.
> > > > > > > In particular table 4 in that section is quite useful as
> > > > > > > summary of built in facilities.
> > > > > > > There's a similar technique with attributes.
> > > > > > >
> > > > > > > One could try: http://www.w3.org/1998/Math/MathML for a
> > > > namespace URI.
> > > > > > >
> > > > > > > There's some use of MathML embedded into XHTML -
> > > > > > > (cf. processContents="skip" in the example cited above)
> > > > > > >
> > > > > > >
> > > > > > > rs
> > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > > Carl
> > > > > > > >
> > > > > > > > I've copied to Herbert Van de Somple because the MathML
> > > > > > > thing might be of concern to him also.
> > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > > -----Original Message-----
> > > > > > > > > From: Roland Schwaenzl
> > > > > > > > > [mailto:[log in to unmask]]
> > > > > > > > > Sent: Friday, March 08, 2002 9:19 AM
> > > > > > > > > To: [log in to unmask]
> > > > > > > > > Subject: Re: Public Comment on DC-simple XML Schema
> > > > > > > declaration within
> > > > > > > > > OAI
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > Dears,
> > > > > > > > >
> > > > > > > > > with us it's now the time for reports rather than
> > > > development.
> > > > > > > > > I'll not be able to follow this discussion
> the coming week.
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > Let me try to summarize, what i understand
> currently from
> > > > > > > the dc-xml
> > > > > > > > > discussion.
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > 1. In OAi the use of dc:elements with the xml- simple
> > > > > > > > >    dataType "string" will (continue to) be required in
> > > > > > > > >    the mandatory part of OAi.
> > > > > > > > >    It could be, that OAi allows a dataType
> extension by
> > > > > > > > >    the xml:lang attribute.
> > > > > > > > >
> > > > > > > > > [In particular no mark-up from W3C's MathML
> or Ruby will
> > > > > > > be allowed in
> > > > > > > > >  oai-dc records].
> > > > > > > > >
> > > > > > > > > 2. There are mixed views on (details of)
> > > > requirements, design
> > > > > > > > > and coding
> > > > > > > > >    for dcmi supported "plain-xml"-schemes.
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > Please correct me on mistaken points.
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > Best,
> > > > > > > > > rs
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > > > Andy
> > > > > --
> > > > > Distributed Systems, UKOLN, University of Bath, Bath,
> BA2 7AY, UK
> > > > > http://www.ukoln.ac.uk/ukoln/staff/a.powell       +44
> 1225 383933
> > > > > Resource Discovery Network http://www.rdn.ac.uk/
> > > > >
> > > >
> >
>
>
> --------------------------------------------------------------
> -------------
> Rachel Heery
> UKOLN
> University of Bath                              tel: +44
> (0)1225 826724
> Bath, BA2 7AY, UK                               fax: +44
> (0)1225 826838
> http://www.ukoln.ac.uk/
>
> ------------------------------
>
> Date:    Mon, 11 Mar 2002 06:59:45 -0500
> From:    Carl Lagoze <[log in to unmask]>
> Subject: Re: Public Comment on DC-simple XML Schema
> declaration within OAI
>
> Hi Jane,
>
> Its not clear to me that there is a clear proposal for the number of =
> schema (schemata?), except for the fact that there will be more than =
> one.  The one we know about at this time is "simple dc'.
> There may be a =
> schema for qualified dc, the conceptual basis of which is [1], where =
> qualification is fairly tightly constrained.  The answer to whether =
> there will be other schema for a much fuzzier area that you
> are calling =
> complex dc is unclear. =20
>
> Your message implies that such a schema is necessary for application =
> profiles but that leads to a much more expansive view of applicaiton =
> profiles than I understand.  For example, it is possible to write a =
> schema based on the simple dc schema for an app profile
> defined as "must =
> include one dc creator element and on dc title element and three =
> elements from FGDC".  Your definition of application profile
> seems to =
> include more complex structures such as dc elements rooting
> sub-trees =
> comprised of elements from mixed namespaces.  Its not clear
> to me that =
> we understand this world yet and how to control it.
>
> Finally, regarding anySimpleType: I had not heard of such an animal =
> until your email but went and found it in the datatype part
> of the xml =
> schema spec.  I played with it in xml spy just to check it
> out.  From my =
> brief look around it indicates that it has no real effect on
> the derived =
> instance documents relative to xs:string.  Is then your intention to =
> make it possible for deriving schema (those that import it)
> to have an =
> adequate restriction base?
>
> Carl=20
>
>
> [1] http://dublincore.org/documents/2000/07/11/dcmes-qualifiers/
>
> > -----Original Message-----
> > From: Jane Hunter [mailto:[log in to unmask]]
> > Sent: Sunday, March 10, 2002 7:15 PM
> > To: [log in to unmask]
> > Subject: Re: Public Comment on DC-simple XML Schema
> declaration within
> > OAI
> >=20
> >=20
> > Dear all,
> >=20
> > My original concern with using the 'string' datatype was
> not just with
> > the need for more complex structured derivations but also with the
> > inability to derive other simple and commonly-required
> datatypes such
> > as URIs (for dc:identity) or dates (for dc:date) from 'string'.
> >=20
> > I'm by no means an XML Schema expert but, one suggestion is that you
> > consider using 'anySimpleType' rather than 'string':
> >=20
> >    http://www.w3.org/TR/xmlschema-2/#built-in-datatypes
> >=20
> > At least then users can derive some other useful datatypes.
> >=20
> > And just to clarify, the current proposal is to have 3 XML Schemas?
> >=20
> > 1. An XML Schema for 'simple DC' - using either 'string' or
> > 'anySimpleType' for the element types
> >=20
> > 2. An XML Schema from which 'complex DC' and application
> profiles can
> > be derived - using 'anyType' for the element types
> >=20
> > 3. An XML Schema for 'qualified DC'.
> >=20
> > jane
> >=20
> > > I'll add my pat on the back to this.  I think that Andy has=20
> > successfully put the cap on this discussion.
> > >
> > > I raised the xHTML possibility to test the waters on=20
> > whether HTML falls into the realm of lingua franca that serve=20
> > as 'appropriate literals' for DCMI. This was inspired by the=20
> > fact that xs:string in xml land is not the good old 7-bit=20
> > ASCII that we know and love but unicode, which is a big step=20
> > up the complexity ladder. I wasn't sure that adding html=20
> > tagging to that stepped out of bounds, but figured it was=20
> > worth asking.  I withdraw that shredded trial balloon.
> > >
> > > Moral of the story: you've got to draw the line somewhere=20
> > and I think that Andy's argument is as good as it gets.
> > >
> > > Carl
> > >
> > > > -----Original Message-----
> > > > From: Dan Brickley [mailto:[log in to unmask]]
> > > > Sent: Saturday, March 09, 2002 4:55 AM
> > > > To: [log in to unmask]
> > > > Subject: Re: Public Comment on DC-simple XML Schema=20
> > declaration within
> > > > OAI
> > > >
> > > >
> > > > This is one of the clearest articulations of the problem I've
> > > > heard in a long while.
> > > >
> > > > We know how to deal with simple string values, and how to=20
> > qualify them
> > > > (subproperties and datatypes, in RDF-ese). We don't know=20
> > yet how to
> > > > preserve widearea interoperability and data re-use as we move
> > > > from this
> > > > realtively well understood realm towards more complex, highly
> > > > strucured
> > > > multi-namespace data exchange. It's OK, that's a=20
> > difficult problem for
> > > > everyone...
> > > >
> > > > Dan
> > > >
> > > > On Sat, 9 Mar 2002, Andy Powell wrote:
> > > >
> > > > > On Fri, 8 Mar 2002, Carl Lagoze wrote:
> > > > >
> > > > > > Andy, is there any reason in your mind why we should NOT
> > > > allow xHTML
> > > > > > in the values of the simple schema.  My first=20
> > reaction is that it
> > > > > > opens the door sufficiently to allow things like MathML
> > > > but does not
> > > > > > push it wide open into unrestricted land.
> > > > >
> > > > > I'm not really sure... but my feeling is that 'simple DC'
> > > > means simple
> > > > > literal string values and *nothing* else.
> > > > >
> > > > > Here's my thinking...
> > > > >
> > > > > We need to have a shared understanding of what we mean by a
> > > > 'simple DC'
> > > > > application.
> > > > >
> > > > > Currently, that understanding has to work across
> > > > applications that are not
> > > > > based on XML.  I'm thinking here about applications that
> > > > carry metadata
> > > > > using non-XML encodings such as HTML4/meta, Z39.50/GRS-1,
> > > > Z39.50/MARC, ...
> > > > >
> > > > > Any two 'simple DC' applications should be able to exchange
> > > > all their
> > > > > metadata.  Anything encoded in one 'simple DC' application,
> > > > should be able
> > > > > to be encoded in another with no loss of data.
> > > > >
> > > > > 'Simple DC' is our most dumbed-down form of metadata.  It
> > > > provides our
> > > > > base level of interoperability between different services.
> > > > (Simple DC is
> > > > > the equivalent of the plain text rendered by, say, lynx -
> > > > not the (X)HTML
> > > > > page on which that rendering is based).
> > > > >
> > > > > XHTML carried in DC element values does *not* feel like
> > > > 'simple DC' to me.
> > > > > I completely agree that it would be a useful thing to be
> > > > able to do (and
> > > > > at least one of the services I'm involved in would like to
> > > > be able to do
> > > > > it!) - but I wouldn't call it 'simple DC' and I don't think
> > > > support for it
> > > > > should appear in a 'simple DC' XML schema.
> > > > >
> > > > > Finally, as an aside...
> > > > >
> > > > > It seems to me that the problems associated with
> trying to embed
> > > > > structured content within DC element values is the aspect
> > > > of DC that we
> > > > > understand least currently.  I think we now have a good
> > > > understanding of
> > > > > qualified DC in terms of element refinement and encoding
> > > > schemes.  But we
> > > > > still don't know how to handle structured content very
> > > > well.  For example,
> > > > > we have the rather messy situation in which it is not clear
> > > > if the DCMI
> > > > > Box encoding scheme is a 'formatted string' or an 'XML
> > > > application' or
> > > > > both. I think this stems from an acknowledgement that=20
> > we can do some
> > > > > things in XML-based applications that we can't do in
> text-based
> > > > > applications but without being quite sure how to handle
> > > > that properly in
> > > > > practice.
> > > > >
> > > > > Andy.
> > > > >
> > > > > > > -----Original Message-----
> > > > > > > From: Roland Schwaenzl
> > > > > > > [mailto:[log in to unmask]]
> > > > > > > Sent: Friday, March 08, 2002 1:17 PM
> > > > > > > To: [log in to unmask]
> > > > > > > Subject: Re: Public Comment on DC-simple XML Schema
> > > > declaration within
> > > > > > > OAI
> > > > > > >
> > > > > > >
> > > > > > > > From [log in to unmask] Fri Mar  8
> > > > 16:30 MET 2002
> > > > > > > > content-class: urn:content-classes:message
> > > > > > > > MIME-Version: 1.0
> > > > > > > > X-MimeOLE: Produced By Microsoft Exchange V6.0.5762.3
> > > > > > > > Thread-Topic:      Re: Public Comment on DC-simple XML
> > > > > > > Schema declaration
> > > > > > > >                    within OAI
> > > > > > > > Thread-Index: AcHGrDpz/EG80hNpQAmDGuSf4riG1QACWQMg
> > > > > > > > Date:         Fri, 8 Mar 2002 10:30:05 -0500
> > > > > > > > From: Carl Lagoze <[log in to unmask]>
> > > > > > > > Subject:      Re: Public Comment on DC-simple XML Schema
> > > > > > > declaration within OAI
> > > > > > > > Comments: cc: Herbert Van de Sompel=20
> > <[log in to unmask]>
> > > > > > > > To: [log in to unmask]
> > > > > > > > Content-Transfer-Encoding: 8bit
> > > > > > > > X-MIME-Autoconverted: from quoted-printable to 8bit by
> > > > > > > scarlett.mathematik.Uni-Osnabrueck.DE id QAA20116
> > > > > > > >
> > > > > > > > Roland,
> > > > > > > >
> > > > > > > > Regarding point 1:  The prohibition as you
> state it sound
> > > > > > > pretty draconian; seems like some of the people originally
> > > > > > > motivated OAI (eprints folks) would want mathML.
> Remind me
> > > > > > > again, is there a solution that allows things like=20
> > mathML but
> > > > > > > forbid arbitrary other XML?
> > > > > > >
> > > > > > > >
> > > > > > >
> > > > > > > Sure, you could (!) do that. W3C's
> xml-schema-primer has an
> > > > > > > example, with content restricted to XHTML:
> > > > > > > http://www.w3.org/TR/xml-schema-0   sec.5.5. It's=20
> > the example
> > > > > > > preceding the textType example.
> > > > > > > In particular table 4 in that section is quite useful as
> > > > > > > summary of built in facilities.
> > > > > > > There's a similar technique with attributes.
> > > > > > >
> > > > > > > One could try: http://www.w3.org/1998/Math/MathML for a
> > > > namespace URI.
> > > > > > >
> > > > > > > There's some use of MathML embedded into XHTML -
> > > > > > > (cf. processContents=3D"skip" in the example cited above)
> > > > > > >
> > > > > > >
> > > > > > > rs
> > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > > Carl
> > > > > > > >
> > > > > > > > I've copied to Herbert Van de Somple because the MathML
> > > > > > > thing might be of concern to him also.
> > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > > -----Original Message-----
> > > > > > > > > From: Roland Schwaenzl
> > > > > > > > > [mailto:[log in to unmask]]
> > > > > > > > > Sent: Friday, March 08, 2002 9:19 AM
> > > > > > > > > To: [log in to unmask]
> > > > > > > > > Subject: Re: Public Comment on DC-simple XML Schema
> > > > > > > declaration within
> > > > > > > > > OAI
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > Dears,
> > > > > > > > >
> > > > > > > > > with us it's now the time for reports rather than
> > > > development.
> > > > > > > > > I'll not be able to follow this discussion the=20
> > coming week.
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > Let me try to summarize, what i understand=20
> > currently from
> > > > > > > the dc-xml
> > > > > > > > > discussion.
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > 1. In OAi the use of dc:elements with the xml- simple
> > > > > > > > >    dataType "string" will (continue to) be required in
> > > > > > > > >    the mandatory part of OAi.
> > > > > > > > >    It could be, that OAi allows a dataType
> extension by
> > > > > > > > >    the xml:lang attribute.
> > > > > > > > >
> > > > > > > > > [In particular no mark-up from W3C's MathML
> or Ruby will
> > > > > > > be allowed in
> > > > > > > > >  oai-dc records].
> > > > > > > > >
> > > > > > > > > 2. There are mixed views on (details of)
> > > > requirements, design
> > > > > > > > > and coding
> > > > > > > > >    for dcmi supported "plain-xml"-schemes.
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > Please correct me on mistaken points.
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > Best,
> > > > > > > > > rs
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > > > Andy
> > > > > --
> > > > > Distributed Systems, UKOLN, University of Bath, Bath,=20
> > BA2 7AY, UK
> > > > > http://www.ukoln.ac.uk/ukoln/staff/a.powell       +44=20
> > 1225 383933
> > > > > Resource Discovery Network http://www.rdn.ac.uk/
> > > > >
> > > >
> >=20
>
> ------------------------------
>
> Date:    Mon, 11 Mar 2002 15:52:46 -0000
> From:    Pete Johnston <[log in to unmask]>
> Subject: Re: Public Comment on DC-simple XML Schema
> declaration within OAI
>
> Apologies... I've been away from this for a couple of days and I'm
> struggling to absorb it all, but just picking up one point here...
>
> Jane said:
>
> > I'm by no means an XML Schema expert but, one suggestion is
> > that you consider using 'anySimpleType' rather than 'string':
> >
> >    http://www.w3.org/TR/xmlschema-2/#built-in-datatypes
> >
> > At least then users can derive some other useful datatypes.
> >
> > And just to clarify, the current proposal is to have 3 XML Schemas?
> >
> > 1. An XML Schema for 'simple DC' - using either 'string' or
> > 'anySimpleType' for the element types
>
> Is there not a problem that the current "base" element type in the
> simpledc schema isn't a simpleType? i.e. if we allow the use of the
> xml:lang attribute (which was part of the initial proposal for "simple
> DC", and I don't _think_ has been rejected along the way?), I
> think that
> makes the "base" element type a "complexType" in XML Schema
> terms, which
> introduces the problem pointed out by Roland here
>
> http://www.jiscmail.ac.uk/cgi-bin/wa.exe?A2=ind0203&L=dc-archi
> tecture&F=
> &S=&P=7240
>
> of deriving simpleTypes from that starting point.
>
> Does this mean we need separate "per-DC-element" types, where some
> elements have complexTypes permitting xml:lang and others have only
> simpleTypes based only on "anySimpleType"?
>
> Pete
>
> ------------------------------
>
> Date:    Mon, 11 Mar 2002 18:24:43 -0000
> From:    Pete Johnston <[log in to unmask]>
> Subject: Re: Public Comment on DC-simple XML Schema
>    declaration
>          within OAI
>
> Rachel said:
>
> > What I would like to see is one XML schema for all DC
> > terms... given that we now have elements in 2 different
> > namespaces I think we have to accept the need for a schema
> > containing all terms.... surely it will be a nonsense to have
> > a schema with qualifiers and one element??
>
> Andy said:
>
> > If a schema representing all DCMI terms is a useful building block,
> > then fine, lets create an XML schema for it that others can import.
> > But I'm not convinced it is BTW.
>
> My understanding is that because of the way XML Schema works with XML
> namespaces, it's not possible to model elements/attributes from two
> namespaces within one XML Schema. So we will necessarily have
> a minimum
> of two "base schemas" for DCMI, one each for the elements in the two
> namespaces http://purl.org/dc/elements/1.1/ and
> http://purl.org/dc/terms/   This contrasts with RDF Schema where you
> could put the descriptions of the terms in the two namespaces in one
> file (if you really wanted to do so!)
>
> So I don't think Rachel's requirement can be realised directly.
>
> And following Carl and Andy's messages at
>
> http://www.jiscmail.ac.uk/cgi-bin/wa.exe?A2=ind0203&L=dc-archi
> tecture&F=
> &S=&P=8602
>
> http://www.jiscmail.ac.uk/cgi-bin/wa.exe?A2=ind0203&L=dc-archi
> tecture&F=
> &S=&P=8322
>
> I think we're saying that, given the need for different datatyping
> requirements for using elements from the _same_ namespace in
> _different_
> application contexts, there may be a requirement for multiple XML
> Schemas to represent the terms in one namespace.
>
> What I'm still not sure about is whether these multiple schemas for,
> say, http://purl.org/dc/elements/1.1/ are to be completely independent
> of each other, or whether we should try to build explicit
> relationships
> between them, or more specifically between the datatypes used within
> them.
>
> My understanding is that we can do the former quite easily, but trying
> to do the latter introduces some (for me, at least) quite difficult
> questions of datatype hierarchies.
>
> Cheers
> Pete
>
> ------------------------------
>
> Date:    Tue, 12 Mar 2002 09:26:43 +1000
> From:    Jane Hunter <[log in to unmask]>
> Subject: Re: Public Comment on DC-simple XML Schema
> declaration within OAI
>
> Dear Pete,
>
> Sorry I missed that point raised by Roland - but I'm not sure you're
> right. Its certainly more complex but you can apply restrictions to
> complexTypes. I think you can do the following but I need to test it
> with XML Spy:
>
> In the 'simple DC' XML Schema DCMI namepace you define the generic
> elementType:
>
>   <complexType name="elementType"
>     <simpleContent>
>       <extension base="anySimpleType">
>         <attribute ref="x:lang" use="optional"/>
>       </extension>
>     </simpleContent>
>   </complexType>
>
>   <element name="date" type="elementType/>
>
>
> Then if a user wants their date element to use the 'date' datatype,
> they define their own element based on the dc:elementType, in
> their own
> schema:
>
>   <element name="myDate">
>   <xs:complexType>
>     <xs:simpleContent>
>       <xs:restriction base="dc:elementType">
>         <simpleContent>
>           <extension base="date">
>             <attribute ref="x:lang" use="optional"/>
>           </extension>
>         </simpleContent>
>       </xs:restriction>
>     </xs:simpleContent>
>   </xs:complexType>
>   </xs:element>
>
>
> This is pretty complex, so I'm starting to think the pure and simple,
> string and vanilla (sounds like a song) approach is the easiest. If
> anyone wants anything other than strings, or facets of strings, then
> they have to declare their own element, but we could consider
> providing
> a 'semantics' attribute which can be included in the user's
> definition,
> to point to the DC element definition:
>
>     <element name="myDate" type="xsd:date" xx:semantics=
>          "http://www.dublincore.org/documents/1999/07/02/dces/#date"/>
>
> Since the semantic definition of each element is the important stable
> part of DC, then this seems a sensible approach (to me at least).
>
> jane
>
> > Is there not a problem that the current "base" element type in the
> > simpledc schema isn't a simpleType? i.e. if we allow the use of the
> > xml:lang attribute (which was part of the initial proposal
> for "simple
> > DC", and I don't _think_ has been rejected along the way?),
> I think that
> > makes the "base" element type a "complexType" in XML Schema
> terms, which
> > introduces the problem pointed out by Roland here
> >
> >
http://www.jiscmail.ac.uk/cgi-bin/wa.exe?A2=ind0203&L=dc-architecture&F=
> &S=&P=7240
>
> of deriving simpleTypes from that starting point.
>
> Does this mean we need separate "per-DC-element" types, where some
> elements have complexTypes permitting xml:lang and others have only
> simpleTypes based only on "anySimpleType"?
>
> Pete

------------------------------

End of DC-ARCHITECTURE Digest - 10 Mar 2002 to 11 Mar 2002 (#2002-35)
*********************************************************************

Top of Message | Previous Page | Permalink

JiscMail Tools


RSS Feeds and Sharing


Advanced Options


Archives

February 2024
January 2024
October 2023
September 2023
August 2023
July 2023
June 2023
May 2023
April 2023
March 2023
February 2023
January 2023
December 2022
November 2022
September 2022
August 2022
June 2022
May 2022
April 2022
March 2022
February 2022
January 2022
November 2021
October 2021
September 2021
August 2021
July 2021
May 2021
April 2021
March 2021
February 2021
January 2021
December 2020
November 2020
September 2020
August 2020
July 2020
June 2020
April 2020
March 2020
February 2020
January 2020
December 2019
November 2019
September 2019
August 2019
July 2019
June 2019
May 2019
April 2019
March 2019
February 2019
January 2019
December 2018
November 2018
October 2018
September 2018
August 2018
July 2018
June 2018
May 2018
April 2018
March 2018
February 2018
January 2018
December 2017
November 2017
September 2017
August 2017
July 2017
June 2017
May 2017
April 2017
March 2017
February 2017
January 2017
December 2016
November 2016
October 2016
September 2016
August 2016
July 2016
June 2016
May 2016
April 2016
March 2016
February 2016
January 2016
December 2015
November 2015
October 2015
September 2015
August 2015
July 2015
June 2015
May 2015
April 2015
March 2015
February 2015
January 2015
December 2014
November 2014
October 2014
September 2014
August 2014
July 2014
June 2014
May 2014
April 2014
March 2014
February 2014
January 2014
November 2013
October 2013
September 2013
August 2013
July 2013
June 2013
May 2013
April 2013
March 2013
February 2013
January 2013
December 2012
November 2012
October 2012
September 2012
August 2012
July 2012
June 2012
May 2012
April 2012
March 2012
February 2012
January 2012
December 2011
November 2011
October 2011
September 2011
August 2011
July 2011
June 2011
May 2011
April 2011
March 2011
February 2011
January 2011
December 2010
November 2010
October 2010
September 2010
August 2010
July 2010
June 2010
May 2010
April 2010
March 2010
February 2010
January 2010
December 2009
November 2009
October 2009
September 2009
August 2009
July 2009
June 2009
May 2009
April 2009
March 2009
February 2009
January 2009
November 2008
October 2008
September 2008
August 2008
July 2008
June 2008
May 2008
April 2008
February 2008
January 2008
December 2007
November 2007
October 2007
September 2007
August 2007
July 2007
May 2007
April 2007
March 2007
February 2007
January 2007
December 2006
November 2006
October 2006
September 2006
August 2006
July 2006
June 2006
May 2006
April 2006
March 2006
February 2006
January 2006
December 2005
November 2005
September 2005
August 2005
July 2005
June 2005
April 2005
March 2005
February 2005
January 2005
December 2004
November 2004
October 2004
September 2004
August 2004
July 2004
June 2004
May 2004
March 2004
February 2004
January 2004
November 2003
October 2003
September 2003
August 2003
June 2003
May 2003
April 2003
March 2003
January 2003
December 2002
November 2002
October 2002
September 2002
August 2002
July 2002
June 2002
May 2002
April 2002
March 2002
February 2002
January 2002
December 2001
November 2001
October 2001
September 2001
August 2001
July 2001
June 2001
May 2001
April 2001
March 2001
February 2001
December 2000
November 2000
October 2000


JiscMail is a Jisc service.

View our service policies at https://www.jiscmail.ac.uk/policyandsecurity/ and Jisc's privacy policy at https://www.jisc.ac.uk/website/privacy-notice

For help and support help@jisc.ac.uk

Secured by F-Secure Anti-Virus CataList Email List Search Powered by the LISTSERV Email List Manager