Print

Print


Please remove me from this list.
Robin T. Rutledge

Price, Jerry wrote:
> -----Original Message-----
> From: General DCMI discussion list [mailto:[log in to unmask]] On
> Behalf Of Karen Coyle
> Sent: Monday, August 18, 2008 1:59 PM
> To: [log in to unmask]
> Subject: [Fwd: Re: non-literal values; qualified or simple DC]
>
> Sorry, I didn't send this to the list. Got to remember to "reply all."
>
> kc
>
> -------- Original Message --------
> Subject: Re: non-literal values; qualified or simple DC
> Date: Mon, 18 Aug 2008 07:37:10 -0700
> From: Karen Coyle <[log in to unmask]>
> To: Hammond, Tony <[log in to unmask]>
> References: <[log in to unmask]>
>
> Tony, I don't think that the issue is that URIs are hard for humans to
> read -- actually, humans shouldn't have to read them. I think the issue
> is that until we have some standard resolution for our URIs there isn't
> much we can do with them. A URI on its own is just a dumb identifier.
> It's what it identifies that we want to work with. How useful would an
> ISBN be if you couldn't plug it into Amazon or a library catalog and get
> the book? So it's what we can DO with URIs that matters, and right now
> if I get a URI for a vocabulary term ... well, what do I do?
>
> It's that part that is missing. And that's why I think we have been slow
> to adopt URIs. Maintaining a dependable, usable resolution service is a
> big job, and we will want a strong standard that makes using URIs
> automatic and easy.
>
> kc
>
> Hammond, Tony wrote:
>   
>> I must agree with Karen:
>>
>>     
>>> URIs are for machines. Taking care of the humans will require more
>>>       
> than
>   
>>> a URI. We need to define what that "more" is and how we will make it
>>>       
> work.
>   
>> URIs have many issues associated with them, but one that many will
>>     
> agree on
>   
>> is that they are unwieldy.
>>
>> One of the outstanding successes of URIs is their capacity for
>> "transcription" [1]. The fact that you can paint them on the side of a
>>     
> bus
>   
>> and someone will remember that (or jot it down) and then look it up
>>     
> later on
>   
>> the Web. But this is also a key weakness of URIs as identifiers
>>     
> because they
>   
>> can be just too darn long.
>>
>> Leaving aside the scheme prefix (because, well let's just say because
>>     
> ;), we
>   
>> have at minimum the DNS name/IP address which must be added as a
>>     
> network
>   
>> name authority to the path component which provides the namespace and
>>     
> may
>   
>> (or may not) include the subject term depending on whether fragment
>>     
> notation
>   
>> is being used. Gets messy, quickly:
>>
>>     example.org/term
>>     example.org/name1/term
>>     example.org/name1/name2/term
>>     ...
>>
>> Marketing with branded URIs may alleviate this somewhat by jettisoning
>> namespaces and/or folding subject terms into authority components,
>>     
> e.g.
>   
>>     www.mammamiamovie.com/
>>
>> But in reality many URIs tend to be longer and more "difficult" (see
>>     
> e.g.
>   
>> the referenced URIs below). And BTW have you seen many RDF graphs
>>     
> (which
>   
>> generally depict URIs in absolute form) that are really fit for
>>     
> viewing?
>   
>> I am curious why the CURIE [2] effort to abbreviate URIs by
>>     
> introducing
>   
>> compact URI forms is taking so long to be realized. This may help out
>>     
> some
>   
>> in taming URIs for humans where prefixes are used to provide a simple
>> mnemonic mechanism. Indeed in XML namespaces a similar convention for
>> abbreviating URIs is used, with a number of well known namespace names
>> emerging, e.g. "dc:", "xsd:", "rdf:".
>>
>> Think the humans should get a look-in.
>>
>> Tony
>>
>> [1] http://www.apps.ietf.org/rfc/rfc3986.html#sec-1.2.1
>> [2] http://www.w3.org/TR/2007/WD-curie-20070307/
>>
>>
>>
>> On 18/8/08 14:29, "Karen Coyle" <[log in to unmask]> wrote:
>>
>>     
>>> Andy, taking off from your blog post, where you say:
>>>
>>> "Because if the topic is treated as a non-literal value then it can
>>>       
> be
>   
>>> assigned a URI and can become the subject of other descriptions.  If
>>>       
> the
>   
>>> topic is treated as a literal value then it becomes a descriptive
>>> cul-de-sac - no further description of the topic is possible."
>>>
>>> I agree with this, but it kind of begs the question of assigning
>>>       
> those
>   
>>> URIs and communicating about them, which I think is one of the
>>>       
> problems
>   
>>> that we face today. It comes down to that fact that just assigning a
>>>       
> URI
>   
>>> isn't enough to make the Web of linked data work. There needs to be a
>>> good way for people and programs to discover those URIs and
>>>       
> understand
>   
>>> what they represent. To me this is the primary barrier. If I am
>>>       
> creating
>   
>>> a catalog for my users and I harvest records that have URIs in the
>>> subject fields, I need to easily be able to know 1) what I can
>>>       
> display
>   
>>> to my human users so that they can understand what the subject is 2)
>>> what to index so my human users can do a search and retrieve what
>>>       
> they
>   
>>> are looking for.
>>>
>>> URIs are for machines. Taking care of the humans will require more
>>>       
> than
>   
>>> a URI. We need to define what that "more" is and how we will make it
>>>       
> work.
>   
>>> kc
>>>
>>> Andy Powell wrote:
>>>       
>>>> For the record (1)... I think the use of 'coterie' is somewhat
>>>>         
> unfair
>   
>>>> (at least in the sense of a 'clique'), since I don't see anyone
>>>>         
> trying
>   
>>>> to be exclusive - quite the opposite in fact... but I take the point
>>>> about not many people in DCMI being well placed to answer the
>>>>         
> question -
>   
>>>> which is a serious issue for DCMI I would suggest.
>>>>
>>>> For the record (2)... I no longer consider myself part of any such
>>>> 'coterie' but I've had a go at answering anyway.  See
>>>>
>>>>
>>>>         
> http://efoundations.typepad.com/efoundations/2008/08/the-importance.html
>   
>>>> Fundamentally, the importance of non-literals vs. literals lies in
>>>>         
> our
>   
>>>> ability to build a Web of *linked data* (to use the currently
>>>>         
> accepted
>   
>>>> term) based on assigning 'http' URIs to things that are useful to us
>>>>         
> in
>   
>>>> describing the world (concepts, places, people and the like),
>>>>         
> something
>   
>>>> that I think the library community have been very slow to recognise,
>>>> despite the fact that they sit on a vast array of data, knowledge
>>>>         
> and
>   
>>>> expertise that would be highly valuable in this space.  This is a
>>>>         
> great
>   
>>>> shame IMHO.
>>>>
>>>> Andy
>>>> --
>>>> Head of Development, Eduserv Foundation
>>>> http://www.eduserv.org.uk/foundation/
>>>> http://efoundations.typepad.com/
>>>> [log in to unmask]
>>>> +44 (0)1225 474319
>>>>
>>>>         
>>>>> -----Original Message-----
>>>>> From: General DCMI discussion list
>>>>> [mailto:[log in to unmask]] On Behalf Of Karen Coyle
>>>>> Sent: 15 August 2008 17:51
>>>>> To: [log in to unmask]
>>>>> Subject: Re: non-literal values; qualified or simple DC
>>>>>
>>>>> Karen,
>>>>>
>>>>> I suspect the members of the small coterie that could explain
>>>>> this are all on vacation at this time. I am not in that
>>>>> group, but I will attempt an explanation anyway ;-)
>>>>>
>>>>> Reading the dcterm:subject entry, I can see that there is an
>>>>> expectation that the term will be part of a context -- the
>>>>> context may be an authoritative list that it must come from,
>>>>> or it could be a combination of a list and rules, such as one
>>>>> gets with LCSH, LCC or Dewey. Any term that gets a value from
>>>>> a context like that is considered non-literal in the DCAM
>>>>> sense because the context needs to be included in the formal
>>>>> description of the term. This is somewhat like the 6XX fields
>>>>> in MARC where the indicator (or $2) tells you which
>>>>> vocabulary the subject heading belongs to.
>>>>>
>>>>> That said, this definition seems to exclude the possibility
>>>>> of uncontrolled subject terms, which you mention. Leaving
>>>>> aside the DCAM (which is often puzzling), it seems to me that
>>>>> you need a way to indicate 1) whether or not the values in
>>>>> the subject field are controlled and 2) if they are
>>>>> controlled, what list they come from. I don't think that
>>>>> DCTERMS alone provides this capability, although you could
>>>>> possibly create it by using these value "patterns":
>>>>>
>>>>> 1) a character string alone. This would represent an
>>>>> uncontrolled subject term or terms.
>>>>> 2) A URI for the subject term. This is only an option if the
>>>>> term itself has a URI. I can imagine URIs for things like LCC
>>>>> or Dewey looking something like:
>>>>>     http://www.oclc.org/dewey/ddc22/973.13
>>>>> 3) A URI for the subject *system* plus a string for the
>>>>> subject heading or term.
>>>>>     http://www.nlm.nih.gov/mesh/  "Dipeptidyl-Peptidase IV
>>>>>           
> Inhibitors"
>   
>>>>> I'd love to hear other takes on this because I think it
>>>>> probably has been amply discussed in the DC development process.
>>>>>
>>>>> kc
>>>>>
>>>>>
>>>>> Karen Arcamonte wrote:
>>>>>           
>>>>>> I'm currently involved in the selection of standard fields for a
>>>>>> metadata project and we have some fields that we are calling
>>>>>>             
> Dublin
>   
>>>>>> Core fields (Subject and Relation fields), but we are
>>>>>>             
>>>>> including free 
>>>>>           
>>>>>> text or uncontrolled terms. I notice that the DC Subject
>>>>>>             
>>>>> and Relation 
>>>>>           
>>>>>> fields are "intended to be used with a non-literal value." I'm not
>>>>>> sure what this means. Is there anyone that can explain in simple
>>>>>> terms? I've looked at the DCMI Abstract Model and I'm still
>>>>>>             
>>>>> not sure what they mean by "non-literal"
>>>>>           
>>>>>> value. Also, can you say you are using Qualified Dublin
>>>>>>             
>>>>> Core for some 
>>>>>           
>>>>>> fields and Simple Dublin Core for other fields in an
>>>>>>             
>>>>> application profile?
>>>>> --
>>>>> -----------------------------------
>>>>> Karen Coyle / Digital Library Consultant [log in to unmask]
>>>>> http://www.kcoyle.net
>>>>> ph.: 510-540-7596   skype: kcoylenet
>>>>> fx.: 510-848-3913
>>>>> mo.: 510-435-8234
>>>>> ------------------------------------
>>>>>
>>>>>           
>>
>>     
> ************************************************************************
> ********   
>   
>> DISCLAIMER: This e-mail is confidential and should not be used by
>>     
> anyone who is
>   
>> not the original intended recipient. If you have received this e-mail
>>     
> in error
>   
>> please inform the sender and delete it from your mailbox or any other
>>     
> storage
>   
>> mechanism. Neither Macmillan Publishers Limited nor any of its agents
>>     
> accept
>   
>> liability for any statements made which are clearly the sender's own
>>     
> and not
>   
>> expressly made on behalf of Macmillan Publishers Limited or one of its
>>     
> agents.
>   
>> Please note that neither Macmillan Publishers Limited nor any of its
>>     
> agents
>   
>> accept any responsibility for viruses that may be contained in this
>>     
> e-mail or
>   
>> its attachments and it is your responsibility to scan the e-mail and 
>> attachments (if any). No contracts may be concluded on behalf of
>>     
> Macmillan 
>   
>> Publishers Limited or its agents by means of e-mail communication.
>>     
> Macmillan 
>   
>> Publishers Limited Registered in England and Wales with registered
>>     
> number 785998 
>   
>> Registered Office Brunel Road, Houndmills, Basingstoke RG21 6XS   
>>
>>     
> ************************************************************************
> ********
>   
>>
>>     
>
>   

-- 
Robin T. Rutledge, CEO
MediaRecall Holdings, LLC
USA HQ/Digital Studios
3363 Commercial Ave.
Northbrook, Illinois, 60062
847.513.6810 Office
847.881.0703 F
312.622.4777 M
[log in to unmask]
http://www.mediarecall.com