Print

Print


Folks:

Apologies, but the images I inserted into this message were stripped out 
somewhere along the line.  The version below has links instead.

Diane

-------- Original Message --------
Subject: 	[DC-RDA] Happy New Year: Three new RDA Vocabularies registered
Date: 	Thu, 1 Jan 2009 17:16:19 -0500
From: 	Diane I. Hillmann <[log in to unmask]>
Reply-To: 	List for discussion on Resource Description and Access (RDA) 
<[log in to unmask]>
To: 	[log in to unmask]



Folks:

Three new and important vocabularies from RDA have been registered:

RDA Content Type: http://metadataregistry.org/vocabulary/show/id/45.html
RDA Carrier Type: http://metadataregistry.org/vocabulary/show/id/46.html
RDA Media Type: http://metadataregistry.org/vocabulary/show/id/37.html

I think it's interesting to point out how different these vocabularies 
are, both in their structure and how they're represented in the RDA 
text.  Content type is from Chapter 6, and is presented as a relatively 
flat vocabulary (e.g., no hierarchy) and includes definitions and scope 
notes.  You can take a look at "cartographic dataset" to see how the 
simple table in the text can be translated into a much richer and more 
useful form:

http://metadataregistry.org/concept/show/id/510.html
 <http://metadataregistry.org/concept/show/id/510.html>


Simply by separating the definition from the scope note and providing 
simple thesaural relationships from those notes, we have something much 
more functional.  Interestingly, the Registry makes all those 
relationships reciprocal, so there are gaps in the vocabulary simply 
because the textual underpinning for the relationships only goes in one 
direction.  For example, looking at the image above, there is a scope 
note that explains the relationship to "cartographic image" but in the 
listing for "cartographic image" there is only a related term, with no 
explanation of why the relationship exists.  Ideally, there should be 
something there for human use (machines don't particularly care).

In contrast, the RDA Carrier Type vocabulary, located in Chapter 3, is a 
two level hierarchy, with the top concepts seemingly intended as 
gathering devices, not for application in data.  In the text, these 
terms lack definitions, and what little disambiguation we have comes 
from the organization (the gathering category) and a few footnotes, 
which are really scope notes.  Below is one of the "top concept" 
gathering terms, with its narrower relationships specified:

http://metadataregistry.org/concept/show/id/533.html


In this vocabulary, the footnotes have been converted to scope notes, 
but for now the obvious alternate terms, even those that relate to the 
scope note itself, have not been entered.  Below is the term "audio 
roll" and the scope note converted from a footnote.  The note suggests 
that an obvious alternative term for audio roll would be "piano roll."

http://metadataregistry.org/concept/show/id/538.html

The RDA Media Type vocabulary is very short and non-hierarchical, and 
correlates to the gathering categories of the carrier types (which 
suggests to me that perhaps they ought not to be all that separate).  
Like the content types, these come with definitions and (sometimes) 
scope notes:

http://metadataregistry.org/concept/show/id/475.html


All of these vocabularies are up to date as of the Oct. 31, 2008 text 
available in the constituency review documents.  I should note that as 
usual when I do wholesale vocabulary registration, I uncover a few bugs 
here and there, and you might see a few funky places where the status is 
"published" instead of "new-proposed" and the language is something 
you've never heard of--I will be going through and fixing those within 
the next couple of days so they should be corrected shortly.  I'll also 
be writing up the problem so that Jon, the put-upon registry developer, 
can at least add them to his bug list (I'll have to wrestle with them a 
bit more until they're fixed, for a number of reasons).

One of the questions I've been posing to all and sundry about all this 
is that of the usability of the vocabularies without definitions.  My 
contention is that they're better than nothing, but won't lead to the 
consistency we're looking for without definitions, scope notes, and 
alternative terms.  Karen Coyle and I were messing around with this idea 
sometime (maybe a year?) ago, and came up with an addition to the 
carrier type vocabulary that illustrated what it could look like (it's 
in the Registry Sandbox): 

http://sandbox.metadataregistry.org/concept/show/id/517.html <http://sandbox.metadataregistry.org/concept/show/id/517.html.>

I do think that looking at something fully fleshed out serves to illustrate 
nicely the paucity of the current Carrier Type vocabulary.

The other question is how these vocabularies should be enhanced and 
extended.  Shortly (we hope) the Registry team will announce some 
enhanced functionality that will add the ability for interested parties 
to discuss enhancements, at the concept or term level, in ways that can 
lead to some consensus based proposals for missing pieces or extensions.

Unfortunately the RDA text gives instructions that limit the 
other kinds of "bottom up" options to determine from usage where the 
vocabularies need to be extended.  The instructions for when no current 
term from the vocabularies are available come straight from MARC-- one 
gets to use "other" or in some cases, "unspecified."  Wouldn't it make 
more sense to allow catalogers to fill in what they think it should be, 
as free text, so that the data could be mined on a regular basis to see 
what the needs are in the community?  Even errors, where there IS a term 
but the cataloger didn't recognize that the chosen text term should have 
been something else, helps establish the kinds of alternative labels 
that help the next person down the road.  "Other" and "Unspecified" are 
dead ends and should be unceremoniously rejected (they will not--repeat 
NOT--be added to the vocabularies while I have breath).

Well, enough for this New Year's Day.  I look forward to some comments 
and suggestions ...

Regards,
Diane

*********************************
Diane I. Hillmann
Director of Metadata Initiatives
Information Institute of Syracuse
Partner, Metadata Management Associates
Website: http://managemetadata.org
Email: [log in to unmask]
Voice: (607) 387-9207
Fax: (607) 387-4867
Skype: dihillmann
*********************************