Print

Print


Jordan's email of yesterday on this topic once again proves
the theory of simultaneous invention, plus, of course, that
great minds think alike! I hope there is time in Helsinki to
address this matter. 

I realize and appreciate all the thought and work that has 
gone into the ResourceType draft, and it seems to me that we still need to work 
on this more. (Yes, I'm willing to help, not just stand off in a corner and make 
rude remarks :-)

I. The September 23, 1997, "Dublin Core Metadata Element Set: 
Reference Description" defines Resource Type as, "The category of the resource, 
such as home page, novel, poem, working paper, technical report, essay, 
dictionary." It refers the reader to 
http://sunsite.berkeley.edu/Metadata/types.html for the draft.
The "Dublin Core Resource Types : Structuralist Draft, July 24, 1997"
(printed out here September 25, 1997) states in the first full paragraph,
second line, that, "This element is to describe the genre of the item being 
described,"  and lists the top-level Resource Types as: text; image; sound; 
software; data; and interactive.
"Genre" is often taken to mean referring to the intellectual content of the 
item, as contrasted to its physical form. For example, "novel" is a genre term 
in that a novel can appear  printed on a piece of paper, on microform, or in 
digital form on someone's computer screen - it is still a novel, and only the 
physical form (the carrier of the information) has changed.  
***Is this what it is intended to mean here?*** This key point needs to be 
decided before we proceed.  

II. I have held off on sending this email because thesaurus-building is not my 
specialty; I just did not feel comfortable about the list, and having just gone 
through a substantial amount of committee work in building a list of form/genre 
terms for geospatial information, I figured that I knew only enough to hunt up 
someone who specializes in this work before I said anything. I have the good 
fortune to work with a person - Linda Hill ([log in to unmask])
- who has worked with indexes, abstracts, and thesauri for some years.  After 
looking at the list of Resource Types in the Structuralist Draft, she has 
contributed the following points, plus an appendix that is explained later in 
this section.

If we are going to develop a short list of terms that will assist persons, it 
would be a good idea to build it following the thesaurus guidelines available
from ANSI/NISO and ISO. Some of the following specific principles can be
found there.
a. Resource Type terms should be nouns, not adjectives; e.g., "interactive" 
would need to be changed to "interactive items" or something like that.
b. Terms should stand on their own and be understandable when they are
used as descriptors - that is, their meaning should not be dependent on
seeing them within the hierarchy of the scheme. Therefore, a term like
"executable" under "software" should instead be "executable software".
c. The choice of singular vs. plural forms of the terms is debatable and
either could be used. However, the ANSI/NISO Z39.19 standard calls for
"count nouns" to be expressed as plurals ("names of objects or concepts
that are subject to the question 'how many?' but not 'how much?'). Also,
if the primary purpose of the Resource Type terms is for searching, seachers
are more likely to think in terms of plurals rather then singular. It is 
generally only at the cataloging time when there is one object to be
described that the singular term is the most likely form to use.
d. There are shallow and deep hierarchies, and arguments on both sides as to 
which is better. The limited number of top-level nodes and an orderly grouping 
under them are more effective tools for use by the general
public. 
e. The use of cross references (1) from nearly synonymous terms for the
same sort of thing and (2) between related terms in the thesaurus of terms
is essential for a useable tool.
f. The relationship between a broad term and a narrow term should be kept
strictly an "is a" relationship - that is, a genus/species relationship. It
should always be true that a term "is" also truely its broader term - so that
if the broader term were also apply as a descriptor to an item, it would
also be true.
g. It is difficult to keep the Resource Type list free of the closely
related set of terms that describe how an item IS AVAILABLE. A "homepage" is a 
method of presentation and does not belong in this list. What the Web page has 
on it - what it IS - is what would be described by this set of terms.
The cataloging world has found that keeping genre and carrier terms in separate 
lists is extremely difficult, and has in effect shoved them together in a USMARC 
field (REBECCA - have I correctly presented this?). If we decide to have both 
genre and carrier terms in one list, fine; but we need to define ResourceType to 
cover both.
h. The notion of creating stand-alone terms by concatenating the hierarchy
(e.g., "image.interactive") is not a good idea because (1) the resulting
terms are not familiar and (2) it forces users to know the hierarchical
structure of the terminology.
i. With a hierarchical structure of terms, it is not necessary to list
every possible genre form because: (1) if a specific term is not listed for
an item, a term that is higher in the tree can be used instead; and 
(2) thesauri are expandable - more terms can be added for local purposes or
the 'main' authoritative set can be expanded through time, as long as the
basic structure accommodates all anticipated categories of terms.

What follows in an appendix is the hierarchical listing of a draft Resource Type
thesaurus Linda Hill has been working on. It is not complete; there is enough of 
it to give a general outline. This listing shows only the valid terms and their 
hierarchical relationships - not the invalid (synonymous terms or the Related 
Term links). Linda would be happy to put the draft thesaurus up as an html 
document so we DC persons can explore all of the terms. If this approach to 
structure is accepted, the details of the terms and relationships can be the 
focus of discussion.

III. As I noted above, what led me to talk with Linda Hill about this matter 
was that I could tell that a great deal of thought and hard work had gone into 
the "Structuralist Draft," and yet from the point of view of spatial data and my 
years of experience of cataloging, there were points that did not seem to 
follow. Some specific examples follow:
- Almost everything is "data" at some level, so while I see what
 is being aimed toward, this is not a good category.
- The list of "text" items does not all seem to be at the same level, nor does 
it list all possibilities at a level. If one has "dictionaries," then one must 
also have "encyclopedias," and so on. 
- In compiling a list like this, it would be 
extremely helpful to have a printout of the Library of Congress's list of what 
are called free-floating subdivisions for subject headings (REBECCA, please 
correct me if I have that name wrong) - it's a list of subdivisions (e.g., 
"Maps,"  and so on) that may be used with almost any thematic subject (e.g., 
"Geology", "Housing", etc.). My geospatial-data form/genre group used this,
and found it saved us time and assisted us in constructing a logical structure.
- Strictly from the viewpoint of spatial data - this list creates 2 possible 
places to put spatial data, since large amounts of it is remote-sensing images 
(e.g., aerial photographs) - yet "spatial data" is listed in only one location.



Mary Larsgaard
Alexandria Digital Library/Davidson Library
University of California
Santa Barbara

------
APPENDIX

RESOURCE TYPE TERMS - DRAFT THESAURUS FROM ALEXANDRIA DIGITAL LIBRARY (L. Hill)

cartographic works
 . atlases
 . maps

collections
 . archives
 . catalogs

datasets
 . survey data

events

images
 . art works
 . . art originals
 . . art prints
 . . art reproductions
 . graphic materials
 . motion pictures
 . photographs
 . . aerial photographs
 . . photographs from space
 . remote-sensing images
 . . aerial photographs

individuals
 . experts

multimedia packages

musical works

organizations

projects

recordings
 . sound recordings
 . video recordings

services
 . interactive services
 . . chat services
 . . discussion lists
 . . games
 . . news groups
 . reference services
 . . cataloging rules
 . . classification systems
 . . dictionaries
 . . directories
 . . gazetteers (place names dictionaries)
 . . metadata standards
 . . subject heading lists
 . . thesauri

software packages

textual works
 . abstracts
 . articles
 . bibliographies
 . biographies
 . . autobiographies
 . conference papers
 . correspondence
 . fictional works
 . . dramatic works
 . . poems
 . monographs
 . . monographic series
 . proceedings
 . reports
 . serial works
 . . journals
 . . magazines
 . . newsletters
 . . newspapers
 . theses