JiscMail Logo
Email discussion lists for the UK Education and Research communities

Help for DC-ARCHITECTURE Archives


DC-ARCHITECTURE Archives

DC-ARCHITECTURE Archives


DC-ARCHITECTURE@JISCMAIL.AC.UK


View:

Message:

[

First

|

Previous

|

Next

|

Last

]

By Topic:

[

First

|

Previous

|

Next

|

Last

]

By Author:

[

First

|

Previous

|

Next

|

Last

]

Font:

Proportional Font

LISTSERV Archives

LISTSERV Archives

DC-ARCHITECTURE Home

DC-ARCHITECTURE Home

DC-ARCHITECTURE  May 2011

DC-ARCHITECTURE May 2011

Options

Subscribe or Unsubscribe

Subscribe or Unsubscribe

Log In

Log In

Get Password

Get Password

Subject:

What does "Dublin Core application profile" mean today?

From:

Thomas Baker <[log in to unmask]>

Reply-To:

DCMI Architecture Forum <[log in to unmask]>

Date:

Tue, 17 May 2011 16:58:06 -0400

Content-Type:

text/plain

Parts/Attachments:

Parts/Attachments

text/plain (221 lines)

----------------------------------------------------------------------
NOTE: This text has also been posted to the new DCMI wiki as a proposed
answer to a Frequently Asked Question, 
http://wiki.dublincore.org/index.php/FAQ/Application_Profile.  It is
posted here to stimulate discussion of its interpretation and of its 
conclusions.
----------------------------------------------------------------------

What does "Dublin Core application profile" mean today?

The notion of an "application profile" was first introduced at a Dublin Core
workshop in October 2000.  The basic idea was that maintenance organizations
would "declare" element sets, while implementers would merely "use" the
elements, so declared, in their metadata.  Elements could be drawn from
multiple vocabularies, "mixing-and-matching" as required, and constrained,
encoded, or interpreted to meet application-specific needs [1].

This very general conceptualization of application profiles soon came to be
interpreted in a variety of incompatible ways, with the result that many
"Dublin Core" implementations could not interoperate despite sharing a common
vocabulary.  It was recognized, early on, that making application profiles
interoperable among themselves would require a common modeling basis, and that
machine-processable expressions of profiles "based on data models such as RDF"
could provide a solid basis for such interoperability [2].  In 2003, work began
on a DCMI Abstract Model [15], leading eventually to a re-definition of the
notion of "Dublin Core application profile" on the basis of a formal model.

=== Application profiles based on the DCMI Abstract Model ===

The DCMI Abstract Model, first published as a DCMI Recommendation in 2005 and
revised in 2007, formalized the notion of a Description Set.  A companion
specification of 2008, "Description Set Profile: a constraint language for
Dublin Core Application Profiles" [3,4], described how the general abstract
model of a Description Set could be "profiled" with the set of constraints
required for a specific application.  

A Description Set Profile groups application-specific sets of structural
constraints into "templates" for Descriptions and Statements, the main
components of a Description Set.  The constraints specify how the "slots" of a
template are to be filled with URIs or text strings in a metadata record based
on the template.  For example, a Property Constraint in the template for a Book
might specify that the topic of a book be described with a statement using
dcterms:subject, and a Vocabulary Encoding Scheme List Constraint might specify
that the object of that statement, a URI, be contextualized with a Vocabulary
Encoding Scheme URI for the Library of Congress Subject Headings.  The
specification aimed at providing an initial set of simple constraint types of
use for the structural validation of metadata records with the expectation that
the set might be extended with additional constraint types in response to user
feedback.

=== The Singapore Framework for Dublin Core Application Profiles ===

The Description Set Profile construct formed the centerpiece of the "Singapore
Framework for Dublin Core Application Profiles" of 2007 [5].  The Singapore
Framework conceptualized application profiles as sets of specifications
documenting, however simply, the decisions underpinning a particular metadata
design -- specifications documenting how Functional Requirements informed a
Domain Model of entities to be described, as the basis for a Description Set
Profile detailing vocabularies used and with what constraints, in turn
providing the basis for constructing concrete metadata formats.  

An Application Profile, under this model, was seen not just as a schema, but as
a packet of specifications and usage guidelines documenting the metadata model
of an application in a form maximally re-usable by other implementors.  More
generally, the Singapore Framework was designed to illustrate how a specific
metadata model could be grounded in "domain standards" -- Community Domain
Models, Metadata Vocabularies, and the DCAM with its related syntax guidelines
-- and how the domain standards were themselves grounded in the "foundation
standards" of RDF.

A primer for creating application profiles on this basis, "Guidelines for
Dublin Core Application Profiles", was published in 2009 [6], and the
DCMI Usage Board developed and tested criteria for evaluating the conformance
of application profiles to the guidelines and principles articulated in the
Singapore Framework [7].

The Singapore Framework approach to application profiles was intended to guide
the design of RDF-compatible instance metadata -- records based on
application-specific formats, using RDF vocabularies, templated according to
the Description Set Profile constraint language, and expressed in concrete
syntaxes with a defined relationship to the DCAM (such as HTML, XML, and
RDF/XML).  DCAM's basis in RDF was intended to ensure that the contents of
DCAM-based instance metadata would be straightforwardly expressible as RDF
triples and thus as Linked Data.  

More fundamentally, the Singapore Framework approach to application profiles
was intended to bridge the gap between traditional data-management approaches,
grounded in metadata records with defined document structures, and new Semantic
Web approaches, based on the notion of theoretically boundless data graphs.

In order to do this, the authors of DCAM defined constructs not provided by RDF
itself -- specifically, the notion of a bounded Description Set (roughly
analogous to the RDF notion, ambiguously defined at the time, of a Named
Graph).  DCAM also constrained the infinite possibilities of an RDF graph to a
specific set of patterns intended to capture the Dublin Core community's
"style" of metadata, with its particular use of URIs, text labels, Syntax
Encoding Schemes (datatypes), and Vocabulary Encoding Schemes.  In the text
"Interoperability Levels for Dublin Core Metadata" of 2008, shared application
profiles were seen as the highest level of interoperability in a four-tiered
hierarchy ranging from the human but non-machine-actionable interoperability of
shared natural-language definitions, to interoperability on the basis of formal
semantics (RDF), and culminating in interoperability on the basis of the
Singapore Framework: shared natural-language definitions, a shared
formal-semantic basis, and a shared model for metadata records using the same
terms with the same constraints [8].

=== The future of Dublin Core application profiles ===

That DCMI is aware, a purely DCAM-based approach to application profiles was
never widely implemented, though it has been widely cited, used as a basis for
some software development [17], and has served as a source of inspiration to
metadata designers in its broad outline if not in its details.  Notable
implementations included the Dublin Core Collections Application Profile [9]
and the Scholarly Works Application Profile (SWAP) [10].  In the absence of
tool support for authoring a Description Set Profile and for generating schemas
usable for validation, the DCAM-based approach remained on a largely analytical
plane.

In the meantime, alternative approaches to expressing structural constraints
for the purpose of validating RDF-based metadata have been explored within the
Dublin Core and Semantic Web communities.  In 2007, for example, Alistair Miles
proposed "Son of Dublin Core" -- an approach for encoding and validating
"graph-based metadata" using a concrete XML syntax together with language for
expressing application-specific syntax constraints over a metadata graph [12].
The company Clark & Parsia has developed an "integrity constraint validator",
Pellet, which treats OWL "as a schema or validation language for RDF data via
auto-generated SPARQL queries that can be executed on any SPARQL-enabled RDF
store" by interpreting OWL ontologies with closed-world assumptions in order to
detect constraint violations [13].  Dan Brickley advocates a more pluralistic
and eclectic approach to application profiles encompassing a wide range of
human-readable and machine-processable methods for documenting metadata
patterns, from simple lists of namespaces used to Web documents, metadata
exemplars, and example queries [14].

[Disclaimer: the following paragraphs are under discussion by DCMI.]

As of 2011, the DCMI Abstract Model retains the status of a DCMI
Recommendation, the Description Set Profile specification continues to carry
the status of DCMI Working Draft, and the Singapore Framework itself has the
status of a DCMI Recommended Resource.  In all cases, their authors have moved
on to other projects, and active development of the documents has ceased.
Regaining momentum on the DCAM-based approach to application profiles would
require more resources -- authors, editors, and implementors with clear
requirements -- than DCMI can currently bring to bear.  

For now, DCMI has chosen to leave the specifications in place, with their
current statuses, both for their value as historical contributions and
potentially as sources of requirements for future projects.  However, DCMI
recognizes the emergence of validation and quality control as a critical
problem for producers of Linked Data.  

The DCAM-based approach to application profiles as outlined in the Singapore
Framework was an attempt to elucidate the cycle of development from functional
requirements through the development of a domain model, the description of
entities in that domain model following defined constraints, and the generation
of metadata record formats on their basis -- all in a form compatible with RDF.
Even in the absence of a formal notion of Description Set Profiles, key aspects
of the framework arguably retain their validity as general principles of good
metadata design.  For example, the "declaration" of metadata terms in
vocabularies versus their "re-use" in application profiles remains a useful
distinction.  The idea of non-RDF formats with defined mappings to RDF remains
important in the context of publishing Linked Data.  Basing domain models and
descriptive models on functional requirements remains a solid design principle.
A generic vocabulary for expressing constraints remains a worthy goal, for
example as a basis for automating the generation of metadata input forms and
validators.

More generally, the idea of expressing application-specific constraints in
syntactic terms -- as components that can be validated in order to ensure the
quality of instance metadata -- presents a valid and important alternative to
the "semantic" style of constraint modeling used in OWL ontologies.  OWL
ontologies describe an ideal universe of related entities "in the world".  In
accordance with the "open world assumption", the semantic constraints of an
ontology refine the description of that ideal world as a license for inferring
additional knowledge on the basis of knowledge at hand.

These syntactic and semantic notions of "constraints" have significantly
different uses.  For example, if an application profile says that the
description of a book has only one subject heading -- a URI -- and a
description with two subject headings is encountered, a validator will report
an error in the record.  If an OWL ontology says that a book has only one
subject heading, and a description with two subject headings is encountered, an
OWL reasoner will infer that the two subject-heading URIs identify the same
subject [16].  The application profile approach is good for flagging data that
is inconsistent with the structure of a metadata record as a document, while
OWL ontologies are good for flagging logical inconsistencies with respect to an
ideal construct.

With the growing momentum of Linked Data, legacy record formats are being
adapted for use in Linked Data -- sometimes by translating their constraints
into OWL, and this is sometimes based on a misunderstanding of the nature of
OWL.  OWL ontologies define constraints by refining the conceptualization of a
world, whereas application profiles merely refine the contents of a record.
OWL ontologies, in other words, introduce semantic constraints that will be
inherited by users of the ontology downstream, while application profiles
merely control the consistency of data in a syntactic sense, leaving semantics
untouched.  If quality of control of metadata records at the moment of their
creation, or at the moment of ingest, remains on ongoing concern, metadata
designers may wish to revisit some of the Singapore Framework's ideas in future
standardization efforts.

References

[1] http://www.ariadne.ac.uk/issue25/app-profiles/
[2] http://dublincore.org/usage/documents/profiles/
[3] Glossary/Description_Set_Profile
[4] http://dublincore.org/documents/dc-dsp/
[5] http://dublincore.org/documents/singapore-framework/
[6] http://dublincore.org/documents/profile-guidelines/
[7] http://dublincore.org/documents/profile-review-criteria/
[8] http://dublincore.org/documents/interoperability-levels/
[9] http://dublincore.org/groups/collections/collection-application-profile/2007-03-09
[10] http://www.ukoln.ac.uk/repositories/digirep/index/Scholarly_Works_Application_Profile
[11] http://dublincore.org/usage/reviews/2009/swap/
[12] http://aliman.googlecode.com/svn/trunk/sodc/SoDC-0.2/index.html
[13] http://clarkparsia.com/pellet/icv/
[14] http://lists.w3.org/Archives/Public/public-lld/2010Nov/0114.html
[15] http://wiki.dublincore.org/index.php/Glossary/DCMI_Abstract_Model
[16] http://www.amberdown.net/2009/09/faq-using-rdfs-or-owl-as-a-schema-language-for-validating-rdf/
[17] http://portal.acm.org/citation.cfm?id=1875719

Top of Message | Previous Page | Permalink

JiscMail Tools


RSS Feeds and Sharing


Advanced Options


Archives

February 2024
January 2024
October 2023
September 2023
August 2023
July 2023
June 2023
May 2023
April 2023
March 2023
February 2023
January 2023
December 2022
November 2022
September 2022
August 2022
June 2022
May 2022
April 2022
March 2022
February 2022
January 2022
November 2021
October 2021
September 2021
August 2021
July 2021
May 2021
April 2021
March 2021
February 2021
January 2021
December 2020
November 2020
September 2020
August 2020
July 2020
June 2020
April 2020
March 2020
February 2020
January 2020
December 2019
November 2019
September 2019
August 2019
July 2019
June 2019
May 2019
April 2019
March 2019
February 2019
January 2019
December 2018
November 2018
October 2018
September 2018
August 2018
July 2018
June 2018
May 2018
April 2018
March 2018
February 2018
January 2018
December 2017
November 2017
September 2017
August 2017
July 2017
June 2017
May 2017
April 2017
March 2017
February 2017
January 2017
December 2016
November 2016
October 2016
September 2016
August 2016
July 2016
June 2016
May 2016
April 2016
March 2016
February 2016
January 2016
December 2015
November 2015
October 2015
September 2015
August 2015
July 2015
June 2015
May 2015
April 2015
March 2015
February 2015
January 2015
December 2014
November 2014
October 2014
September 2014
August 2014
July 2014
June 2014
May 2014
April 2014
March 2014
February 2014
January 2014
November 2013
October 2013
September 2013
August 2013
July 2013
June 2013
May 2013
April 2013
March 2013
February 2013
January 2013
December 2012
November 2012
October 2012
September 2012
August 2012
July 2012
June 2012
May 2012
April 2012
March 2012
February 2012
January 2012
December 2011
November 2011
October 2011
September 2011
August 2011
July 2011
June 2011
May 2011
April 2011
March 2011
February 2011
January 2011
December 2010
November 2010
October 2010
September 2010
August 2010
July 2010
June 2010
May 2010
April 2010
March 2010
February 2010
January 2010
December 2009
November 2009
October 2009
September 2009
August 2009
July 2009
June 2009
May 2009
April 2009
March 2009
February 2009
January 2009
November 2008
October 2008
September 2008
August 2008
July 2008
June 2008
May 2008
April 2008
February 2008
January 2008
December 2007
November 2007
October 2007
September 2007
August 2007
July 2007
May 2007
April 2007
March 2007
February 2007
January 2007
December 2006
November 2006
October 2006
September 2006
August 2006
July 2006
June 2006
May 2006
April 2006
March 2006
February 2006
January 2006
December 2005
November 2005
September 2005
August 2005
July 2005
June 2005
April 2005
March 2005
February 2005
January 2005
December 2004
November 2004
October 2004
September 2004
August 2004
July 2004
June 2004
May 2004
March 2004
February 2004
January 2004
November 2003
October 2003
September 2003
August 2003
June 2003
May 2003
April 2003
March 2003
January 2003
December 2002
November 2002
October 2002
September 2002
August 2002
July 2002
June 2002
May 2002
April 2002
March 2002
February 2002
January 2002
December 2001
November 2001
October 2001
September 2001
August 2001
July 2001
June 2001
May 2001
April 2001
March 2001
February 2001
December 2000
November 2000
October 2000


JiscMail is a Jisc service.

View our service policies at https://www.jiscmail.ac.uk/policyandsecurity/ and Jisc's privacy policy at https://www.jisc.ac.uk/website/privacy-notice

For help and support help@jisc.ac.uk

Secured by F-Secure Anti-Virus CataList Email List Search Powered by the LISTSERV Email List Manager