JISCMail - WEBSITE-INFO-MGT Archives

Email discussion lists for the UK Education and Research communities

Subscriber's Corner

Email Lists

WEBSITE-INFO-MGT Archives

WEBSITE-INFO-MGT@JISCMAIL.AC.UK

View:

Message:

[

First

Last

]

By Topic:

[

First

Last

]

By Author:

[

First

Last

]

Font:

Proportional Font

		LISTSERV Archives
		WEBSITE-INFO-MGT Home
		WEBSITE-INFO-MGT 2003

Options

Subscribe or Unsubscribe

Get Password

Subject:

Re: Semantic Web and UK HEIs (was RE: New LSE website launched 23rd June)

From:

"Emmott,Stephen" <[log in to unmask]>

Reply-To:

Emmott,Stephen

Date:

Fri, 4 Jul 2003 13:04:22 +0100

Content-Type:

text/plain

Parts/Attachments:

text/plain (145 lines)

Brian,

[Apologies for the delay in reply: post launch issues...]

Yes. Behind the 'Semantic Web' lies the development of schemas which are
flexible enough to accommodate the breadth of content making up the WWW
(now and future) - ideally.

The example you use demonstrates content* (see footnote below) 'within'
a schema: the content is fitted to the schema by inserting the
appropriate markup thereby breaking it up into its constituent parts
('fields' in your example). This transforms the content so that it sits
somewhere between unstructured and structured.

'The metadata is the data' means that either the metadata can be
reliably extracted from the structured data as and when it is required
or that there is enough structure to obviate the need for metadata. The
issue is that metadata is data about data i.e., it represents data but
is an entity in and of itself regardless of where it is located (i.e.,
within or between resources). Moreover, an ideal is for metadata to be
unnecessary as the data will be sufficiently structured to support
inference i.e., a description will be unnecessary as examination of the
data will be sufficient. [If you haven't noticed, a lot of this is
actually AI]

It is probably uncontroversial to predict that only a subset of the
WWW's content will be structured in the way you describe and most
probably this will concern content which is already structured (that
which is currently in databases, etc.): arguably structured in nature
anyway e.g., a personnel record. However, the majority of the WWW's
content is textual/ visual e.g., an illustrated poem or perhaps a
strategic plan. Here, applying structure requires commitment in terms of
the intended purpose/ use and this is a real can of worms: ambiguity,
multiplicity of schemas, absence of consensus, etc. The primary concern
is the reader (people not computers), and visual schemas will dominate.
Communication between people is absolutely key, whether immediate
(voice, etc.) or latent (text, etc.). Getting metadata, let alone
structuring the data itself, is fruitless without sufficient command and
control (either forced through the management of people or the tools
used) - getting people to produce metadata voluntarily simply does not
work, at least not without intrinsic reward...

So the Semantic Web needs to accommodate both structured content as well
as unstructured content with associated metadata as well as structured
content without metadata. Fundamental to success is the way in which
content is authored/ created. If the tools used are not enforcing the
schemas (for content or metadata), it will fail - or to put it another
way, partially succeed... i.e., semantic webs. Where does this leave the
unstructured content without metadata? More to the point, where does
this leave users?

As for the 'magic', this is a community of information professionals
(librarians, analysts, developers, etc.) behind the scenes who will
enable this in part or whole. They will be developing the schemas (and
schemes) as well as the mappings between schemas (and schemes) for both
content and the metadata to describe content. Meaning and knowledge
exist in people's heads and cannot be explicitly represented, at least
in terms of predicate logic (the 'maths') that RDF is based upon.
Automatically generated RDF is therefore unlikely although no doubt
we'll be exposed to a few instances were this has been achieved.
However, for those managing information services, exceptions are of
limited use.

To couch this in IWMW terms, we're back to gurus again; there aren't
enough of them around to get the job done now that the WWW is pervasive!

        Stephen...

* I use the term 'content' to refer to data, infromation and more:
poetry is neither data nor information. To keep the email short, these
terms are used interchangeably.



-----Original Message-----
From: Brian Kelly [mailto:[log in to unmask]] 
Sent: 24 June 2003 11:21
To: Emmott,Stephen; [log in to unmask]
Subject: Semantic Web and UK HEIs (was RE: New LSE website launched 23rd
June)



...
> I'd welcome constructive criticism from colleagues at other HEIs and 
> would encourage a debate on our ability as a community to make a 
> transition to the 'semantic web'. One question I always ask regarding
> metadata: Where are the tools? (i.e., tools that the owners/
> publishers of content can use)

Hi Stephen
   As the person who chose the topic of the Semantic Web as a plenary
talk at the recent Institutional Web Management Workshop I guess I
should respond :-)
  I am very much aware that there is not a clear understanding of what
is meant by the Semantic Web and what we can gain from it.  Let me give
you my views.
   With a traditional XML-based Web you can do lots of useful things. As
you've done at LSE, you store your data in XML and use XSLT to transorm
it to XHTML.  You could also use XSLT to transform it to other formats.
   However if a third party wishes to integrate your data with theirs
and with other data, there is a problem.  You will have defined your
fields (your XML Schema - i.e. <STUDENT-NUMBER>, <STAFF-ID>,
<VICE-CHANCELLOR>, etc.) according to local needs.  Other organisations
will use different schemas.  SO to merge the data or search across
different data sets we need either to standardise our schemas
(politically different), put the knowledge in the applications
(expensive, not scalable) or adopt a mechanism which allows different
schemas to be integrated.  The Semantic Web provides a solution to this
latter approach.
   As an example have a look at http://triplestore.aktors.org/ (having
first installed Mozilla, as this only works in Mozilla).  This work has
been carried out by a research group at Southampton University.
   This takes data from a number of sources (e.g. the RAE data which is
held on HERO) and converts this to RDF (using a HTML scraping approach).
This can then be integrated with data from other sources - as can be
seen if you have a play in Mozilla.
   Rather than a research group converting the data to RDF (and maybe
getting it wrong) it would be better if the data owner made their data
available in RDF.  This could be then integratd with third party data.
   The bits of magic that make this possible are RDF and URIs.  RDF is
an XML format which includes a mathematical expression which defines
relationships between resources.  The relationships are not defined in
the RDF language but at a URI - so RDF is extensible.
   It would seem that the benefits from the Semantic Web are gained when
you wish to merge data from disparate sources.  There is then a question
of who should fund the investment to do this.  
   My thoughts - which may contain errors due to my flawed understanding
of the Semantic Web.

Brian

PS In response to your question, where are the tools - in the example I
gave the metadata is the data so there isn't a need for metadata
management tools.
   
> Best wishes,
> 
>         Stephen...
> 
> Stephen Emmott
> Projects Director (Editor in Chief, LSE website)
> Business Systems & Services, LSE
>

Top of Message | Previous Page | Permalink

JiscMail Tools

Files Area | help

RSS Feeds and Sharing

Search Archives

Advanced Options

Archives

May 2024
April 2024
March 2024
December 2023
November 2023
August 2023
July 2023
June 2023
April 2023
March 2023
February 2023
December 2022
October 2022
August 2022
July 2022
June 2022
May 2022
March 2022
February 2022
January 2022
December 2021
November 2021
September 2021
July 2021
June 2021
May 2021
February 2021
January 2021
December 2020
November 2020
October 2020
July 2020
June 2020
May 2020
April 2020
March 2020
January 2020
December 2019
November 2019
October 2019
September 2019
August 2019
July 2019
May 2019
April 2019
March 2019
February 2019
January 2019
December 2018
November 2018
October 2018
September 2018
August 2018
July 2018
June 2018
May 2018
April 2018
March 2018
February 2018
January 2018
December 2017
November 2017
October 2017
September 2017
August 2017
July 2017
June 2017
April 2017
March 2017
February 2017
January 2017
December 2016
October 2016
September 2016
August 2016
July 2016
June 2016
May 2016
April 2016
March 2016
February 2016
January 2016
December 2015
November 2015
October 2015
September 2015
August 2015
July 2015
June 2015
May 2015
April 2015
March 2015
February 2015
January 2015
December 2014
November 2014
October 2014
September 2014
August 2014
July 2014
June 2014
May 2014
April 2014
March 2014
February 2014
January 2014
December 2013
November 2013
October 2013
September 2013
August 2013
July 2013
June 2013
May 2013
April 2013
March 2013
February 2013
January 2013
December 2012
November 2012
October 2012
September 2012
August 2012
July 2012
June 2012
May 2012
April 2012
March 2012
February 2012
January 2012
December 2011
November 2011
October 2011
September 2011
August 2011
July 2011
June 2011
May 2011
April 2011
March 2011
February 2011
January 2011
December 2010
November 2010
October 2010
September 2010
August 2010
July 2010
June 2010
May 2010
April 2010
March 2010
February 2010
January 2010
December 2009
November 2009
October 2009
September 2009
August 2009
July 2009
June 2009
May 2009
April 2009
March 2009
February 2009
January 2009
December 2008
November 2008
October 2008
September 2008
August 2008
July 2008
June 2008
May 2008
April 2008
March 2008
February 2008
January 2008
December 2007
November 2007
October 2007
September 2007
August 2007
July 2007
June 2007
May 2007
April 2007
March 2007
February 2007
January 2007
2006
2005
2004
2003
2002
2001
2000
1999
1998

JiscMail is a Jisc service.

View our service policies at https://www.jiscmail.ac.uk/policyandsecurity/ and Jisc's privacy policy at https://www.jisc.ac.uk/website/privacy-notice

For help and support help@jisc.ac.uk