With JISC funding, UKOLN has developed a prototype harvesting and aggregation system for metadata from UK Institutional repositories called 'RepUK'. Currently we are harvesting only OAI-DC records, and the focus is on scholarly papers.
Our main interest is in providing the aggregation as a component to be potentially used in wider, distributed systems. We are actively liaising with the JISC-funded Resource Discovery Infrastructure Taskforce about this possibility.
Another aspect of our work with this development is the examination of the metadata records themselves in aggregate - we are exploring ideas around deriving a degree of 'intelligence' about the state of the metadata to be found in OA repositories in the UK.
We hope to show a working system very soon, with a first cut at providing some 'visualisations of data quality' across the aggregation of metadata. If you're interested in seeing the work in progress then you can access a prototype here:
http://kitt.bath.ac.uk/RepUK/hello.htm
This should be considered a temporary location for this interface. An incremental harvest is in operation now and will continue over this weekend, but all repositories will be showing figures which should be no older than the last full harvest which was in January. Once we go love with this then a rolling, continuous harvest process will be introduced. I'm pleased to say that our data corroborated Chris's statement about the number of records showing in the Edinburgh repository.
I should note that we rely on the excellent OpenDOAR API to provide us with the list of and OAI-PMH base URLs for the repositories we harvest.
The lead developer on this project is Mark Dewey ([log in to unmask]) so please feel free to contact him off list if you have comments or questions.
Paul
On 10 Feb 2011, at 10:41, Chris Rusbridge wrote:
> I noticed that the statistics on the OpenDOAR list of repositories can be way out of date, eg last time I looked the ERA repository from Edinburgh was reported as having 1132 items, yet today it actually has over 4,000 items. Is OpenDOAR effectively a dead snapshot?
>
> --
> Chris Rusbridge
> Mobile: +44 791 7423828
> Email: [log in to unmask]
>
--------------------------------------------
Paul Walk
Deputy Director
UKOLN (University of Bath)
http://www.ukoln.ac.uk/
[log in to unmask]
+44(0)1225383933
--------------------------------------------
|