Sahara is a very flexible OAI-PMH harvester that has been developed for
the DAREnet, LOREnet and EduRep projects. Sahara has been developed for
SURFnet (a subsidiary of SURF, the Dutch counterpart of JISC) by CQ2.
SURFnet is looking for partners to develop Sahara further as open source
software. I think it has the potential of becoming a major tool for
OAI-based repository networks. I hope this mailing list is good starting
point for that. Any ideas will be appreciated!
The key features of Sahara:
- Can harvest any type of metadata (as long as your processing software
can handle it).
- Sophisticated user interface for managing the harvesting proces.
- Repositories can be organized in independent domains (e.g. for each
project one domain) and groups (e.g. for different partners within one
group).
- Configurable mapping can monitor, filter en translate metadata.
- Extended logging.
- Ability to selectively reharvest repositories.
- Harvested metadata can be send to any kind of processing software.
Currently supported are de FAST and Lucene search engines and
- Written in Python.
We are working on the last stages of the 2.0 release. Attached to this
message there are two documents. One describes the user interface for
Sahara. This is the best description of the functionality we have at
this moment. The other document describes the get-interface, which is a
webservice that enables applications to request harvesting information
from Sahara.
More technical questions can be answered bij Erik Groeneveld of CQ2.
There is no project site yet. Setting this up will be part of the open
source effort.
If someone is interested in using Sahara we could set up a test account
for reviewing the user interface (without actual harvesting). If you
want to test drive it we can offer an limited harvesting capacity,
including the use of the SURFnet Search Engine. Which offers a
SRU/SRW-interface for searching the harvested content.
Links
SURFnet: http://www.surfnet.nl/info/en/home.jsp
DAREnet: http://www.darenet.nl/en/page/language.view/home
LOREnet: http://www.lorenet.nl/en/page/page.view/watislorenet.page
EduRep: http://contentketen.kennisnet.nl/praktisch/contentchain
CQ2: http://www.cq2.nl/en/toon
--
Erik Saaman
SURFnet bv
06-28954267
|