This harvester uses the OAI-PMH interface of openBis to harvest metadata.
Use pip to install this plugin. This example installs it in /vagrant
source /home/www-data/pyenv/bin/activate
pip install -e git+https://github.com/openresearchdata/ckanext-openbis.git#egg=ckanext-openbis --src /vagrant
cd /vagrant/ckanext-openbis
pip install -r requirements.txt
python setup.py developMake sure the ckanext-oaipmh and ckanext-harvest extension are installed as well.
- add
openbis_harvestertockan.pluginsindevelopment.ini(orproduction.ini) - restart your webserver
- with the web browser go to
<your ckan url>/harvest/new - as URL fill in the base URL of an OAI-PMH conforming openBis instance
- select Source type
openBis - if your OAI-PMH needs credentials, add the following to the "Configuration" section:
{"username": "foo", "password": "bar" } - if you only want to harvest a specific set, add the following to the "Configuration" section:
{"set": "baz"} - Save
- on the harvest admin click Reharvest
On the command line do this:
- activate the python environment
cdto the ckan directory, e.g./usr/lib/ckan/default/src/ckan- start the consumers (NOTE: only run 1 gather and 1 fetch consumer per server):
paster --plugin=ckanext-openbis harvester gather_consumer &
paster --plugin=ckanext-openbis harvester fetch_consumer &-
run the job:
paster --plugin=ckanext-openbis harvester run
The harvester should now start and import the OAI-PMH metadata.
To make it easier to develop, tests are setup that allow to do that:
. ~/default/bin/activate
cd /vagrant/ckanext-openbis
In this example the logging filter is used to only show messages of the harvester.