Make a Lasting Impression

Get Involved!

Who's Online

13 user(s) are online (5 user(s) are browsing ImpressCMS Blog)

Members: 0
Guests: 13

more...
ImpressCMS proudly uses SourceForge
ImpressCMS on Ohloh.net
ImpressCMS Blog > Federation of digital libraries with ImpressCMS

Federation of digital libraries with ImpressCMS

I was fortunate to attend a 'winter school' on Federation of Learning Repositories for Agriculture, Food and Environment in Hungary from 2-6 November. The workshop was essentially about how to make digital libraries interoperable with one another, so that people can search across collections owned by different organisations simultaneously – think 'cross-site search'. The course focused on standards and models for automating data exchange.

There's been a fair bit of talk in our community over the years about developing cross-site search, so it was very interesting to meet and be taught by people who are doing it on an industrial scale. Trainers included David Massart from the European Schoolnet project and Erik Duval, President of the Ariadne Foundation. Both are involved in establishing distributed networks of learning repositories involving universities, research institutions and other organisations. Put simply, they want to make everyone's educational materials available to everyone else, globally.

We looked at two models of federation:

  • Querying: A search made on one site is forwarded as a query to a network of affiliated sites, which provide an external query interface. Each site in the network returns its own results, which are displayed in aggregate to the user.
  • Harvesting: A site may periodically import the records (metadata) held by a network of affiliated sites, which expose their content via a web service. When a user runs a search on one site they have access to all the metadata in the network, but it is stored locally, so the search is much faster.

We had hands-on implementation sessions for both models, during which we set up a Simple Query Interface target, and an Open Archives Initiative Protocol for Metadata Handling (OAI-PMH) target, and also tools to run queries and harvest metadata against those targets. Something of a challenge, as we were working in Java!

Sharing data between different computer systems opens up a diabolical can of worms about format. So we spent a fair bit of time looking at metadata standards such as the Dublin Core and Learning Object Metadata. Standardising your data is difficult but it opens up whole new worlds of discovery, sharing and reuse, and maybe this is something worth thinking about for ImpressCMS at a project level.

The workshop included two demonstration sessions where participants displayed their digital library systems to others. This provided a good opportunity to wave the flag for ImpressCMS and also to get some feedback on a new 'Open Archive' module I'm developing. This started out as a 'publications' module using the Dublin Core standard metadata fields. However, it can also handle streaming audio, embeddable video, plain text records and hopefully soon images as well. The module can permit its records to be harvested via an OAI-PMH web service, so it can participate in cross-site search and distributed digital library systems.

The course was run under the auspices of the Agricultural Learning Repositories Task Force. It was organised by the ASPECT Best Practices Network, Organic.Edunet and the Food and Agriculture Organization of the United Nations. It was sponsored by the United Nations University, CGIAR, Rural Inclusion Project, iQTool Project, Organic Mednet and the European Foundation of Information Technology in Agriculture. I would like to thank them all for the time they spent with us at the course, and also for sponsoring my travel and accommodation, which was greatly appreciated.

All posts by Madfish
Subscribe to latest posts
The comments are owned by the poster. We aren't responsible for their content.
Poster Thread
Vaughan
Posted: 2009/11/19 15:37  Updated: 2009/11/19 15:37
Home away from home
Joined: 2007/12/4
From: Derbyshire/UK
Posts: 1749
 Re: Federation of digital libraries with ImpressCMS
interesting, something to look into anyhow.

XML files is a good way of transporting records to & from multiple sites, rather than sending full blown documents & multiple queries.

you could put the meta data, search content & other relevant data into an XML file and then send the xml to the other sites, each site then would open the xml file and instantly have all that info.

of course you'd need to create the xml file in the 1st place, but that wouldn't be too hard to do.
Madfish
Posted: 2009/11/20 4:23  Updated: 2009/11/20 4:23
Home away from home
Joined: 2007/12/4
From:
Posts: 416
 Re: Federation of digital libraries with ImpressCMS
Yeah that's pretty much what the Open Archives Initiative protocol does. A remote harvester can request for your records and your site will return them in XML according to a specified schema. The harvester can also request records within a certain time range, so it can come back and collect new records / update itself periodically.

One interesting application of the protocol could be to mirror content across sites. Since its just the metadata, if the mirrors all point back to the site actually hosting the download or whatever, it would be kind of like a web ring on steroids.
Vaughan
Posted: 2009/11/20 13:32  Updated: 2009/11/20 13:32
Home away from home
Joined: 2007/12/4
From: Derbyshire/UK
Posts: 1749
 Re: Federation of digital libraries with ImpressCMS
interestingly enough, i am currently developing a system for exporting data from modules etc and config settings to XML files aswell as importing from XML.

so once that is complete, we can look at developing the system further to allow us do do something like this across multiple sites.
Will
Posted: 2009/11/21 16:23  Updated: 2009/11/21 16:23
Home away from home
Joined: 2007/12/4
From: Fort Worth, TX
Posts: 3111
 Re: Federation of digital libraries with ImpressCMS
This is so far over my head that I can barely see up its skirt - but it sounds cool as hell.
skenow
Posted: 2009/11/22 11:41  Updated: 2009/11/22 11:41
Home away from home
Joined: 2007/12/4
From:
Posts: 2886
 Re: Federation of digital libraries with ImpressCMS
From what I understand, this is like global tagging - similar to social bookmarking on Delicious. The only difference is, instead of having to go to delicious.com to find similarly tagged content, your site has the capability to share this information with other sites, either by pushing it out or pulling it in using XML, SOAP, REST, or some other technique.

Is this right, Madfish?