SRU Update and the Dutch Union Catalogue

|
Comments Comments (4) | Bookmark or Share Bookmark or Share

By Janifer Gatenby, OCLC Leiden Office

In February 2008, the Nederlandse Centrale Catalogus (NCC: the Dutch Union Catalogue)* started updating WorldCat in real time.  As Dutch librarians create new bibliographic records and add new holdings to new and existing records in the NCC, WorldCat is simultaneously and automatically updated behind the scenes.  Thus, all new data is visible in WorldCat within seconds, even though it is entered via a local interface. Once in WorldCat the data is exposed to a global audience via the freely available site WorldCat.orgThe data is also formatted and made available to the major search engines, in particular Google and Yahoo and various other major sites such as Facebook.  Many sites, including an increasing number of blog sites have downloaded the WorldCat search box allowing direct searching from their pages.  Other sites are accessing WorldCat via their own programs using the WorldCat API (Application Program Interface).

 

The importance of up to date and comprehensive information in all potential discovery environments cannot be overstated; users may search wherever they choose and librarians can be confident that these users will be able to retrieve the same resources, no matter their point of entry. 

 

SRU record update** is a standard designed specifically for updating union catalogues.  The standard permits timely update, a mechanism for the feedback of information, including errors (so that they can be fixed, hopefully promptly, by the contributors themselves) and alignment identifiers.  SRU record update is designed for metadata, recognizing that neither source nor target systems retain exclusive control over metadata as is usually the case with shared documents.  Examples of typical changes are records linking to authority records (detailed information about authors and subjects), receiving new holdings reports from other libraries, and being merged and enriched at the same time by duplicate records.  Because metadata does not remain static, it is important to share identifiers to keep records in different systems in alignment.  Unique identifiers also allow one system easy, quick and unambiguous access to related records in another system and any associated data, e.g. holdings, enriched cover art, tables of content, reviews, notes and lists.

 

Without the SRU record update protocol, new data is added via batches and delays in processing update batches of data are common, and sometimes quite lengthy.  Delays can occur in creating batches or in processing the batches that are received.  SRU record update also ensures comprehensiveness as it eliminates the risk of missed batches and provides timely validation errors to minimize the number of missing erroneous records.

 

In 5 months of operation of SRU update, the NCC has added to WorldCat more than 1.5 million holdings and 230,000 bibliographic records with a time between appearance in one catalogue and appearance in the other averaging less than 5 seconds. For the first two months, there were two streams of data, ongoing regular work and a gap load of records processed between the initial batch load and the live date for SRU record update.  The gap load served to confirm the scalability of the interface.  On average, between 1800 and 2000 transactions per day are sent from the NCC to WorldCat.    

 

Following the NCC success, a second release of the software was implemented on 14th July 2008, extending the service to include enhanced security, a mechanism for the deletion of holdings and a validation only routine.  The validation only routine serves as a valuable testing tool for both libraries and OCLC developers alike.  The service is now ready for further implementations.  The second installation will be the Libraries Australia *** service later this year.

 

The new service has great potential for WorldCat because a growing percentage of contribution to WorldCat is not via WorldCat's online Connexion interface.  All systems, of libraries of all types, currently contributing via batch can potentially switch to interactive update.  Sending systems need to have SRU record update "pusher" capability and OCLC is investigating the possibility of creating tools to facilitate the incorporation of the protocol into existing integrated library systems.

 

 

*The NCC contains bibliographic references and the locations of approximately 12 million books and almost 500,000 periodicals in more than 400 libraries in the Netherlands. The NCC was initially loaded to WorldCat in fiscal year 2006.

**SRU record update was developed by members of the SRU community as a modernization of the Z39.50 extended service update and Union Catalogue Profile.

***Libraries Australia, coordinated by the National Library of Australia, is used for reference, collection development, cataloguing and interlibrary lending.  Libraries Australia contains the Australian National Bibliographic Database (ANBD), which records the location details of over 42 million items held in most Australian academic, research, national, state, public and special libraries.  The holdings of Libraries Australia became part of WorldCat in fiscal year 2008. 

 

Comments 4 Comments

Henry said:

Hi

I have attended the 'OCLC Data Resources ...' session in IFLA 2008. This was mentioned by Joanna.

It is good to hear OCLC is experimenting this service. It will ease up the batch-loading. I hope to see this can be used by all members soon.

Is there any system requirement in the local system to use such service?

Regards

Henry
SILAS
Singapore

Karen Calhoun Author Profile Page said:

Dear Henry,
Thanks for your comment. I have forwarded your message to the person in charge of SRU Update here at OCLC. She may be able to give you a timeframe for wider implementation of SRU update, beyond OCLC's CBS systems. Karen

Janifer Gatenby said:

Hello Henry,

At OCLC, we will be encouraging use of SRU record update as a means of contribution to WorldCat from local and union catalogues. The system of the contributing catalogue needs to have a SRU record update client which we call the "Pusher". The "Pusher" is capable of generating record inserts, record updates and record deletes as they occur on its own database. It can also process the responses from WorldCat, in particular the responses that include the OCLC number (record identifier) and can then insert these identifiers back into the records in its database.

A SRU record update "pusher" is not difficult to implement, and I am aware of other examples of implementation besides the ones between OCLC and its CBS partners.

While OCLC currently has no development schedule for implementing SRU update this year beyond our work with CBS partners, we are looking into ways to facilitate adoption of SRU record update with other systems in the future. One possibility is to create a service that will poll a catalogue and extract records for update. This is not a complete solution because there would need to be an alternative way of sending identifiers and error diagnostics. Another possibility is a tool set that we could make available.

Meanwhile we are very keen to hear from anyone with an SRU record update "pusher" or project to make one.

Janifer

Henry said:

Dear Karen and Janifer

From the perspective of members, we do have concern about the WorldCat being not able to hold our local data. Many members use Connexion mainly for copy-cataloging. We will do our local customisation at our local systems. This is because many libraries have their local policies different from the international standards. Name authority is one of the examples (especially in Asian countries). Therefore, the SRU Update will make the members' holding info in WorldCat be more synchronized.

Another point I would like to make here is: if the program can insert/update the oclc number assigned back in the local record in the member's database. This is important in order to make the deep-link in worldcat.org to work properly. There are many situations the link back from worldcat.org to the local opac fail.

Thank you.

Regards
Henry Lam
http://www.silas.org.sg

About this blog

Metalogue is a forum for sharing thoughts on all things related to knowledge organization by and for libraries, hosted by Karen Calhoun, Vice President, WorldCat and Metadata Services for OCLC. Karen is joined often by friends and colleagues from all over the globe, who contribute perspectives and experiences about the current and future state of cataloguing and metadata.

Find In A Library

Search for an item in libraries near you:
WorldCat.org »