The Importance of Identifiers
Here is another entry from my colleague
-------------------------------------------------------
My passport number is my identifier. The passport also carries metadata that identifies me, but not necessarily uniquely, because someone else could have the same name, place of birth and date of birth. In that case, more elements, such as my photograph, are needed to distinguish me from another person. Because the number of elements required for uniqueness can vary, an identifier is assigned once identity has been established, for ease of future reference. It is therefore hard to imagine a passport without a passport number, even if, strictly speaking, the "number" is usually an alphanumeric string rather than purely numeric. In a database, an identifier is used to establish identity in preference to an ensemble of descriptive elements. Unique identifiers provide direct access to records and are of fundamental importance in eliminating duplicates, both within a database and among incoming records.
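As a rough illustration of why an identifier beats an ensemble of descriptive elements for removing duplicates, here is a minimal Python sketch. The `Record` fields and the `merge_incoming` function are hypothetical names chosen for this example; they do not describe any particular system.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    identifier: str  # e.g. a passport number (hypothetical field)
    name: str
    birth_date: str


def merge_incoming(database: dict, incoming: list) -> list:
    """Add incoming records to the database, keyed by identifier.

    Records whose identifier is already present are not added;
    they are returned as the list of rejected duplicates.
    """
    duplicates = []
    for record in incoming:
        if record.identifier in database:
            duplicates.append(record)
        else:
            database[record.identifier] = record
    return duplicates


# Two people share a name and birth date, yet remain distinct because
# their identifiers differ; the third record repeats an identifier.
database = {}
rejected = merge_incoming(database, [
    Record("P123", "Ann Lee", "1980-01-01"),
    Record("P456", "Ann Lee", "1980-01-01"),
    Record("P123", "Ann Lee", "1980-01-01"),
])
print(len(database), len(rejected))
```

Comparing on name and birth date alone would have wrongly merged the first two records, whereas the identifier lookup is both direct and unambiguous.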
Identifiers are important in the commercial world, having a key role in distribution, promotion, rights management and copyright protection. The ISO identifier standards, published or in progress, include:
- ISBN (ISO 2108). Monographs - manifestation level
- ISSN (ISO 3297[1]). Serials - manifestation level (but also used at the work level)
- ISMN (ISO 10957[2]). Music - manifestation level
- ISWC (ISO 15707[3]). Music - work level
- ISTC (ISO 21047[4]). Text - work level
- ISRC (ISO 3901[5]). Sound recordings - manifestation level
- ISAN (ISO 15706[6]). Audio-visual - work level
- V-ISAN (ISO 15706-2). Audio-visual - manifestation level
- ISIL (ISO 15511[7]). Libraries
- ISNI (DIS 27729). Name identifier - currently in progress
- ISCI (CD 27730). Collections - currently in progress
- DOI (Digital object identifier) - currently in progress
NISO, the US National Information Standards Organization, is also involved in identifier standards. In particular, it has a work item in progress for an institution identifier covering all organisations involved in the supply chain of serial publications.
All the ISO standards, with the exception of the ISIL and the ISCI, are identifiers created to underpin commercial trade. These ISO identifier standards only cover materials for which somebody has applied for an identifier, and the application process can be expensive. As a result, it is estimated that only 30% of the resources represented in the WorldCat database have international identifiers. For the so-called "long tail" of resources of little or no commercial value, only quasi-official identifiers exist, such as the Library of Congress Control Number (LCCN) or the OCLC number. However, identifiers are becoming increasingly important in the Internet environment as a means of accessing identical resources at multiple sites, and hence the identifiers need to be unique on a global scale. They also need to be capable of being embedded within a URL (Uniform Resource Locator). URLs themselves are addresses and make very poor identifiers, because they are both location-specific and volatile, changing frequently. This has led to the emergence of resolution systems that link from identifiers of resources to locations of information about the resources, or to the actual resources themselves. The DOI is one such resolution system.
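The idea of a resolution system can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Resolver` class, the base address `https://resolver.example.org` and the identifier `example-id/001` are all invented for the example, and the sketch does not describe how the DOI system or any other real resolver is implemented.

```python
from urllib.parse import quote


class Resolver:
    """Toy resolution table: stable identifier -> current location."""

    def __init__(self, base_url: str):
        self.base_url = base_url.rstrip("/")
        self._locations = {}

    def register(self, identifier: str, location: str) -> None:
        # Registering again simply overwrites the old location.
        self._locations[identifier] = location

    def persistent_url(self, identifier: str) -> str:
        # Percent-encode the identifier so it can be embedded in a URL.
        return f"{self.base_url}/{quote(identifier, safe='')}"

    def resolve(self, identifier: str) -> str:
        return self._locations[identifier]


resolver = Resolver("https://resolver.example.org")  # hypothetical address
resolver.register("example-id/001", "https://old.example.com/item")
link = resolver.persistent_url("example-id/001")

# The resource moves: only the table entry changes, not the identifier
# and not the link that others have already cited.
resolver.register("example-id/001", "https://new.example.com/item")
print(link, "->", resolver.resolve("example-id/001"))
```

The point of the design is that the volatile part (the location) lives in one updatable table, while the identifier, and any URL built from it, stays fixed.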
There are several models for registering identifiers. In some cases (e.g. ISBN) the international agency releases blocks of numbers to national agencies, which then assign them for use by publishers. Registration of the metadata associated with the identifier is the responsibility of the publisher and the national agency, not the international agency, so the definitive metadata for books is found in "in print" lists and national bibliographies. In other cases, a central database of identifiers and their associated metadata is maintained, as for the ISSN and as is planned for the ISTC. WorldCat can potentially become a reference database for unique identification of resources of all types, commercial and non-commercial alike.
Within WorldCat, as in other databases, identifiers are the key to linking and navigating from resources and their holdings in libraries to related resources, such as different editions of the same work or different works by the same author. Identifiers also link base data to enriched data, both within the same database and in external databases. Further, identifiers are used to link from resources to services relating to those resources, for example from a metadata record within WorldCat.org to online delivery services provided by a local library. The standard protocols that underlie interoperability use identifiers for processing transactions. OCLC is already providing identifier services so that its identifier infrastructure can be used by other systems. The first two of these services are now in production: xISBN and xISSN, which allow retrieval of related resources by ISBN and ISSN respectively. The ISSN service includes a graphic display of the history of a serial, as per the figure below.
For a clearer view of this example, please visit the xISSN registry here.
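To make the linking role of identifiers concrete, the following Python sketch groups manifestation identifiers under a work identifier and then looks up the other editions of the same work. It is a toy model only: the function names and the placeholder identifiers (`m-1`, `w-1` and so on) are invented here, and it is not the interface of xISBN, xISSN or any other OCLC service.

```python
from collections import defaultdict


def build_work_index(editions):
    """Index (manifestation_id, work_id) pairs in both directions."""
    work_of = {}                # manifestation identifier -> work identifier
    by_work = defaultdict(set)  # work identifier -> manifestation identifiers
    for manifestation_id, work_id in editions:
        work_of[manifestation_id] = work_id
        by_work[work_id].add(manifestation_id)
    return work_of, by_work


def related_editions(manifestation_id, work_of, by_work):
    """Return the other manifestations of the same work, sorted."""
    work_id = work_of[manifestation_id]
    return sorted(by_work[work_id] - {manifestation_id})


# Placeholder data: two editions of one work, one edition of another.
work_of, by_work = build_work_index([
    ("m-1", "w-1"),
    ("m-2", "w-1"),
    ("m-3", "w-2"),
])
print(related_editions("m-1", work_of, by_work))
```

Because every step is a lookup on an identifier, no comparison of titles, authors or other descriptive elements is needed to navigate between editions.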
Further identifier services are being released progressively by OCLC. On the horizon are a service that groups resources at work level, currently in pilot with the Dutch union catalogue, and services based on manifestation identifiers (project GLiMIr - Global Library Manifestation Identifier).

