Recently in Google Category

OOOOOOCLC

|
Bookmark or Share Bookmark or Share
A while back, I made a joke on April 1 about Google buying OCLC.  This was actually a well coordinated April Fool's Day attack between myself and the folks at ALA Techource (none of our bosses from back then want to know how much time actually went into this coordination).  I guess one true test of a joke is its staying power, and this one has it, oddly enough (I suggest using Google to see the folks who took it seriously, and that way I can avoid embarrassing anyone).  From time to time, I even find current links to the announcement treating it as real news.

So, either this joke was really funny, or the juxtaposition of two "big switch" players is intriguing to librarians.  My money is on the latter. I'm pretty pleased that OCLC has embarked on record-sharing deals with Google because I have always thought that search companies with great algorithms generally undervalue the power of metadata.  I'm convinced that they only way to prove the point is to show them, as libraries are starting to do with faceted browse catalogs.

I'm also insatiably curious as to what the first page of search results in Google Book search will look like when there are 100 million books in the database.  What will Scholar look like with 100 million books and 100 million articles?  The best answer I am able to get from Google is "highly relevant."  Relevance ranking is hard, as we have learned in enhancing WorldCat with non-monograph metadata.  Done well, however, it greatly enhances the discovery experience for patrons.

Other people are beginning to wonder out loud about the prowess of Google and WorldCat data.  Another recent post had me wondering about the future of Google Books.  CrossRef has created a new plagiarism screening service called CrossCheck (clever!). Another indicator of my love of data and what it empowers us to do and discover.  It got me thinking (un-originally, apparently) about what Google could cook up in searching for plagiarism once it has millions and millions of books scanned.  I understand that Google is doing duplicate checking in its scans to keep from scanning books twice, so I imagine that "plagiarism checking" would be rather simple for them too.  Literary crime detection using Google could be a fun pastime.

Artificial Unintelligence

|
CommentsComments (1) | Bookmark or Share Bookmark or Share
Five years ago, I wrote a book.  Originally titled "Strange Bedfellow," this work on the relationships between libraries, vendors, and dot-com entities was re-titled The Ultimate Digital Library: Where the New Information Players Meet.  Some of the better parts of that book seemed so radical then and look entirely passe now.  It's hard to believe it's been five years since it came out, but where was I supposed to go from "Ultimate?"  The More Ultimate Digital Library?

I was actually looking to see if I could pick up some used copies since I have given all but one of mine away, when I was pleased to see that I had joined the ranks of Wikipedia, albeit in an unusual way.  My Wikipedia entry mentions my book, but apparently has me confused with a professor of artificial intelligence at U. Penn.  The cite for Dr. Pace's book came from Google Book Search.  I could not help but note the irony that I have been mashed up in cyberspace by a combination of Wikipedia and Google.

Some of the more famous entries in WorldCat Identities have links out to Wikipedia.  It occurs to me now that Wikipedia could use some links in the other direction for us much less famous.

About the Author

Andrew K. Pace

I am Executive Director for Networked Library Services at OCLC. I am also a past President of LITA. On occasion, I am known for pontificating "on stage, in writing, and via the web" on a variety of issues important to libraries.

More »

Find In A Library

Search for an item in libraries near you:
WorldCat.org »