« GPO LOOKS AT PUBLIC-PRIVATE PARTNERSHIPS TO ACCESS GOVERNMENT INFORMATION | Main | WITH LETTERS, PROFESSORS ENTER FRAY OVER RISING TEXTBOOK PRICES »
Chuck Hamaker and Brad Spry, Google Scholar, Serials, March 2005. Excerpt: 'Google Scholar apparently made a decision to index fairly completely all scholarly, known or cooperating publisher-based sites, but to only partially index university web sites based on file format identifiers, i.e. PDF or PS files....Of the 24,000 items at The University of North Carolina (UNC) Charlotte that a Google site search identifies when ‘pdf’ is used as a search term, fewer than 500 are identified in Google Scholar. This does not bode well for inclusion of special collections and other content being created by libraries specifically for the web....If we understand correctly what it does index, it is time to get on with the much larger job of identifying more trusted scholarly sources. It has done a great job with the basic stuff, indexing 25% to 50% or more from many participating sites and obvious locations (such as arXiv) and identifying scholarly content through secondary means, i.e. citations and abstracting sources like PubMed and ACM. Between the first and third weeks of December, coverage tripled for many standard publishers. But inclusion based on a fairly limited primary source list and document format (or bibliographic citations) are just a beginning. Can it go beyond this to the rest of the scholarly resources on the web?' Open Access News 4/7/05
Posted by at April 25, 2005 10:56 AM