February 9, 2011 - CIC contribution to Google Books at 1,000,000 and counting!
From Google Team co-manager Ann Snowman:
Earlier this month, Kimberly Armstrong, Deputy Director of the Center for Library Initiatives, Committee on Institutional Cooperation, announced that the CIC partners’ contributions to the Google Books project have passed the million-book milestone. Penn State has contributed its fair share to that number, and continues to send monthly shipments for scanning.
As the body of digial content grows, we are learning new ways to use the corpus that can be found in HathiTrust and Google Books to our scholarly advantage. Ellysa Cahoy recently hosted “Using HathiTrust and Google Books in the Context of Discipline-Based Library Instruction,” where panelists John Meier, Eric Novotny and Paula Contreras shared strategies for finding relevant full-text resources for a range of disciplines via HathiTrust and Google Books (click here for Ellysa’s notes from the seminar). Despite the day-to-day drudgery involved in processing materials for digitization, this is an exciting time! We are pioneers at the threshold of a new age in scholarship. I think we should all pause for a moment to appreciate the contributions that Penn State University Libraries are making in this new frontier.
September 20, 2010 - HathiTrust Digital Library links now included in The CAT
From database specialist Binky Lush:
The Google Books Project Team and the Next Gen Interface Team are pleased to announce links to the HathiTrust Digital Library in the CAT.
HathiTrust is a collaboration of the thirteen universities of the Committee on Institutional Cooperation, along with the University of California and the University of Virginia systems, to establish a repository for these institutions to archive and share their digital collections.
Beginning this week, HathiTrust icons will appear in the brief record in the CAT when Hathi offers access to a digital version of the item. The icon will link to the digital resource or list of resources within HathiTrust and will indicate three levels of access to the item(s):
- Full view: Indicates full text is available from the HathiTrust.
- View multiple volumes: Indicates multiple volumes are available from the HathiTrust. They may be full text and/or search only.
- Limited view (search only): Indicates the text is searchable but only portions of the text are available due to copyright restrictions.
The links to Google Books will continue to be displayed as well, but will be enhanced with more information about levels of access (full, partial, or limited view) within Google Books. For a full explanation of each access level, click on the More Information link in the CAT record.
August 9, 2010 - Project reaches half-way mark, Gov Docs contribution winding down
From Google Team co-manager Ann Snowman:
We are well past the mid-point of the University Libraries' contribution to the CIC/Google Book Search project and now see the light at the end of the tunnel. We just sent a fresh snapshot of our catalog to Google, which they will compare to the old snapshot to update the picklist.
Our contribution to the current Gov Docs project is all but completed, with the last shipment probably leaving this month.
Selectors are now reviewing a revised withdrawal procedure that will explain what to do when withdrawing a book that was digitized, to be released for practice shortly.
February 5, 2010 - Project managers present "One Year In: A Google Conversation"
Lisa German and Ann Snowman spoke to an audience in Foster Auditorium and online about the past, present, and future of the Google Books project at Penn State. They highlighted some of the related benefits of the project thus far, including near-100% fulfillment rates for materials requested from the Annex, improvements to Interlibrary Loan services, and more accurate records of the Libraries’ collections. Looking forward, they emphasized the importance of cross-departmental partnerships and responsiveness to changing patron needs and expectations. Ann and Lisa also discussed the Hathi Trust and other topics during an extended Q&A session.
Click here for the presentation slides (.pdf)
February 4, 2010 - University Libraries join the Google Gov Docs project
Social Sciences Library supervisor Heather Ross writes:
Many of you are familiar with "The Google Project." Books from Penn State University Libraries across the Commonwealth, together with books from other CIC institution libraries, are being scanned by Google for use in Google Books and for deposit in the Hathi Trust (the CIC digital repository). Books are collected from the Libraries, sent to Google and then returned.
There is a second project called "Google Gov Docs," which is not to be confused with the Google Docs App, a Google product. Gov Docs stands for Government Documents. The project is designed to create a definitive digital collection of federal government publications by scanning materials in the CIC library collections. When completed the collection is expected to contain over 500,000 federal documents.
Click here for the rest of the report (staff only)
February 2, 2010 - The sticker says it all! New "e-copy available" label coming soon
Starting in mid-February, University Libraries staff will begin applying the "e-copy available" label to library materials that are also available in electronic format. This includes materials scanned as part of the Google Books Project and by the Internet Archive (through the Lyrasis Mass Digitization Collaborative), as well as library-produced digital copies.
The label underwent several revisions during the design process and an extensive search of similar logos and brands at other institutions to ensure its uniqueness. It will serve primarily as a visual identification to library staff to not change the barcode of the physical copy. Potentially, the label will also promote patron use of the electronic version of the work, and awareness of the libraries' expanding digital holdings in general.
November 19, 2009 - Dean Nancy L. Eaton reports on the recent Google Partners summit
Dean Eaton contributed the following report on her experiences at the Google Partners summit:
Google Library Partners (libraries that are digitizing collections as part of the Google Book Search Project) met at the Google Campus in Mountain View, California on October 21-22 to be updated on a variety of topics and to provide Google with advice on several development issues. I attended as one of the four representatives for the CIC, since I chair the Director’s Group and am on the CIC negotiating team for the renegotiation of the CIC-Google contract. Topics covered during the two days included a summary of what was in the scanned corpus (database) to date, new techniques for minimizing duplication of content, several examples of research being conducted against the Research Corpus at Google, upcoming Google Book search features, and a discussion of the Book Rights Registry by the newly appointed head of the Registry, Michael Healy.
Five key issues were highlighted:
- Statistics: The Corpus as of the Summit contained over eleven million volumes provided by libraries and two million from publishers. 478 languages were represented; 47% were non-English.
- Improvements in GRIN (delivering digital copies to libraries)
- Announcement that the METS standard has been adopted
- A description of the duplication detector and how it is working
- A discussion of the Amended Settlement
Uppermost in attendees’ minds during the Summit was the fact that Google had to respond to public comments on its Settlement Agreement by November 9. Included in the public comments were reports by the Justice Department and the Copyright Office, as well as statements by library associations, publishers, and authors. We were, under non-disclosure, given a preview of what would be in Google’s response to the court. Subsequently, Google asked for an extension until November 13 in order to work with the Authors Guild and the Association of American Publishers in preparing its response. (Documents submitted in the filing of the Amended Settlement Agreement on November 13 are now public.) The Amended Settlement Agreement, if approved by the court, would substantively change the nature of various Partner contracts, thus the need for the CIC negotiating team to be called back into action.
October 29, 2009 - Ann Snowman reports on "perfect" shipment, other news
Google Team co-manager Ann Snowman writes:
Shipment #8 to Google was a "perfect" shipment. When the Google Scanning Center receives our shipment each month, they check in each and every book. They compare this list against the manifest we forwarded electronically. And they report back to us if there were any books that either arrived at the scanning center but did not appear on our manifest, or that appear on the manifest but never made it onto a truck. Given the vast number of books handled each month against a critical deadline, it is understandable that one or two are missed in the process. This does not mean that books are lost, just overlooked in handling. Extra books are not scanned, but are shipped back to us and included in the next outgoing shipment with proper identification. Missing books, possibly set aside for preservation, are tracked down here and processed accordingly, as well. A perfect shipment (with no extra or missing books) indicates that we have hit our stride in the processing and it is becoming more routine - a point of pride for the project staff and managers. Well done!
- The Google Books project team is working on establishing withdrawal procedures in case libraries want to withdraw materials that have been digitized.
- To date, about 40% of the University Libraries' contribution to the project has been sent to Google.
October 9, 2009 - Update on shipments to Google
The eighth shipment of materials to Google left Cato II on September 30, while the previous shipment returned two days later. Preparations are now underway for shipment #9, which will depart on October 28.
All contributions of materials from the Commonwealth Campus Libraries are now complete.
August 13, 2009 - Project managers give presentation at subject heads' meeting
Lisa German and Ann Snowman discussed the benefits of the Google Book Project for the University Libraries. They touched on the Barcode Inventory Project, the OCLC Reclamation Project, the process of sending books to Google for digitization, and the HathiTrust. In addition, they showed some examples on Google Book Search of Penn State books that have been digitized by Google.
Click here for the presentation slides (.pdf) (staff only)
August 11, 2009 - A conversation with Barcoding Team supervisor Verne Neff
Recently, Library Services manager Verne Neff answered a few questions about his experiences transporting materials back and forth to the staging area at Cato Park for the Google Books project.
Verne, how do you manage to get all the books out to Cato II to stage them for shipping?
It takes the help of many people to get the books from point A to point B, and I want to say thanks to them all.
How do most of the books get from the Penn State Libraries to Google’s pickup location at the Cato II annex and back?
Most of them go in a 24-foot box truck. I drive, and Trish Notartomas rides shotgun.
Is it difficult to drive a truck that big?
Driving the truck, to me, is the easy part. It’s everyone else in vehicles on the road, or in tight spots such as loading docks, intersections and such, that make Trish’s eyes get big. She often says, "I’m glad you’re driving, that is too close for me!"
Tell us about the challenges of tight spaces.
Places do get tight, especially here on campus. Everywhere you go there are hundreds of pedestrians, bicycles, and parked vehicles making the driving lanes and maneuvering space needed to jockey into position fun sometimes. Not to mention them walking into your blind spot while you are backing up, or in front of you so close that all you see is the hair on their heads peeking over the hood, or just a hand on the hood as they steady themselves while riding by on their bike.
Are there some experiences that are more memorable than others?
Memorable moments, so far:
- While turning around to drop off the truck, a lady in a Beamer [BMW] stopped us. I thought she was mad about something, but she was only being nice and asked us if we were lost.
- Weighing the truck for Max GVW (Gross Vehicle Weight), for future loads and reporting how much you weigh. OK to me, but maybe an awkward moment for Trish. But she overcame it by thinking of figuring in luggage if we have to do overnighters. If you ask her now what GVW means, she sure can tell you.
- Trish telling one of the Google tractor trailer drivers, "He [Verne], is making a truck driver out of me" and him laughingly responding "Good, we need more women truck drivers out there".
- Questions like, how do to you judge wide turns to accommodate overhang swing and how do you judge how close you are to the dock when backing up with only side mirrors. I try to drive without banging into the dock bumpers and making Jeff [Marker] think the building is falling down.
Have there been many problems with this operation?
It does get interesting at times, but it has been a smooth operation, so far. We have branched out, starting to make trips to campuses when and where needed, so if you see a 24-foot box truck traveling down the highway and you say to yourself "Hey, that looks like Trish and Verne", it probably is. So don’t be afraid to wave and honk!
August 5, 2009 - News on recent and upcoming shipments
The sixth shipment of materials was sent to Google for scanning last Wednesday. Roughly 68% of those materials came from the University Park subject and branch libraries. They will return September 4th. Shipment #5, which departed on July 9th, returned last Friday. Planning for the seventh shipment is underway. Those items will be sent on September 2nd and return October 2nd. A significant amount of materials from Life Sciences and Earth & Mineral Sciences are already slated for this shipment.
July 1, 2009 - Presentation on "The Barcoding Inventory Project: Treasures, Treats, and Tribulations"
Jackie Dillon-Fast and Christopher Lemery spoke about the purpose, progress, challenges and benefits of the Barcoding Inventory Project (BIP) to an audience of library staff at the Kern Building and online. They illustrated the most common shortcomings they’ve encountered (and fixed) in catalog records while barcoding over 300,000 inches of library materials since 2003. They also discussed the impact of the Google Book Search project on the BIP team’s focus in the near future. And they presented several examples of their most interesting finds among shelves of previously undiscoverable cultural artifacts.
Click here to view the presentation on Mediasite Live (search "barcoding")
May 26, 2009 - Project members reflect on productive year
Google Team co-director Lisa German reports:
The Google Book Scanning Team gathered together to celebrate the year’s accomplishments. Three shipments have been sent to Google for scanning, thousands of barcodes have been applied to our collection, thousands of bibliographic records have been fixed or upgraded, and hundred of books have received preservation treatment. We’re looking forward to another productive year!
May 21, 2009 - Shipments status, appreciation for the campus libraries
The first two shipments have arrived back from Google, while the third shipment is still being digitized. The fourth shipment is currently being prepared for June.
Lisa German, Ann Snowman, and Trish Notartomas would like to thank the campus libraries for their submissions to the project, and also the branch libraries that are preparing to submit materials.
May 1, 2009 - Campus libraries contributing to upcoming shipments
Over 400 boxes of materials have arrived from the campus libraries for the summer Google shipments. Ann Snowman, Lisa German, and Trish Notartomas would especially like to thank David Van de Streek for coordinating the CCLs’ contributions.
April 23, 2009 - Update on shipments
The first shipment to Google has returned to Cato II. The Dewey-classified items from that shipment will remain there, while LC materials are in the process of returning to Pattee. The second shipment left Cato II on April 15. Planning is underway for the third shipment on May 13. In Pattee/Paterno, post-1964 materials that were omitted from the first two shipments are being readied. The third shipment will also include items from the UP subject libraries, as well as the campus libraries. Approximately 200 boxes of materials have arrived via IDS from the campuses.
April 13, 2009 - Preparing for next shipment, shifting focus for summer shipments
The Google Team is currently mustering materials for the second shipment, which will leave Cato II on Wednesday. This shipment will contain pre-1964 items from Pattee stacks levels B, BA, and 1A, as well as from Cato Park. The first shipment will return to Cato II on Friday.
As the pre-1964 requirement for shipped items will be lifted after the second shipment, and the spring semester ends May 8, planning is underway to include high-demand items in the May, June, and July shipments. This will include reference and reserve materials. All University Park and campus libraries are encouraged to submit materials.
April 6, 2009 - Barcoding Team progress report
The Barcoding Team, including Jackie Dillon-Fast and Christopher Lemery, have returned to Academic Activities after spending two weeks barcoding titles in the Pattee stacks. They report:
During our recent two-week field trip to Pattee, we [the Barcoding Inventory Project team] processed 3,067 items for the Google Pick List. Adding the Pattee numbers to the AA numbers for 11/06/08 through 3/11/09 (when we went to Pattee), the Annex-based BIP Team has processed 12,360 items since early November.
April 2, 2009 - Test shipment books available in Google Books
At least three of the test-shipment titles sent to Google in February are now available in Google Book Search in “snippet” view. To fnd them, go to the Advanced Search, enter the full title in the Title search field and “pennsylvania state university libraries” in the Find results with the exact phrase field. The titles are:
- A history of the Lebanon valley in Pennsylvania
- Old Dansbury (now Stroudsburg, Penna.) and the Moravian mission
- Story of old Allegheny city
Five other titles in the test shipment have also recently reached snippet view in Google Book Search, although it is unclear if these were digitized from the University Libraries’ copies:
- Chambersburg, its record and its prospect : an economic history and inventory
- Pioneering with Sullivan County pioneers (1953 edition)
- Brush Creek tales
- Pennsylvania banking and building & loan codes, 1933
- Old Zionsville
- Eagles Mere and the Sullivan Highlands : origin and history of the region : illustrations and descriptive text of the vacation-land of Pennsylvania
March 19, 2009 - Stanley Katz explains "Why There's No Free Lunch in Cyberpublishing: Take Two"
According to a recent issue of Princeton Weekly, in the late 1980s, Katz went to Congress with a modest proposal: Why not allocate a billion dollars to digitize the contents of the nation's academic libraries? People thought he was crazy. They couldn't imagine why you'd want to do all that digitizing. Even Katz couldn't have projected the revolutions soon to come. Congress chose not to act — but today Google has undertaken the project in collaboration with over two-dozen libraries around the world, including the Penn State University Libraries.
In his presentation, Katz will discuss the proposed Google book settlement and its potential impact on readers, libraries and publishers.
Click here for the full story on Penn State Live
Click here to view the presentation on Mediasite Live (search "cyberpublishing")
March 18, 2009 - Project team members attend Google settlement symposium
Ann Snowman and Mike Furlough, along with Patrick Alexander and Kate Novak of Penn State Press, recently participated in a discussion of “The Google Books settlement: What will it mean for the long term?” at Columbia Law School. Topics included the effects of the settlement on orphan works legislation, the publishing industry, and the individual reader/researcher.
Click here for Mike Furlough's notes (.pdf) (staff only)
March 18, 2009 - First shipment of books sent to Google
A large truckload of materials from the second and third floors of the stacks and West Pattee were shipped from Cato II to Google last Wednesday. Book-pulling personnel will now shift their efforts to Pattee-1A. The next shipment to Google is scheduled for April 15. Last week’s shipment will be returned two days later.
Google Team co-director Lisa German explains that, "this initiative supports our strategic planning goals of accelerating the transition to a digital collection and achieving our goals through partnership and collaboration. To this end, selected books from all of Penn State's campuses will be scanned and added to the Google database."
Trish Notartomas, along with Ann Snowman and Lisa, would like to thank the following people who were instrumental in meeting this first deadline: