One of last year’s big stories, the publishers’ battle with Google over control of digitized books, has been on the back burner in recent months, but an aggressive move by HarperCollins is pushing it back into the spotlight
In late 2005, Harper, already vocal about its displeasure with Google over the search engine giant’s digital book initiative, announced that it would take its own separate approach, building its own little island, as I wrote at the time.
Since then, we haven’t gotten too many updates on Harper’s progress. On Thursday, however, the publisher announced that it would partner with LibreDigital, a division of newspaper digitizing firm NewsStand, while also making a “strategic investment” in NewsStand, with Harper president Brian Murray joining NewsStand’s board of directors.
We also got an update on how far Harper has progressed over the last year in its efforts to digitize its books. The company’s press release announcing the deal indicates that it has digitized “more than 10,000 books and has enabled the ‘Browse Inside’ application for several thousand.” The WSJ in its writeup (Sub. Req.) puts that total number of books digitized at 12,000, with 2,000 of those being online now. Based on these numbers, the publisher is making progress, if not at the pace of Google, which based on its contract with the California state university library system could be capable of scanning as many as 3,000 books a day. Harper has a backlist of 20,000 books, with 3,500 new titles published each year, and this new effort will likely enable the publisher to finish its digitizing efforts sooner than it would have otherwise. In addition, LibreDigital’s technology will better enable Harper to store and manage these digital editions.
In spite of being at odds with one another, to a certain extent the intentions and efforts of Google and the publishers don’t entirely overlap. As the technology has evolved to facilitate the scanning of large quantities of books, Harper and other publishers are desperate to exert control over the digital versions of their books, allowing them to add value to their catalog by either selling digital books or by using those digital books to entice readers to buy the hard copies. The publishers’ biggest fear is that Google will cannibalize their sales by giving the goods away for free.
Google, meanwhile, is more interested in providing as complete a record of the world’s published work as possible. To be sure, there is a profit motive here – Google has made its billions by helping us navigate the information it organizes for us – but the upside, for readers (and society, even) would be the vast store of human knowledge at our fingertips. The fact that a number of university libraries have cooperated with Google (for the Library Project portion of Google Book Search) would seem to indicate that librarians, who know a thing or two about making information accessible, are enthusiastic about Google’s plan. And, as such, its fairly easy to argue that Google’s book scanning efforts would hurt publishers little more than libraries do. As exciting as Google’s book initiatives could be (and they certainly are pretty good already), it appears as though the dream of a universally accessible online library will be forever hamstrung by publishing companies and copyright law.