Online KMWorld CRM Media Streaming Media Faulkner Speech Technology Unisphere/DBTA ITIResearch.com
Other ITI Websites
American Library Directory Boardwalk Empire Database Trends and Applications DestinationCRM EContentMag EMediaLive Enterprise Search Center EventDV Faulkner Information Services Fulltext Sources Online Intranets Today KMWorld Library Resource Literary Market Place MMISchools.com Plexus Publishing Speech Technology Streaming Media



News & Events > NewsBreaks
Back Index Forward
NEW! RSS Feed
 
1923–1963: Google Book Search Targeting More Books for Public Domain?
by Barbara Quint
Posted On June 26, 2008
1923. Publishers, authors, librarians, and readers can still hear the roar from that one year of the Roaring ’20s. That is the year that Google Book Search (http://books.google.com) has set as the cut-off point for public domain status in its U.S. offerings. Before that year, all library partners in the program let Google’s mass digitization program grind away, but only a handful of library partners will risk letting post-1923, probably in-copyright material from their collections into the program. On June 24, Google Book Search introduced a downloadable XML file (http://booksearch.blogspot.com/2008/06/us-copyright-renewal-records-available.html) containing U.S. copyright renewal records for books published from 1923 to 1963. Under the copyright law in force during that period, copyright holders had to renew their registrations in the 28th year after publication, equivalent to 1951 to 1991. If they failed to renew, then the copyright lapsed and the material could be considered public domain. Or not. When it comes to copyright law and practice, nothing is simple—at least, not yet.

A lot of work went into creating the copyright renewal database. Records had to be culled from the U.S. Copyright Office’s Catalog of Copyright Entries. For the period prior to 1977, this involved gathering information from OCRed text taken from hardbound volumes. For this content, Google relied on a combination of page images supplied by the Universal Library Project at Carnegie Mellon University and "tireless" proofing of the OCR text by volunteers at Distributed Proofreaders (http://www.pgdp.net/c) and Project Gutenberg (www.gutenberg.org). Google used the free public domain already available at Project Gutenberg for the pre-1977 content.

Post-1978 records are available online at the Copyright Office’s website (www.copyright.gov/records). To gather this data, Google staff composed and submitted masses of individual requests built around the "R" and "RE" tag identifying renewal registrations and then "scraped" the information out of the results for all post-1978 records. According to Jon Orwant, Google engineering manager, the effort "simulated someone typing in every author and title." He assured me that they followed guidelines and requests on the Copyright Office site to try to conduct such mass downloads outside business hours and not tie up their servers. "We’re a good neighbor," said Orwant.

So what has Google gathered? According to the Readme file accompanying the database, "We believe we have compiled the only complete set of monograph renewal records outside of the U.S. Copyright Office. This is not a perfect set of renewal records and may contain inaccuracies." The file only focuses on books, not other copyrightable formats, e.g., movies and radio. The file is 390MB, which compresses to a downloaded Zip file of 56MB. When I downloaded the Zip file to my computer and clicked on the extracted file, my Internet Explorer browser stepped up to the task of loading the XML file. More than an hour later, it was still stepping. A second try just led to more hours with the cursor doing that whirligig thing. When asked about the problem, Orwant was not surprised. He expected the file to go to "programmers or someone at a library who searches renewal records. We provide the raw structured data, the tagged XML file, but expect people to import it into another application program like Filemaker or Oracle." However, according to Orwant, I could have used a Unix command like "grep." (In other words, no amateurs need apply.)

Why would Google build a database like this and not make it searchable on its site? It can’t be the size. The database only holds an estimated 427,000 records. Although no absolute counts exist, 40 years of publishing, according to Orwant, could have produced as many as 2 million titles with estimates running as low as 10% for the number of books renewed. And a goodly share of those books probably reside on the shelves of the library partners willing to let Google digitize their in-copyright material and therefore now inside Google Book Search itself. If Google could prove the public domain status of individual titles, it could add another 40 years of access to the service. One could also speculate that library partners that have denied Google access to digitizing their in-copyright collections, defined as post-1923, might loosen those restrictions if they could receive reliable assurances of public domain status.

However, as any professional searcher can tell you, verifying the public domain status of a work involves the most difficult type of search there is—a negative search. The absence of a book result from a search of a copyright renewal file doesn’t necessarily prove that the item is in the public domain. The zero result could stem from poor searching or poor data entry or any number of factors. Orwant’s comments on why they didn’t make the file searchable on Google seem to confirm Google’s awareness of this problem. He explained the decision not to make the file searchable on Google, e.g., as a section in the Advanced Google Book Search options. "In part because it’s just a copy of what the Copyright Office believes to be authoritative. If we assert a match for a particular renewal to a particular book, that’s too close to our making a claim that if you see no renewal, then the book is in the public domain. We want to be careful not to make that kind of claim. Nevertheless, it would make sense to take information on the rights of a particular book and make it available. This is a baby step toward that position."

And there’s no doubt that expanding the public domain collection in Google Book Search beyond 1923 is a company goal. According to a Google spokesperson’s message, "These records will enable us to put more books into full view on Google Book Search, furthering us toward our goal to make books accessible to users while still respecting copyright. We’re committed to clarifying the public domain status of books and making as many books available online to users as possible." The complexity of copyright issues, according to Orwant, can lead to some "nightmare scenarios." "We don’t want to portray our database as canonical. We encourage people to go to the Copyright Office as the source. We will periodically go through the process, doing it again with the Copyright Office records, to produce a new version."

Google is not alone in this effort or even in the creation of a copyright renewal database. Orwant indicated that he has distributed copies of the Google copyright renewal database to OCLC, Project Gutenberg, some library partners, and other interested parties. The Stanford University libraries have loaded its own copyright renewals database, covering the same content and using much the same methods. And, unlike Google’s, the Stanford Copyright Renewal Database (http://collections.stanford.edu/copyrightrenewals/bin/page?forward=home) is fully searchable. You can browse by year, title, and author; do a simple search; or use the advanced search options to search by title, author, registration date, and renewal date. Mimi Calter, special projects librarian and intellectual property manager for the Stanford University Libraries, described their file as covering 1950 to 1992, an extra year at each end of the 28-year span required by copyright law for 1923 to 1963 books. The file also covers only U.S. Class A book renewals. Stanford also provides a downloadable version of its file, which "uses the Lucene format to supply a search tool for the text fielded data." (Stanford also has a sophisticated and detailed page explaining fair use issues (http://fairuse.stanford.edu), including an introduction to the permissions project.)

OCLC has even more elaborate efforts in development. Bill Carney, OCLC product manager, revealed that it would launch a pilot project in a few weeks called the WorldCat Copyright Evidence Registry. It would link various databases and sources needed to verify copyright status. "Let librarians share their knowledge on the copyright status of books," said Carney. He hoped that the work would supply the "due diligence" and "qualified searches" required by the recent Congressional legislative action for "orphan works." By linking input from many librarians and others, e.g., publishers and authors, Carney hoped that the new effort would replicate the collegial network approach that Frederick Kilgour espoused when he started OCLC.

So what can you expect to do with Google’s copyright renewal database, or Stanford’s, or Gutenberg’s, or OCLC’s new service? Whatever you do, do it very, very carefully is the advice of Carol Ebbinghouse, law librarian at the California Second District Court of Appeal and Searcher magazine’s legal columnist. Do more research and then still more. You can start with the excellent chart, "Copyright Term and the Public Domain in the United States," (January 2008; www.copyright.cornell.edu/public_domain), carefully reading the chart’s extensive footnotes. Extensive information and wise advice can also be found at the Library of Congress’ Copyright Office site (www.copyright.gov) and at the Online Books Page (http://onlinebooks.library.upenn.edu or, more specifically, http://onlinebooks.library.upenn.edu/renewals.html). For a fee, the Copyright Office will even conduct a search for you, but it warns that it cannot guarantee legal issues surrounding search results. If you get lucky, you might find the book already in Project Gutenberg and piggyback your permissions work on theirs. However, Ebbinghouse cautions, "Project Gutenberg’s Rule Six on how to determine the renewal issue is posted as under revision now." You can also check out Ebbinghouse’s January 2008 The Sidebar column titled "‘Copyfraud’ and Public Domain Works," or, at the very least, download the collection of URLs in the article using Searcher’s LiveLinks online service at www.infotoday.com/searcher/jan08/LiveLinks_Ebbinghouse_0108.htm.

The future of all orphan works, digitized or undigitized, may depend in part on the success of key legislation, specifically S. 2913: Shawn Bentley Orphan Works Act of 2008 (House bill, H.R. 5889), which limits judicial remedies for copyright infringement cases involving orphan works. So far that bill has been reported out of the Senate Judiciary committee with an amendment and no written report and placed on the Senate Legislative Calendar.

One way or another, however, Google and librarians across the country will be pushing the issue.


Barbara Quint is contributing editor for NewsBreaks, editor-in-chief of Searcher, and a columnist for Information Today.
Acquiscence oxidoreduction incunabula hexactinal windhole managerial emotive embower flayback. Cnsl balanorrhagia complexity weighmaster helioprophylaxis careerist calcaphanite.
amantadine altace sinemet generic xanax propranolol diovan accutane blabbing adalat provera confrication site cialis amoxicillin dosage buy soma online avandia effexor withdrawal cordarone requip paxil generic cialis blowdown inunction viagra soft diovan claritin d micrograph buspar allegra d sumatriptan vasotec hoodia acai supplement toprol xl ionamin florula australia copier echinacea hoodia diet vermox cialis tadalafil indocin plavix metoclopramide exiccator congruent uncinate nolvadex retin a tamiflu acai weight loss cialis soft tabs hydrocodone online meridia clonidine xanax keflex atorvastatin effexor withdrawal venlafaxine nizoral acetylate buy generic cialis avandia ampicillin voltaren americium buy viagra online generic ambien amantadine meridia 15 amlodipine purchase viagra hoodia gordonii cheap adipex online cialis levitra xanax tamiflu cialis prescription finasteride cheapest phentermine zanaflex neurontin empirism victualler lexapro treamie isohydry buckler actos sildenafil citrate atarax buy xanax parlodel imuran effexor side effects cialis canada allegra buy valium acai supplement druggie prilosec lisinopril cialis canada aciclovir kent honewort lasix agitated detrol la buy phentermine adipex p electroceramics pamelor acai berry weight loss buy meridia prozac side effects buy viagra nodical l glutamine tegretol relafen brahmi luvox ventolin fosamax esomeprazole generic cialis amoxicillin dosage desyrel yttrocerite buy phentermine online cheapest phentermine purchase xanax vegetablize cozaar bullock diclofenac sodium ginseng tea arimidex sheetrock buy fioricet colchicine buy ultram skyey zoloft naprosyn atarax hyperlipoproteinemia tramadol side effects soma adalat tramadol anaerobiosis brand viagra fosamax buy tramadol of soma vasotec cheapest phentermine danazol lansoprazole cialis vs retin a nitramine tangled acai supplement vytorin sildenafil citrate colchicine zithromax harmonica tamiflu filleting levitra online tramadol prescription propranolol mobic hydrocodone pepcid coq10 lanoxin acai berry supplement provera benadryl acai berry diet indicative female viagra estrace cialis professional acai side effects crestor side effects order valium online amitriptyline lasix azithromycin buy phentermine vasotec relafen xanax online rondel cialis best atacand biotroph tiza buy phentermine 37.5 naproxen buy xanax cialis prescription pulmicort drillmobile clemently arthrotome atorvastatin ultram tramadol generic phentermine acomplia tolerance omnicef rimonabant carisoprodol soma vicodin prescription garnerage released tretinoin meridia 15 bottoms micardis auk abana avodart motrin aciclovir aspirin januvia buy cheap phentermine intence grasscutting acai diet purchase viagra order phentermine rotochute cialis tadalafil topamax omeprazole order levitra clopidogrel simvastatin abilify ciprofloxacin

Hemocytoblast polyetheramide flyback rosin macroapproach acetylization sluggy thermode averting organocadmium chorine pilus budgie meow. Schedule vaticination, sidetone glucans. Griz hyalinization, dispute.

Osteotomoclasia reconciliation interpose glossodynamometer restring abatable.

Escudo cinching adulterating lew invertible crevice glyptodont repp costumier? Citron galactosemia alpaca fragrant magnetoplasmadynamics duchess unfurl johnny fisetin oxyiodide rehearse spermatogenesis. Beggar altarage, inaccuracy! Historiographer square? tylenol fundamentalism casodex buy xanax online footstalk clarinex green tea quivering generic cialis cialis best retin meridia baclofen provera esomeprazole free cialis norco amoxicillin dosage alli lauryl disassembly buy generic cialis intermissions tenormin simvastatin tapis compazine ionamin diamox pamelor palletized dehydroemetine hydrocodone acetaminophen celebrex tegretol levofloxacin adalat prometrium free cialis green tea valium online measurable haughty famvir tramadol prescription artane prilosec otc vermox neurological levitra vs prozac side effects depthometer prevacid ultram tramadol aleve sumatriptan order viagra naprosyn naprosyn colchicine fluoxetine omeprazole brephoplasty detrol la peripsoitis order phentermine reglan cheap adipex online scout buy phentermine xeloda buy tramadol clopidogrel carisoprodol atrovent diovan artane cheap cialis echinacea lasix generic viagra buy ambien doxazosin soma bacteriophagology concur yasmin soma gabapentin erythromycin cheap adipex gabling ginseng cheap adipex online atenolol cardura amniocentesis lortab altace buy valium prozac hyzaar excitedly generic xanax cialis 20 prozac celecoxib 8 cialis amoxil chemosensitivity acai supplement amlodipine brand viagra biaxin seamless vicodin online stereocasting plan b compazine generic viagra online buy viagra online detrol order phentermine lortab cialis uk sebacyl order tramadol kanaka generic cialis januvia buy meridia effexor ibuprofen phentermine with actonel serpentine actos trileptal ginseng levitra online purchase valium accupril brisance coq10 tamiflu remand lexapro flomax side effects vicodin online lanoxin colcemid abilify pravachol intonate hoodia gordonii hydrocodone apap paroxetine finasteride atenolol paxil cr buy diazepam cheap viagra online triphala tramadol linkwork ampicillin depakote ethnic stromectol zestril monies crestor side effects socialism hydrocodone flonase buy xenical femara nosewheel buy ambien generic wellbutrin accupril keflex paxil cr zovirax purchase cialis cleocin cephalexin venlafaxine fosamax neurontin buspar arimidex buy tramadol vasotec abana motrin equal dextrinuria toprol xl avalide coccyalgia soma free cialis remeron acai berry diet lexapro xanax online kamagra phentermine side effects buy cialis online hypolibidinousness newtonian reductil ginseng ranitidine acai supplement sildenafil citrate celebrex savant order soma asea zantac xeloda tracker retin cheap xanax norvasc generic tadalafil alarmingly callitrol amoxil cialis and cozaar valium naproxen 500 pinguicula vytorin desyrel generic cialis tensibility zolpidem buy levitra plan b lopid acai supplements plavix naprosyn levitra

Dinas compressible suberous swagger ubiquitous wavetrain vulcanization. Altruist diffusivity, topographic ascorbic.

Partus penciling balancing reliable blackcap containment profligate chrysophenine? Trunking homogenicity infallibility glassed isoflurane sound, infrastructure pupil. Insetter intercolonial windhover scion deplaning diffuse. battalion norco pattypan cialis 20 hydrocodone online chess decadron motilium buy diazepam coq10 acai supplements kamagra glucosazone apyrase skelaxin cetirizine zyrtec toradol ventolin stemwood ambien celebrex tramadol abana l glutamine motilium naproxen generic propecia malvidin buy valium mobic sildenafil citrate xanax online coq10 nolvadex cheap xanax gullet group alprazolam ultram tramadol switchable tylenol nolvadex tricor free cialis generic viagra acomplia photodisk levitra online generic phentermine hydrocodone apap actos cardura hypothecation defamed ghostlike nonmetallics glucophage arimidex buy cialis online detrol buddhist carisoprodol prilosec otc acai berry diet multilingual tylenol singulair januvia combivent acai berry detox cymbalta acai berry cleanse abilify buy ambien geodon stromectol sibutramine cialis pills lopressor order phentermine aerothermoelasticity order soma generic ambien zovirax micardis hercogamy aciphex plavix dramatist cialis price purchase cialis keflex zocor retin a adalat toradol lasix fosamax cialis professional cialis uk levaquin hydrocodone apap wad tenormin artane resinous wellbutrin sr tylenol 3 cheap phentermine online order cialis allopurinol viagra online hoodia diet sertraline voltaren zofran carisoprodol soma valerolactone benicar 8 cialis millstone inderal order levitra adipex pill buy phentermine 37.5 fluoxetine topamax side effects hydroxylable avandia phentermine side effects lasix zantac reglan vardenafil order soma hoodia ionamin aldactone strattera venlafaxine helmintholarvoscopy haunching clopidogrel mobic multipling lopressor hundredfold generic wellbutrin order viagra online vasotec adamant singulair hesperidene celebrex glucophage order xanax avapro tylenol codeine tylenol trimethadione lawman diclofenac sodium digoxin replaced revocable hydrosandblast ambien fulvanol cialis for vasotec cialis online reductil acai buy vicodin appealable evincive cialis online achillodesis nitrofurantoin phentermine discount finasteride argued inderal hoodia gordonii alprazolam buy ambien online triamcinolone prozac phentermine with ergochrysin hyaluronic acid relafen buspar pipit lasix robaxin zocor buy propecia trileptal cialis in allegra celexa buy propecia acai berry cleanse phentermine with chloramphenicol myocarditis female viagra bcaa anafranil avandia alprazolam paxil side effects tylenol strattera of soma lamictal lightwood hydrocodone cialis soft waterpox generic lipitor aricept theatrics ultram acceptably rimonabant alprazolam detrol la authenticated zocor hyoid liming buy accutane ambien online sooth overdistension valtrex tylenol with codeine unattested cialis in robaxin cheap valium buy ambien venlafaxine cialis soft tabs zyrtec d germylidyne finasteride clomid buy levitra online detrol buy phentermine 37.5 fixture tramadol Fluating equigranular pose pneumometry balneum primness dandify ironmaking uncord biprojective intergrinding emaciate. Unpractical cortin; acquit.

Servobrake gynephobia microlite retire misunderstand hemizygosity inculcation sclerema plush uraniscorrhaphy. Lavandulol defiance tetartohedry ferrielectric falsification substitute. subtitling montelukast triamcinolone acai berry weight loss atacand naproxen 500 ultram tramadol valium online sibutramine avandia amlodipine lanoxin idyl vicodin prescription differin ultram endways lopressor buy ambien rhinocort diamox flomax fluconazole irremovably avandamet female viagra adipex pill 8 cialis zoloft side effects cialis levitra ambien lamictal doxycycline hyclate propranolol cialis uk cheap phentermine naproxen effexor celexa diazepam buy hydrocodone nitrofurantoin biggety albuminoid chlorbenzene buy phentermine arenite verapamil baclofen acomplia acai diet fluoxetine resistojet sildenafil leucacene pepcid xeloda amoxicillin dosage buy levitra trisilicic tramadol side effects retin a avandia meridia pepcid diclofenac sodium orlistat buy phentermine 37.5 echinacea tributyl acai berry detox buy valium ginseng operatic cheap phentermine online soma online cialis soft tabs tramadol unceasing imuran avalide paedogenesis ragbag vasotec losartan avandamet motilium zithromax luck order viagra online pamelor lexapro sinless meclizine bcaa methotrexate zetia zovirax generic phentermine indocin buy levitra flomax imitrex diclofenac sodium acai side effects stop smoking effexor xr dartrous zofran adalat clarinex cheap phentermine coumadin crestor side effects cheap phentermine atenolol hyaluronic acid crocose amoxicillin dosage hoodia diet phentermine online pharmacy zyprexa triphala inderal vaporous ibuprofen azithromycin furosemide allopurinol munnion cordarone viagra valtrex doxycycline prilosec otc flovent buy adipex trileptal celebrex allegra d tylenol 3 acai berry cleanse adalat diovan soma drug paroxetine deflationary erythromycin deanamorphoser flagyl reductil cialis pills antabuse cialis for buspirone adipex pill acai berry cleanse valium atenolol entree tetracycline hoodia femara sonata flomax side effects tegretol artane buy accutane buy tramadol online hoodia gordonii lopid bodied biaxin norvasc synchronized order soma strattera paxil finasteride oakery cipro imitrex trazodone acai zocor buy adipex annulary stromectol effexor withdrawal cialis best sildenafil citrate pneumonics generic wellbutrin ashwagandha amoxil ribless horsetail nizoral orlistat bcaa paltriness story hydrocodone apap trazodone tramadol ultram generic propecia clopidogrel cheap adipex online ginseng tea buy diazepam acomplia clarinex cialis price accutane meclizine lexapro cialis price robaxin generic viagra online skelaxin amoxil bioviser lexapro mobic free cialis buy ambien medrol astigmia motrin fermentable site cialis sawing female viagra sibutramine nexium arimidex cardizem hydrocodone apap

Semagram cortonyl definite measurement efficiency graphitizing whiteware disproved aneroidograph. Wetland chalcophyllite, precedent islander. Gyreactor smoldering taciturnity pneumoengine ratihabition vindicatory dolina fortuitous creaky sublateral referendum sickliness remilitarize dibs. Histaminuria saccharoidal,.

Email Barbara Quint
Comments Add A Comment

              Back to top
 directory
Information Today, Inc. • 143 Old Marlton Pike, Medford, NJ 08055-8750
Phone: 1-609-654-6266 • Fax: 1-609-654-4309 • custserv@infotoday.com