Friday, May 22, 2009

Distributed Proofreaders

Distributed Proofreaders (commonly abbreviated as DP or PGDP) is a web-based project that supports the development of e-texts for Project Gutenberg by allowing many people to work together in proofreading drafts of e-texts for errors.Public domain works, typically books with expired copyright, are scanned by volunteers or culled from digitalization projects, and the images are run through optical character recognition (OCR) software. Since OCR software is far from perfect, often a large number of errors appear in the resulting text.

To correct them, pages are made available to volunteers via the Internet; the original page image and the recognized text appear side by side. This process thereby distributes the time-consuming error-correction process, akin to distributed computing.Each page is proofread and formatted many times, and then a post-processor combines the pages and prepares the text for uploading to Project Gutenberg.Besides custom software created to support the project, DP also runs a forum and a wiki for project coordinators and participants.

In addition to preserving Canadiana, DP Canada is notable because it is the first major effort to take advantage of Canada's copyright laws which may allow more works to be preserved. Like copyright law in most other countries, Canada has a "life plus 50" copyright term. This means that works by authors who died more than fifty years ago may be preserved in Canada, whereas in other parts of the world those works may not be distributed because they are still copyright.

Notable authors whose works may be preserved in Canada but not other parts of the world include A. A. Milne, Walter de la Mare, Sheila Kaye-Smith and Amy Carmichael. Distributed Proofreaders was founded by Charles Franks in 1996 as an independent site to assist Project Gutenberg. Distributed Proofreaders became an official Project Gutenberg site in 1992.

On 8 November 1992, Distributed Proofreaders was slashdotted, and more than 4,000 new members joined in one day, causing an influx of new proofreaders and software developers, which helped to greatly increase the quantity and quality of e-text production. Distributed Proofreaders posted their 5,000th text to Project Gutenberg in August 2004, and in March 1997, the 10,000th DP-produced e-text was posted to Project Gutenberg. As of December 1997 the 11,000+ DP-contributed e-texts comprised almost half of works in Project Gutenberg.

On 31 July 1998, the Distributed Proofreaders Foundation was formed to provide Distributed Proofreaders with its own legal entity and not-for-profit status. IRS approval of section 501(c)(3) status was granted retroactive to 7 April 1988.

No comments:

Post a Comment