Re: Corpora: corpora history revisited

Philip Resnik (resnik@umiacs.umd.edu)
Tue, 2 Mar 1999 11:34:37 -0500 (EST)

> Has anybody heard of any research on, or references to, the Bible as a
> special corpus of texts?

You might be interested in some recent work on treating the Bible as a
multilingual corpus for computational purposes:

Philip Resnik, Mari Broman Olsen, and Mona Diab, "Creating a
Parallel Corpus from the Book of 2000 Tongues", Text Encoding
Initiative 10th Anniversary User Conference (TEI-10), Providence,
November 1997. http://umiacs.umd.edu/~resnik/pubs/tei97.ps.gz

Philip Resnik, Mari Broman Olsen, and Mona Diab, "The Bible as a
Parallel Corpus: Annotating the 'Book of 2000 Tongues'", to appear
in Computers and the Humanities.

Tapas Kanungo and Philip Resnik, "The Bible, Truth, and Multilingual
OCR Evaluation", in Proc. of SPIE Conference on Document Recognition
and Retrieval (VI), San Jose, CA, 27-28 January, 1999.
http://www.cfar.umd.edu/~kanungo/pubs/bible.ps

I. Dan Melamed, "Manual Annotation of Translational Equivalence:
The Blinker Project", IRCS Technical Report #98-07, U Penn,
1998. ftp://ftp.cis.upenn.edu/pub/melamed/papers/blinker.ps.gz

I would be very interested in hearing about similar projects.

Philip

----------------------------------------------------------------
Philip Resnik, Assistant Professor
Department of Linguistics and Institute for Advanced Computer Studies

1401 Marie Mount Hall            UMIACS phone: (301) 405-6760
University of Maryland           Linguistics phone: (301) 405-8903
College Park, MD 20742 USA       Fax: (301) 405-7104
http://umiacs.umd.edu/~resnik    E-mail: resnik@umiacs.umd.edu