Re: Corpora: corpora history revisited

Mari Broman Olsen (molsen@umiacs.umd.edu)
Tue, 2 Mar 1999 08:57:27 -0500 (EST)

Philip Resnik, Mona Diab and I have an article coming out in
Computers and the Humanities describing our growing collection of
Bibles and their TEI-compliant annotation. In that article we suggest
several uses for the Bible as a multilingual parallel corpus,
including a 'seed' translation lexicon for Machine Translation.
Philip and Tapas Kanungo (University of Maryland) are
also working on Optical Character Recognition using this corpus.

********
Mari Broman Olsen, Research Associate
University of Maryland Institute for Advanced Computer Studies
3141 A.V. Williams Building
University of Maryland
College Park, MD 20742

EMAIL: molsen@umiacs.umd.edu
PHONE: (301) 405-6754 FAX: (301) 314-9658
WEB: http://www.umiacs.umd.edu/~molsen
*********

> I would like to continue the history of corpora discussion.
>
> Has anybody heard of any research or references on the Bible as
> a special corpus of texts?
> ---
> YS Vladimir Rykov, PhD in Computational Linguistics
> www.blkbox.com/~gigawatt/rykov.html Linguistic Institute
> WWW.GOL.RU/~iling 1/12 B.Kislovsky per., Moscow, 103009
> M_M_M_M_M_M_M_M_M_M_M_M KREMLIN WALL IS WHERE YOU MAKE IT !