Re: Corpora: Summary: Corpus metadata

From: Steven Bird (sb@unagi.cis.upenn.edu)
Date: Mon Jun 24 2002 - 18:01:41 MET DST

Next message: sattar.izwaini@stud.umist.ac.uk: "Corpora: Arabic computer texts"

Previous message: Lee Gillam: "Corpora: TKE 2002 Workshop - Call for Papers"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

Mikko Lounela wrote:
> about two weeks ago I posted a query about corpus metadata. I also
> promised to post a summary. Thank you very much for the answers (total
> 8), and here is the summary.

Two of these messages mentioned OLAC, the Open Language Archives Community.
The Linguistic Data Consortium now documents all of its corpora using the
OLAC metadata set. Other language resource institutions are involved,
including ATILF, DFKI, ELRA, LINGUIST, SIL, and more than a dozen others.

The benefits of using OLAC metadata are that it is very easy to use and
the infrastructure for indexing and search is already in place. Please see
www.language-archives.org for full details.

Steven Bird

--
Steven.Bird@ldc.upenn.edu  http://www.ldc.upenn.edu/sb
Assoc Director, LDC; Adj Assoc Prof, CIS & Linguistics
Linguistic Data Consortium, University of Pennsylvania
3615 Market St, Suite 200, Philadelphia, PA 19104-2608

Next message: sattar.izwaini@stud.umist.ac.uk: "Corpora: Arabic computer texts"
Previous message: Lee Gillam: "Corpora: TKE 2002 Workshop - Call for Papers"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

This archive was generated by hypermail 2b29 : Mon Jun 24 2002 - 18:05:55 MET DST