Re: English corpora - ICE and LOB

Nick Smith (nick@comp.lancs.ac.uk)
Tue, 7 May 1996 10:34:37 +0100 (BST)

Imran

> a)Is there a corpus organised in the same structure as LOB but uses texts
> written/published in the 80s-90s?

We have compiled (at Lancaster) a BNC 'sampler' corpus of about 2 million
words, half written, half spoken. The written component roughly reflects the
categories in the larger BNC of 100M words - e.g. Informative (Science,
Commercial, Arts, World affairs, etc.) and Imaginative. The texts are very recent,
nearly all from the period you mentioned.
Oxford University Computing Services are handling distribution of this corpus,
with release expected shortly.
http://info.ox.ac.uk/bnc/ gives more info on the BNC in general.

Nick Smith
UCREL
Lancaster University.