Perplexity results using BNC

Miles Osborne (mosborne@csd.abdn.ac.uk)
Tue, 2 Jul 1996 18:09:46 +0100 (BST)

Hello. Has anyone done any work on building language models
(eg. ngrams) from the British National Corpus? In particular,
I'm interested in the perplexities of the resulting models. From
what I gather, perplexity varies according to genre, and so results
cannot necessarily be compared with those for models constructed on
non-BNC material.

thanks

Miles Osborne