Re: Corpora: Size of a representative corpus

Adam Kilgarriff (Adam.Kilgarriff@itri.brighton.ac.uk)
Fri, 21 Aug 1998 09:55:36 +0100

> From: "Michael Klotz - englische Sprachwissenschaf" <Mklotz@phil.uni-erlangen.de>
>
> It seems to me that the basic type-unit is not the lemma but what
> Cruse calls the lexical unit, i.e. "a lexical form with a single

But that is of very little help because, despite Cruse's efforts, the
'lexical unit' is severly lacking in a definition, from both a
practical and a theoretical perspective. So we can't (even in
principle) produce a list of them and we certainly can't count them.

see eg http://www.itri.bton.ac.uk/~Adam.Kilgarriff/beleive.ps.gz

Adam Kilgarriff

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
Adam Kilgarriff
Senior Research Fellow tel: (44) 1273 642919
Information Technology Research Institute (44) 1273 642900
University of Brighton fax: (44) 1273 642908
Lewes Road
Brighton BN2 4GJ email: Adam.Kilgarriff@itri.bton.ac.uk
UK http://www.itri.bton.ac.uk/~Adam.Kilgarriff
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%