Re: Corpora: T-score in collocational analysis

Pete Whitelock (pete@sharp.co.uk)
Thu, 09 Dec 1999 10:06:50 +0000

For use of t-score in collocations:

Take a look at Manning and Schuetze's book, specifically the chapter on
collocations
the latter is available for download from:

http://www.sultry.arts.usyd.edu.au/fsnlp/

Przemyslaw Kaszubski wrote:
>
> Regards to to all the subscribers,
>
> Two questions:
>
> 1. Can anyone explain (or point to a Web source or otherwise easily available
> source apart from the Church, K.W,, W. Gale, P. Hanks & D. Hindle "Using Statistics in
> Lexical Analysis" in <italic>Lexical Acquisition: Using On-Line
> Resources to Build a Lexicon</italic>. Ed. Uri Zernik. Hillsdale:
> Lawrence Erlbaum, 1991)
> the use of the t-score statistic in collocation retrieval? I mean the
> one used by Cobuild. How does the formula work? I am familiar with
> MI and Z-scores but the t-score seems to be
> in use only in the CobuildDirect service.
>
> 2. Do you know of corpus analysis
> packages available for researchers that employ this t-score?
>
> I do small corpus research and I am basically after a tool with a statistic that does not favour rare words as much as the MI does. So far TACT's z-scores seem the best option.
>
> Przemek Kaszubski

E-mail: pete@sharp.co.uk \ Pete Whitelock
Internet: http://www.sharp.co.uk \ Sharp Laboratories of Europe Ltd
phone: +44 (0)1865 747711 \ Oxford Science Park
fax: +44 (0)1865 714170 \ Oxford, OX4 4GA, England

The Law of Detail: Nothing is so simple that there is not a stupid way
to do it.