Re: Corpora: frequency lists for clusters & MWU

Tony Berber Sardinha (tony4@uol.com.br)
Wed, 07 Oct 1998 20:07:00 -0300

Dear Przemyslaw and list members,

I've created 2,3,4, and 5-word clusters based on the Brown Corpus and
placed them on:

http://homepages.infoseek.com/~corpuslinguistics/homepage.html

under 'Word lists'.

I will create a similar list for the BNC as well. Please let me know if
you have trouble downloading.

Hope this helps

Tony Berber Sardinha.

Przemyslaw KASZUBSKI wrote:

> Hi,
>
> Another question: Are there frequency lists of English
> (lemmatised/non-lemmatised) 2-3-4-5 word clusters available anywhere,
> preferably retrieved from large balanced corpora? Or
> frequency lists of multi-word-units? Unfortunately, the BNC is not
> accessible from Poland yet ...
>
> I'd be grateful for any help.
>
> Przemek
>
> ==========================================
> Przemyslaw Kaszubski, M.A.
> przemka@amu.edu.pl
> http://elex.amu.edu.pl/ifa/skaszub.htm
>
> MY (ENGLISH) (LEARNER) CORPORA PAGE:
> http://main.amu.edu.pl/~przemka
>
> School of English
> Adam Mickiewicz University
> Al. Niepodleglosci 4
> 61-874 Poznan, POLAND
> tel: +48 61 8528820
> fax: +48 61 8523103
> =========================================