- Next message: Alberto Manuel Brandão Simões: "Re: [Corpora-List] Word frequencies in English, French, German, Spanish, Dutch, Italian and Portuguese"
- Previous message: Yorick Wilks: "[Corpora-List] Word frequencies in English, French, German, Spanish, Dutch, Italian and Portuguese"
- In reply to: Yorick Wilks: "[Corpora-List] Word frequencies in English, French, German, Spanish, Dutch, Italian and Portuguese"
- Next in thread: Alberto Manuel Brandão Simões: "Re: [Corpora-List] Word frequencies in English, French, German, Spanish, Dutch, Italian and Portuguese"
Hi,
Please have a look at http://corpora.informatik.uni-leipzig.de/download.html
You will find frequency lists as plain text (words.txt) and as MySQL data
files (words) (sorry, not for Portuguese at the moment), calculated from
corpora of 100,000 to 3,000,000 sentences, depending on the language.
In addition, you can get the corpora and pre-calculated co-occurrences.
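A minimal sketch of reading such a plain-text frequency list, in case it helps. The column layout is an assumption, not something stated above: the code assumes tab-separated lines of the form id, word, frequency, and UTF-8 encoding. Please check a downloaded words.txt and adjust `word_col`, `freq_col`, `sep` and the encoding as needed. The function names are hypothetical.

```python
from typing import Iterable, List, Tuple


def parse_frequency_lines(
    lines: Iterable[str],
    word_col: int = 1,   # assumed position of the word field
    freq_col: int = 2,   # assumed position of the frequency field
    sep: str = "\t",     # assumed field separator
) -> List[Tuple[str, int]]:
    """Return (word, frequency) pairs sorted by descending frequency."""
    entries = []
    for line in lines:
        fields = line.rstrip("\n").split(sep)
        # Skip lines that do not have enough columns.
        if len(fields) <= max(word_col, freq_col):
            continue
        try:
            freq = int(fields[freq_col])
        except ValueError:
            # Skip lines whose frequency field is not an integer.
            continue
        entries.append((fields[word_col], freq))
    entries.sort(key=lambda entry: entry[1], reverse=True)
    return entries


def load_frequency_list(path: str, **kwargs) -> List[Tuple[str, int]]:
    """Read a frequency list from a file (UTF-8 is an assumption)."""
    with open(path, encoding="utf-8") as handle:
        return parse_frequency_lines(handle, **kwargs)
```

For example, `load_frequency_list("words.txt")[:20]` would give the 20 most frequent entries, provided the file matches the assumed layout.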
Regards,
Uwe Quasthoff
Yorick Wilks wrote:
> Does anyone know easily accessible sources of these?
> Yorick Wilks
> Sheffield
>
This archive was generated by hypermail 2b29: Mon Feb 12 2007 - 18:03:18 MET