Corpora: sampling

Magnus Ljung (ljungm@engelska.su.se)
Fri, 30 Oct 1998 11:08:02 +0100

Query:

I have a query regarding sampling of newspapers over a whole year. If
you want to create a representative corpus consisting of a sample of
issues of, say, The Independent or The New York Times for a certain year,
what is the best sampling method? I know that Allan Bell (The Language
of News Media) recommends what he calls a 'constructed week', but I am
not sure just how to go about setting up such a week. I would be most
grateful for suggestions.
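
To make the question concrete: one common reading of a 'constructed week' is
that the issues of the year are grouped by day of the week, and one Monday,
one Tuesday, and so on are then drawn at random, giving seven issues that
cover every weekday once. The Python sketch below assumes that reading. The
function name `constructed_week` and the `seed` parameter are hypothetical,
and this may not be exactly the procedure Bell has in mind.

```python
import random
from datetime import date, timedelta


def constructed_week(year, seed=None):
    """Pick one random date per weekday (Monday..Sunday) from the given year.

    Assumption: a 'constructed week' means one randomly chosen Monday,
    one randomly chosen Tuesday, etc., each drawn independently.
    """
    rng = random.Random(seed)

    # Group every date of the year by weekday (0 = Monday, 6 = Sunday).
    by_weekday = {d: [] for d in range(7)}
    day = date(year, 1, 1)
    while day.year == year:
        by_weekday[day.weekday()].append(day)
        day += timedelta(days=1)

    # Draw one date from each weekday group.
    return [rng.choice(by_weekday[d]) for d in range(7)]


week = constructed_week(1998, seed=0)
for d in week:
    print(d.strftime("%A %Y-%m-%d"))
```

Calling the function several times with different seeds would give several
constructed weeks. For a paper with no Sunday issue, the Sunday date would
simply be dropped.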

Magnus Ljung
Dept of English
Stockholm University