Is there anyone developing corpus sets that are particular to specific
publications?
We are interested in learning about American English language newspaper
vocabularies and how they map to specific subject categories -specifically
the subject classifications found in NewsML. In our ideal world such
corpora would map to specific newspaper sections (the Boston Globe Business
section, the Chicago Tribune National News section, etc.). Our goal is to
track the differences in vocabularies used by specific publications to
describe common events and determine how these vocabularies differ from
academic journals and radio/TV articles on the same or closely related
subject areas.
Jack Bryar
This archive was generated by hypermail 2b29 : Tue Jun 21 2005 - 22:07:38 MET DST