Re: [Corpora-List] Corpus Benevolence

From: Alexander Osherenko (osherenko@gmx.de)
Date: Thu Feb 08 2007 - 11:12:01 MET

  • Next message: Alex Murzaku: "Re: [Corpora-List] lexicographic tools for parallel/comparable corpora"

    Eric,

    > "benevolence" is a term I've not heard of before in Corpus Linguistics,
    > but I think you mean something like "relevance" or "appropriateness"
    > to the
    > specific research question...

    I've never heard of the term "benevolence" too... :)

    > One hint when selecting a Corpus is to look for similar studies to
    > yours, and see what Corpus they used; if you use the same corpus, your
    > results can be directly comparable (moreso than if you experiment with
    > different corpora).

    I've already experimented with some corpora in my research area (opinion
    mining) and there are some corpora that were studied before regarding
    analysis e.g. the Pang Movie Review corpus. There is some information
    about it e.g. the corpus contains 1000/1000 negative/positive reviews
    downloaded from the imdb.com website, but it is insufficient.

    What I need is a thorough (e.g. linguistic) study of a corpus. You
    probably know the book by Leech about Frequencies in the BNC corpus.
    Something like that. Besides linguistic information I would like to get
    sociological information about a corpus e.g. how many reviewers took
    part in compiling and so on. Since I assume that I can oversee something
    I wanted to ask first.

    Best,
    Alexander



    This archive was generated by hypermail 2b29 : Thu Feb 08 2007 - 11:07:42 MET