Re: [Corpora-List] REUTER corpus online?

From: Serge Sharoff (s.sharoff@leeds.ac.uk)
Date: Wed Nov 16 2005 - 11:57:39 MET

  • Next message: joshua raclaw: "[Corpora-List] AAVE Corpus."

    Luis,

    we have an online interface to the Reuters corpus (indexed by
    CorpusWorkbench). It's available from:
    http://corpus.leeds.ac.uk/

    Because of the agreement with Reuters the access is mostly limited to
    inhouse research. However, we can provide a password for
    research-related concordancing.

    As for Portuguese, if you have a reasonable list of words frequent in
    Portuguese newswires and a tagger/lemmatiser, a corpus like this can be
    collected from the web. See the Wacky initiative:
    http://wacky.sslmit.unibo.it/

    Best wishes,
    Serge

    On Tue, 2005-11-15 at 10:45 -0500, Luis Sarmento wrote:
    > Dear Corpora-List members,
    >
    >
    >
    > Does anyone know if there is any publicly available online version of
    > the reuters corpus? In other words, is there any web concordace tool
    > (free) for the Reuters Corpus?
    >
    > Btw, I wonder if there are comparable versions of the reuters corpus
    > available, namely in Portuguese, for bilingual studies. Is anyone
    > using "comparable" version of reuters in Portuguese?
    >
    > Thanks to all,
    >
    >
    >
    > Lus Sarmento
    >
    >
    >
    >
    >

    -- 
    Dr. Serge Sharoff
    Centre for Translation Studies
    School of Modern Languages and Cultures
    University of Leeds
    Leeds, LS2 9JT
    

    tel: +44(0)113 343 7287 fax: +44(0)113 343 3287



    This archive was generated by hypermail 2b29 : Wed Nov 16 2005 - 12:24:15 MET