Re: [Corpora-List] English-Spanish Medical Corpora

From: Dominic Widdows (widdows@maya.com)
Date: Wed Feb 07 2007 - 18:49:45 MET

  • Next message: Alexander Osherenko: "[Corpora-List] Corpus Benevolence"

    Hi Olivier,

    I wasn't previously aware of a large collection of reports that
    included versions in Arabic, Chinese, English, French, Russian and
    Spanish. This would be a great resource to use for a variety of
    experiments: do you know if there is a some part of these sites where
    you can request bulk downloads?

    If not, would it be possible for someone to write a spider and host
    the corpora somewhere else for bulk download? Would there be
    copyright issues, and if so could these be negotiated? If you have
    experience of doing this or any suggestions, I would be very interested.

    Best wishes,
    Dominic

    On Feb 7, 2007, at 12:30 PM, Olivier Kraif wrote:

    > Hi Mario,
    > you can find the WHO reports in both languages (and even in
    > Chinese , Arabic, Russian and French). The reports can be
    > downloaded in pdf from this url :
    > http://www.who.int/whr/previous/es/index.html
    > If you need already processed and aligned reports in English and
    > French, I can send you some texts.
    >
    > You may also have a look to the UN records : http://unbisnet.un.org:
    > 8080/ipac20/ipac.jsp?profile=bib&menu=search&submenu=power#focus
    > A lot of texts are available online, in the latter languages, and
    > some texts concern medical subjects.
    > Texts can be downloaded in PDF (and even in DOC format, if you
    > change something in the URL :-).
    >
    > Regards
    >
    > Olivier
    >
    >
    >> Dear all,
    >>
    >> I am student of Msc Language Technology in Saarland University. I
    >> am looking for a English-Spanish medical corpora or, failing that,
    >> papers, articles, any kind of publication... where you can find
    >> English-Spanish medical texts aligned (like, for example,
    >> abstracts in both languages). I hope someone can help me. Thank
    >> you in advance,
    >>
    >> Mario
    >>
    >>
    >>
    >>
    >
    >
    >



    This archive was generated by hypermail 2b29 : Wed Feb 07 2007 - 19:08:28 MET