RE: [Corpora-List] structured data (enu | csy) for IE needed

From: Mustafa Abusalah (mustafa@sunderland.ac.uk)
Date: Thu Jan 25 2007 - 10:38:05 MET

  • Next message: Paul Buitelaar: "Re: [Corpora-List] structured data (enu | csy) for IE needed"

    Try ebay, I'm not sure if they have a Czech website, if not try google
    translation for certain pages if what your looking for is a small number of
    corpus. If this didn't fit with what you need just use xml tools like xsl
    and xslt to transform content to your requirements.

    Regards,
    Mustafa Abusalah

    -----Original Message-----
    From: owner-corpora@lists.uib.no [mailto:owner-corpora@lists.uib.no] On
    Behalf Of Filip Malik
    Sent: Thursday, January 25, 2007 9:32 AM
    To: versley@sfs.uni-tuebingen.de; Filip Malik
    Cc: CORPORA@uib.no
    Subject: Re: [Corpora-List] structured data (enu | csy) for IE needed

    >My guess would be that Wikipedia fits your description, where you will
    find
    >many tables and/or templates, and it is available in English and Czech. I
    >don't know if anyone has tried extracting specific information from that,
    >though.

    Thanks Yannick for your suggestion. Your reply warn me. I forgot to mention
    very importing condition: I need data from fixed domain (e.g. house sales)

    Best regards,
    Filip Malik
    -fm



    This archive was generated by hypermail 2b29 : Thu Jan 25 2007 - 10:42:47 MET