Re: [Corpora-List] structured data (enu | csy) for IE needed

From: Paul Buitelaar (paulb@dfki.de)
Date: Thu Jan 25 2007 - 11:52:35 MET

  • Next message: José Manuel Martínez Martínez: "Re: [Corpora-List] structured data (enu | csy) for IE needed"

    Filip, in the context of the SmartWeb project
    (http://www.smartweb-projekt.de/) we are developing a data set in the
    football domain consisting of textual (unstructured) and tabular
    (semi-structured) match reports. You can find more info at
    http://www2.dfki.de/sw-lt/olp2_dataset/

    To get access to the data set just send me an email.

    Cheers

       Paul Buitelaar
       DFKI GmbH - Language Technology Lab
       Saarbrücken, Germany

       http://www.dfki.de/~paulb/

    >From: owner-corpora@lists.uib.no [mailto:owner-corpora@lists.uib.no] On
    >Behalf Of Filip Malik
    >Sent: Thursday, January 25, 2007 9:32 AM
    >To: versley@sfs.uni-tuebingen.de; Filip Malik
    >Cc: CORPORA@uib.no
    >Subject: Re: [Corpora-List] structured data (enu | csy) for IE needed
    >
    >
    >
    >> <>My guess would be that Wikipedia fits your description, where you
    >> will find many tables and/or templates, and it is available in
    >> English and Czech. I don't know if anyone has tried extracting
    >> specific information from that, though.
    >
    >
    >Thanks Yannick for your suggestion. Your reply warn me. I forgot to mention
    >very importing condition: I need data from fixed domain (e.g. house sales)
    >
    >Best regards,
    >Filip Malik
    >-fm
    >



    This archive was generated by hypermail 2b29 : Thu Jan 25 2007 - 12:20:07 MET