Re: [Corpora-List] text XML representation for NLP

From: Lou Burnard (lou.burnard@computing-services.oxford.ac.uk)
Date: Mon Feb 28 2005 - 11:24:29 MET


    The Text Encoding Initiative's Recommendations for encoding are also
    very useful for NLP (not surprisingly, since the ACL was one of the
    first sponsors of the TEI, and most of those currently active in the
    field of XML annotation "cut their teeth" on the TEI).

    The TEI Recommendations were updated to use XML at the last major
    revision (TEI P4, published in 2000); the next major revision, a
    preliminary release of which is now available at the TEI's sourceforge
    site, is a complete rewrite, aiming to include new materials and
    standards. See http://www.tei-c.org/P5/ for details. I think the new ODD
    system may be of particular interest to NLP practitioners.
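
    As a rough illustration of the layered annotation the original
    question describes, here is a minimal sketch that wraps part-of-speech
    tagged words in phrase elements inside a sentence. The element names
    (s, phr, w), the use of a "type" attribute for the tags, and the sample
    tokens are assumptions chosen for illustration; check the
    Recommendations themselves before relying on any of them.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample data: each phrase has a label (the upper layer) and a
# list of (word form, part-of-speech tag) pairs (the base layer).
phrases = [
    ("NP", [("The", "DET"), ("cat", "NOUN")]),
    ("VP", [("sleeps", "VERB")]),
]


def build_sentence(phrases):
    """Build an <s> element holding <phr> elements that wrap <w> elements."""
    s = ET.Element("s")
    for label, words in phrases:
        phr = ET.SubElement(s, "phr", {"type": label})
        for form, tag in words:
            w = ET.SubElement(phr, "w", {"type": tag})
            w.text = form
    return s


sentence = build_sentence(phrases)
xml_text = ET.tostring(sentence, encoding="unicode")
print(xml_text)
```

    Nesting keeps both layers in one tree; a stand-off scheme that stores
    each layer separately would be an equally reasonable choice.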

    Lou Burnard

    Constantin Orasan wrote:

    > Hi,
    >
    > Have a look at:
    > XCES: http://www.xces.org/ and
    > EAGLES/ISLE: http://www.mpi.nl/world/ISLE/
    >
    > Unfortunately these pages haven't been updated for a while. Maybe
    > someone will be able to indicate more up-to-date pages.
    >
    > Regards,
    >
    > Constantin
    >
    >
    >>Dear CORPORA list people,
    >>
    >> Right now, I am working on a text XML representation for
    >>natural language processing.
    >>
    >> The representation is meant to work for any text, and we will
    >>use it in our natural language processing. It will include the
    >>layers of NLP from base to top. The base layer may hold
    >>part-of-speech information. The top layer may hold the results of
    >>syntactic analysis or shallow semantic information.
    >>
    >>As far as I know, there have been many conferences on XML for NLP, so
    >>I guess some XML text representation for NLP already exists. But I
    >>have not found one.
    >>
    >>Could you give some information about it?
    >>
    >>Thank you very much!
    >>
    >>
    >>
    >>Best wishes,
    >>
    >>-Bill_Lang
    >>
    >
    > ============================================
    > Constantin Orasan
    > Research Group in Computational Linguistics
    > University of Wolverhampton
    > http://www.wlv.ac.uk/~in6093/
    >
    >
    >


