[Corpora-List] ELRA - Language Resources Catalogue - Update

From: ELDA (info@elda.org)
Date: Wed Sep 20 2006 - 10:57:00 MET DST

  • Next message: Nicola Cancedda: "[Corpora-List] Researcher in Machine Learning for Cross-Language Technologies at XRCE"

    Our apologies if you have received multiple copies of this announcement.

    *******************************************************************
    ELRA - Language Resources Catalogue - Update
    *******************************************************************

    *Our on-line catalogue has moved to the following address:
    http://catalog.elra.info <http://catalog.elra.info/>. Please update your
    bookmarks.

    *We are happy to announce that new Written Language Resources are now
    available in our catalogue.

    **** ELRA-L0072 PAROLE-SIMPLE-CLIPS PISA Italian Lexicon ***
    *PAROLE-SIMPLE-CLIPS is a four-level, general purpose lexicon that has
    been elaborated over three different projects. The PAROLE-SIMPLE-CLIPS
    Pisa Italian Lexicon comprises a total of 387,267 phonetic units, 53,044
    morphological units (53,044 lemmas), 37,406 syntactic units (28,111
    lemmas) and 28,346 semantic units (19,216 lemmas). The
    PAROLE-SIMPLE-CLIPS Pisa Italian Lexicon was encoded at the semantic
    level, in full accordance with the international standards set out in
    the PAROLE-SIMPLE model and based on EAGLES. Syntactic and semantic
    encoding were performed jointly with Thamus (Consortium for Multilingual
    Documentary Engineering), which is responsible for 25,000 extra entries
    (to be released soon).
    This lexicon is subdivided into five different subsets:
    L0072-01 Full lexicon
    L0072-02 Phonetic layer
    L0072-03 Morphological layer
    L0072-04 Syntactic layer
    L0072-05 Semantic layer
    For more information, see:
    http://catalog.elra.info/product_info.php?products_id=881&language=en
    <http://catalog.elra.info/product_info.php?products_id=881&language=en>

    **** ELRA-W0043 PAROLE Italian Corpus ***
    *The PAROLE Italian Corpus comprises 3,135,651 words collected from four
    different domains: newspapers (2,179,800 words), periodicals (143,810
    words), books (564,964 words), miscellaneous (247,077 words). Data are
    morphosyntactically annotated and lemmatized.
    For more information, see:
    http://catalog.elra.info/product_info.php?products_id=886&language=en
    <http://catalog.elra.info/product_info.php?products_id=886&language=en>

    **** ELRA-W0044 Italian Syntactic-Semantic Treebank (ISST) ***

    *For more information, see:
    http://catalog.elra.info/product_info.php?products_id=887&language=en
    <http://catalog.elra.info/product_info.php?products_id=887&language=en>

    For more information on the catalogue, please contact Valérie Mapelli
    mailto:mapelli@elda.org <mailto:mapelli@elda.org>



    This archive was generated by hypermail 2b29 : Wed Sep 20 2006 - 10:57:05 MET DST