[Corpora-List] ELRA - Language Resources Catalogue - Update

From: ELDA (info@elda.org)
Date: Wed Sep 20 2006 - 10:57:00 MET DST

Next message: Nicola Cancedda: "[Corpora-List] Researcher in Machine Learning for Cross-Language Technologies at XRCE"

Previous message: Mitkov, Ruslan: "[Corpora-List] Final (corrected) announcement: Professor/Reader in Computational Linguistics/Natural Language Processing"
Next in thread: ELDA: "[Corpora-List] ELRA - Language Resources Catalogue - Update"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

Our apologies if you have received multiple copies of this announcement.

*******************************************************************
ELRA - Language Resources Catalogue - Update
*******************************************************************

*Our on-line catalogue has moved to the following address:
http://catalog.elra.info <http://catalog.elra.info/>. Please update your
bookmarks.

*We are happy to announce that new Written Language Resources are now
available in our catalogue.

**** ELRA-L0072 PAROLE-SIMPLE-CLIPS PISA Italian Lexicon ***
*PAROLE-SIMPLE-CLIPS is a four-level, general purpose lexicon that has
been elaborated over three different projects. The PAROLE-SIMPLE-CLIPS
Pisa Italian Lexicon comprises a total of 387,267 phonetic units, 53,044
morphological units (53,044 lemmas), 37,406 syntactic units (28,111
lemmas) and 28,346 semantic units (19,216 lemmas). The
PAROLE-SIMPLE-CLIPS Pisa Italian Lexicon was encoded at the semantic
level, in full accordance with the international standards set out in
the PAROLE-SIMPLE model and based on EAGLES. Syntactic and semantic
encoding were performed jointly with Thamus (Consortium for Multilingual
Documentary Engineering), which is responsible for 25,000 extra entries
(to be released soon).
This lexicon is subdivided into five different subsets:
L0072-01 Full lexicon
L0072-02 Phonetic layer
L0072-03 Morphological layer
L0072-04 Syntactic layer
L0072-05 Semantic layer
For more information, see:
http://catalog.elra.info/product_info.php?products_id=881&language=en
<http://catalog.elra.info/product_info.php?products_id=881&language=en>

**** ELRA-W0043 PAROLE Italian Corpus ***
*The PAROLE Italian Corpus comprises 3,135,651 words collected from four
different domains: newspapers (2,179,800 words), periodicals (143,810
words), books (564,964 words), miscellaneous (247,077 words). Data are
morphosyntactically annotated and lemmatized.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=886&language=en
<http://catalog.elra.info/product_info.php?products_id=886&language=en>

**** ELRA-W0044 Italian Syntactic-Semantic Treebank (ISST) ***

*For more information, see:
http://catalog.elra.info/product_info.php?products_id=887&language=en
<http://catalog.elra.info/product_info.php?products_id=887&language=en>

For more information on the catalogue, please contact Valérie Mapelli
mailto:mapelli@elda.org <mailto:mapelli@elda.org>

Next message: Nicola Cancedda: "[Corpora-List] Researcher in Machine Learning for Cross-Language Technologies at XRCE"
Previous message: Mitkov, Ruslan: "[Corpora-List] Final (corrected) announcement: Professor/Reader in Computational Linguistics/Natural Language Processing"
Next in thread: ELDA: "[Corpora-List] ELRA - Language Resources Catalogue - Update"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

This archive was generated by hypermail 2b29 : Wed Sep 20 2006 - 10:57:05 MET DST