[Corpora-List] celex plus

From: j_kurjian@hotmail.com
Date: Mon Jul 03 2006 - 02:47:27 MET DST

  • Next message: JGnjbbr@aol.com: "[Corpora-List] Concordancer for Arabic"

    Hi all,
    I was wondering if anyone had a revised celex list, in particular a revised
    list of the celex words split by morpheme. I was planning to use celex as a
    gold standard to test my morphological analyzer. However, when I extracted
    the celex words split by morpheme, I found there were many cases that seem
    inappropriate for my purpose, e.g.
    wrongheadedness --> wrongheaded-ness
    vs. what I'd like: wrong+head+ed+ness
    wistful --> wistful
    vs. wist+ful
    whitening --> whitening
    vs. white+n+ing or whit+en+ing

    Thanks!
    Jerry



    This archive was generated by hypermail 2b29 : Mon Jul 03 2006 - 02:49:32 MET DST