Hi all,
I was wondering if anyone had a revised celex list, in particular a revised
list of the celex words split by morpheme. I was planning to use celex as a
gold standard to test my morphological analyzer. However, when I extracted
the celex words split by morpheme, I found there were many cases that seem
inappropriate for my purpose, e.g.
wrongheadedness --> wrongheaded-ness
vs. what I'd like: wrong+head+ed+ness
wistful --> wistful
vs. wist+ful
whitening --> whitening
vs. white+n+ing or whit+en+ing
Thanks!
Jerry
This archive was generated by hypermail 2b29 : Mon Jul 03 2006 - 02:49:32 MET DST