Re: [Corpora-List] efficient decision tree tool?

From: Andy Roberts (andyr@comp.leeds.ac.uk)
Date: Thu Jan 19 2006 - 10:59:22 MET

  • Next message: Marco Baroni: "[Corpora-List] chinese pos tagger/lemmatizer"

    I expect Ross Quinlan's C4.5 will be adequate then (which is what J4.8
    is based on).

    You can get it from http://www.rulequest.com/Personal/

    Andy

    On Thu, 19 Jan 2006, Caren Brinckmann wrote:

    > Dear all,
    >
    > we are currently working on corpus-based models of duration, F0, intensity,
    > and segmental reductions in read and spontaneous speech. For the first part
    > of our study we will use decision trees.
    >
    > Since our database is fairly large, I am looking for an efficient decision
    > tree tool with the following features:
    >
    > * nominal and numeric input features and predictees (classification and
    > regression trees)
    > * binary as well as multi-way splits
    > * efficient handling of large datasets (200,000 cases/records/instances with
    > up to 100 attributes/features/variables)
    > * nice to have: integrated feature selection algorithm
    >
    > In previous studies, I've worked with "wagon" from the Edinburgh Speech Tools
    > Library (http://www.cstr.ed.ac.uk/projects/speech_tools/) and "J48" from Weka
    > (http://www.cs.waikato.ac.nz/ml/weka/). While wagon is very fast and
    > memory-efficient, it only allows binary splits (as far as I know). Weka
    > allows multi-way splits, but is too slow and memory-consuming for our current
    > datasets.
    >
    > I'm looking forward to your suggestions!
    >
    > Kind regards,
    >
    > Caren.
    >
    > P.S.: If you know any other mailing list or forum where I could post my
    > question, please let me know.
    >
    > --
    > Caren Brinckmann
    > Saarland University, FR 4.7 Institute of Phonetics
    > P.O.Box 151150, 66041 Saarbruecken, Germany
    > Phone: +49-681-3024244, Fax: +49-681-3024684
    >
    >



    This archive was generated by hypermail 2b29 : Thu Jan 19 2006 - 11:22:01 MET