Since Chinese given names are not limited to a set of
lexical items that are prototypically 'names' (i.e. they
can be just about any lexical item), they often give,
as you probably know, no clue about gender.
There has been some discussion of 'traits' that are
more feminine or masculine and would be reflected in names,
but there remains a lot of ambiguity. I doubt there is any
statistical method, algorithm, or even native speaker that
can make up for that problem!
Mark Lewellen
> -----Original Message-----
> From: owner-corpora@lists.uib.no
> [mailto:owner-corpora@lists.uib.no] On Behalf Of Jun Lang
> Sent: Tuesday, December 13, 2005 7:31 AM
> To: 'Xiaofei Lu'
> Cc: corpora@uib.no
> Subject: [Corpora-List] Re: [Corpora-List] Chinese Name
> Gender Recognition
>
>
> Yeah! There are many names which could be used for both male and
> female. It is a difficult problem. So far I have done some simple
> research on this topic, and recently I have been trying to get more
> data. Since the parameter space is very large, decision trees cannot
> get the final result quickly. I want to use a Bayes model again.
>
> Can you give me some ideas about it? Thanks a lot!
>
> Best wishes,
> Jun Lang
>
> -----Original Message-----
> From: Xiaofei Lu [mailto:xflu@ling.ohio-state.edu]
> Sent: December 13, 2005 13:56
> To: Jun Lang
> Subject: Re: [Corpora-List] Chinese Name Gender Recognition
>
> Interesting. What is the baseline, and how do you establish it?
> Many names can be either male or female, can't they?
>
> On Tue, 13 Dec 2005, Jun Lang wrote:
>
> > Hi all Corpora Members,
> >
> > I am currently studying Chinese name gender recognition. The input
> > is a Chinese name, and the output is the corresponding gender. I
> > used a decision tree method, but the final accuracy is only about
> > 70%.
> >
> > Do you know of any other method that can achieve higher accuracy?
> > And has anybody done similar research?
> >
> > Thanks a lot!
> >
> > Best wishes,
> >
> > Bill_Lang (Jun Lang): Ph.D. Candidate
> > Information Retrieval Laboratory
> > Harbin Institute of Technology
> > Mail: bill_lang@gmail.com
> > Homepage: http://ir.hit.edu.cn/~bill_lang
> >
>
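For readers following the thread, the "Bayes model" idea mentioned above could be sketched roughly as below. This is only a minimal illustration, not the method anyone in the thread actually used: it assumes a naive Bayes classifier whose features are the individual characters of the given name, with add-one smoothing. The function names (train, posteriors) and the tiny TOY_SAMPLES list are hypothetical and made up for the example; they are not real data and say nothing about achievable accuracy.

```python
import math
from collections import Counter, defaultdict

# Toy, invented training pairs (given name, label); NOT real data.
# One character deliberately appears under both labels to mimic ambiguity.
TOY_SAMPLES = [
    ("伟", "M"), ("强", "M"), ("军", "M"), ("华", "M"),
    ("丽", "F"), ("芳", "F"), ("娟", "F"), ("华", "F"),
]

def train(samples):
    """Count labels and per-label character frequencies."""
    label_counts = Counter()
    char_counts = defaultdict(Counter)
    vocab = set()
    for name, label in samples:
        label_counts[label] += 1
        for ch in name:
            char_counts[label][ch] += 1
            vocab.add(ch)
    return label_counts, char_counts, vocab

def posteriors(name, model):
    """Return P(label | name) under a character-level naive Bayes model."""
    label_counts, char_counts, vocab = model
    total = sum(label_counts.values())
    slots = len(vocab) + 1  # one extra slot for unseen characters
    log_scores = {}
    for label, n in label_counts.items():
        denom = sum(char_counts[label].values()) + slots
        score = math.log(n / total)  # log prior
        for ch in name:
            # add-one (Laplace) smoothed character likelihood
            score += math.log((char_counts[label][ch] + 1) / denom)
        log_scores[label] = score
    # Normalise in a numerically stable way.
    top = max(log_scores.values())
    weights = {k: math.exp(s - top) for k, s in log_scores.items()}
    z = sum(weights.values())
    return {k: w / z for k, w in weights.items()}

model = train(TOY_SAMPLES)
print(posteriors("丽", model))
print(posteriors("华", model))
```

A classifier like this returns a probability for each gender rather than a hard label, so a name whose characters occur equally under both labels stays near even odds. That is the ambiguity raised in the replies, and it also suggests why a majority-class or per-character baseline is worth reporting alongside any accuracy figure.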
This archive was generated by hypermail 2b29 : Wed Dec 21 2005 - 18:33:48 MET