新規登録 | ログイン | FAQ      [?] 
CiteULike is a free online bibliography manager. Register and you can start organising your references online.
Recent | Unread | Search | Authors | Tags | Export

A Cascaded Linear Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging

by: Wenbin Jiang, Liang Huang, Qun Liu, Yajuan
(June 2008), pp. 897-904.


View FullText article


X Reviews [Write a review of this article]

This cascade linear model is actually a linear combination of different models, including a perceptron based on lexical-targets, a word LM, a POS LM, a co-occurrence model, and word count penalty. The training of the cascade system is in two steps, rather than training all the parameters at a time, which would be intractable. The first step is the perceptron training (of course LMs all need to be trained). And the second step is to optimizing the combination weights.

Using this method, the word segmentation and POS tagging are performed together, and the results are improved. This is sensible since joint segmentation and tagging can help each other.

Reviewed by zzb3886 as - 2008-07-01 00:10:55

X Find related articles from these CiteULike users

X Find related articles with these CiteULike tags

X BibTeX record

X RIS record



RIS BibTeX
CiteULike organises scholarly (or academic) papers or literature and provides bibliographic (which means it makes bibliographies) for universities and higher education establishments. It helps undergraduates and postgraduates. People studying for PhDs or in postdoctoral (postdoc) positions. The service is similar in scope to EndNote or RefWorks or any other reference manager like BibTeX, but it is a social bookmarking service for scientists and humanities researchers.