140428-Xiaoxi Wang

From cslt Wiki
Revision as of 09:56, 28 April 2014 by Wxx (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

This week:

preprocessed the baiduzhidao and part of weibo data. wrote a Hanzi2Num tool sampled corpora from weibo and baiduzhidao (4.4G) and grabbed the keywords from them classified corpora according to keywords.

Next week: Train and evaluate lm from classified corpora make improves on algorithms