140428-Xi Ma

From cslt Wiki
Jump to: navigation, search

Last week:

1.Learning the knowledge of language model

2.Segment the training/test sentences with the 150k lexicon and get the ppl of the

test: /nfs/home/zhangzhiyong/work/train_470h/test/huawei_disanpi.txt, using the following LM:

/home/thdnn/resource/lm/Hunhe_zhongzi_and_add_and_PPL_5yuan_1e9.lm

3.Build the new LM using the lexicon with the keywords involved. Re-segment the test files, and then test the PPL.

This week:

1.To extracte the related sentences of this filed from the original corpus