Discussion: 2015-08-17

From cslt Wiki
Revision as of 04:52, 19 August 2015 by Luoling (Talk | contribs)

Work this week:

1. Finished training word embeddings via 5 models:
   using the EnWiki dataset (953M): CBOW, Skip-Gram (SG)
   using the text8 dataset (95.3M): CBOW, Skip-Gram (SG), C&W, GloVe, LBL and Order (count-based)
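As a reminder of how the two word2vec variants differ, here is a minimal sketch of how training examples could be extracted from a token stream. It is illustrative only and is not the training code used for these runs; the function names and the `window` parameter are assumptions made for this example.

```python
def skipgram_pairs(tokens, window=2):
    """Yield (center, context) pairs.

    Skip-Gram predicts each surrounding word from the center word,
    so every (center, neighbor) combination is one training example.
    """
    for i, center in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                yield center, tokens[j]


def cbow_examples(tokens, window=2):
    """Yield (context_words, center) examples.

    CBOW predicts the center word from all of its surrounding words
    at once, so each position gives a single training example.
    """
    for i, center in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        context = tokens[lo:i] + tokens[i + 1:hi]
        if context:
            yield context, center


if __name__ == "__main__":
    # Hypothetical toy sentence, not taken from EnWiki or text8.
    sentence = "the quick brown fox jumps".split()
    for pair in skipgram_pairs(sentence, window=1):
        print("SG  ", pair)
    for example in cbow_examples(sentence, window=1):
        print("CBOW", example)
```

Skip-Gram produces more examples per position than CBOW, since it emits one pair per neighbor where CBOW emits one example per center word.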

2. Used the following tasks to measure the quality of the word vectors at various dimensions:
   word similarity (ws)
   the TOEFL set
   analogy task
   text classification
   named entity recognition (ner)
   sentence-level sentiment classification (based on convolutional neural networks), referred to as 'cnn'
   part-of-speech tagging (pos)
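The two intrinsic tasks above (word similarity and analogy) are commonly scored as follows: word similarity by the Spearman rank correlation between model cosine similarities and human ratings, and analogy by a nearest-neighbor search around the offset vector b - a + c. The sketch below assumes that common setup, which may differ from the scripts actually used here. The toy vectors, the gold scores and all function names are invented for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def ranks(values):
    """Return 1-based ranks, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (std_x * std_y)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def word_similarity_score(vectors, gold):
    """gold is a list of (word1, word2, human_score) triples."""
    model = [cosine(vectors[a], vectors[b]) for a, b, _ in gold]
    human = [score for _, _, score in gold]
    return spearman(model, human)


def solve_analogy(vectors, a, b, c):
    """Answer 'a is to b as c is to ?' with the word nearest to b - a + c."""
    target = [y - x + z for x, y, z in zip(vectors[a], vectors[b], vectors[c])]
    candidates = (w for w in vectors if w not in (a, b, c))
    return max(candidates, key=lambda w: cosine(vectors[w], target))


# Hypothetical 3-dimensional toy embeddings, not real trained vectors.
TOY_VECTORS = {
    "king": [0.9, 0.8, 0.1],
    "queen": [0.9, 0.1, 0.8],
    "man": [0.1, 0.9, 0.1],
    "woman": [0.1, 0.1, 0.9],
    "apple": [0.5, 0.5, 0.5],
}

if __name__ == "__main__":
    print(solve_analogy(TOY_VECTORS, "man", "woman", "king"))
    # Made-up human ratings, only to exercise the scoring function.
    toy_gold = [("king", "queen", 8.0), ("king", "man", 6.0), ("man", "apple", 2.0)]
    print(word_similarity_score(TOY_VECTORS, toy_gold))
```

The question words are excluded from the analogy candidates, as is usual for this task; without that filter the nearest neighbor is often one of the input words.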