Asr-nsfc-weekly-2016-11-28

Date People Last Week This Week
2016.11.28
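Tsinghua (清华)
  • Found that CodeMap has some problems that introduce errors when the corpus is converted.
  • Tried to lower the language model's perplexity (PPL) on the test-set corpus.
  • Continue the language model work.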
Xinjiang University (新大)
  • Recording of Kazak utterances is finished.
  • Checking and preparing the Kazak AM text corpora.
  • Revising the Kazak LM corpora and some supporting programs.
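Minzu University (民大)
  • Select about 6,000 sentences as the final pronunciation text.
  • Proofread about 1,000 entries of the Lhasa Tibetan pronunciation dictionary.
  • Proofreading of the Lhasa Tibetan pronunciation dictionary.
  • Mongolian dictionary data entry.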

Date People Last Week This Week
2016.11.28
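Tsinghua (清华)
  • Identified problems including the training-set matching issue and a corpus domain mismatch.
  • Waiting for further updates to the data and corpora before retraining the acoustic model and the language model.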
Xinjiang University (新大)
  • Additional Kazak acoustic rules were discussed with experts, and new rules were determined.
  • Lost Kazak utterances were extracted, and preparations for the make-up recording are done.
  • The Kazak irregular acoustic dictionary will be prepared manually again.
  • Recording of Kazak utterances will continue.
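Minzu University (民大)
  • Select about 50,000 written-language pronunciation texts each for Tibetan and Mongolian.
  • Proofread about 1,000 entries of the Lhasa Tibetan pronunciation dictionary.
  • Compute triphones over the written Tibetan pronunciation text and select about 6,000 sentences as the final pronunciation text.
  • Proofreading of the Lhasa Tibetan pronunciation dictionary.
  • Mongolian dictionary data entry.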
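
Note on the triphone-based sentence selection mentioned above: the report does not say how the selection is done. One common approach is a greedy search that repeatedly picks the sentence adding the most triphones not yet covered. The sketch below is only an illustration under that assumption; the function names, the "sil" boundary padding, and the stopping rule are hypothetical and are not taken from the project's tools.

```python
from typing import Dict, List, Set, Tuple

Triphone = Tuple[str, str, str]


def triphones(phones: List[str]) -> Set[Triphone]:
    """Return the (left, center, right) contexts of a phone sequence.

    Assumption: sentence boundaries are padded with a 'sil' phone.
    """
    padded = ["sil"] + phones + ["sil"]
    return {
        (padded[i - 1], padded[i], padded[i + 1])
        for i in range(1, len(padded) - 1)
    }


def select_sentences(candidates: Dict[str, List[str]], target: int) -> List[str]:
    """Greedily choose up to `target` sentence ids by new triphone coverage.

    `candidates` maps a sentence id to its phone sequence (hypothetical format).
    Selection stops early once no remaining sentence adds an unseen triphone.
    """
    remaining = {sid: triphones(phones) for sid, phones in candidates.items()}
    covered: Set[Triphone] = set()
    chosen: List[str] = []
    while remaining and len(chosen) < target:
        # Largest gain first; ties are broken by the smaller sentence id.
        best = min(remaining, key=lambda sid: (-len(remaining[sid] - covered), sid))
        if not remaining[best] - covered:
            break  # nothing left that improves coverage
        covered |= remaining.pop(best)
        chosen.append(best)
    return chosen


if __name__ == "__main__":
    demo = {"s1": ["a", "b"], "s2": ["a", "b", "c"], "s3": ["a", "b"]}
    print(select_sentences(demo, target=2))
```

With a pool of about 50,000 candidate sentences this loop rescans the pool for each pick, which is simple but not fast; a real selection would likely also weight rare triphones and sentence length.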