Date People Last Week This Week
2017/1/3 Yang Feng
Jiyuan Zhang
Andi Zhang
  • added source masks in attention_decoder, where attention is calculated, and in gru_cell, where new states are calculated.
  • found the attribute sentence_length; it may work better than my own code
Shiyue Zhang
  • figured out the problem with attention: the initial value of V should be around 0
  • tested different modifications, such as adding a mask and initializing b to 0.
  • compared the results and concluded that changing only the initial value of V works best.
  • try to get the right attention on memory
Peilun Xiao