Teacher Forcing Mechanism

Drawback

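Early in training, an RNN decoder predicts the next word poorly. If each wrong prediction is fed back as the next input, the errors accumulate along the sequence and training converges slowly.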

Method

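Teacher forcing addresses this by controlling what the decoder receives as input at each time step. Under teacher forcing, the input is the ground-truth token from the previous time step. Under free running, the input is the decoder's own prediction from the previous time step.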
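
The two input modes can be illustrated with a minimal sketch in plain Python. It is not taken from the referenced post: `decode`, `toy_step`, and the integer "tokens" are hypothetical names invented for illustration, and `toy_step` is a stand-in for one step of a poorly trained RNN decoder rather than a real model.

```python
def decode(step, targets, start_token, teacher_forcing):
    """Run a decoder step by step and return its predictions.

    step(prev_token, state) -> (predicted_token, new_state) is a
    hypothetical stand-in for one RNN decoder step.
    """
    state = None
    prev = start_token
    predictions = []
    for target in targets:
        pred, state = step(prev, state)
        predictions.append(pred)
        # Teacher forcing: the next input is the ground-truth token.
        # Free running: the next input is the decoder's own prediction.
        prev = target if teacher_forcing else pred
    return predictions


def toy_step(prev_token, state):
    # Assumed toy decoder: the true sequence increases by 1 per step,
    # but this "untrained" decoder always overshoots by predicting +2.
    return prev_token + 2, state


targets = [1, 2, 3, 4]
forced = decode(toy_step, targets, start_token=0, teacher_forcing=True)
free = decode(toy_step, targets, start_token=0, teacher_forcing=False)
print(forced)  # each prediction is off by exactly 1
print(free)    # the error grows with every step
```

In this toy setting, teacher forcing keeps every prediction one step away from the target, while free running lets the decoder's mistake compound along the sequence. This is the accumulation of early errors that the mechanism is meant to avoid during training.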


Reference

https://blog.csdn.net/qq_30219017/article/details/89090690