Mark, you moron! If you add a fourth message to the set of possible messages, you should also look at the softmax output for that fourth message when deciding which message was predicted with the highest probability. If you do that, the LSTM works very well even with the “noisy” data. Problem solved.
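For anyone who hits the same thing, here is a rough sketch of the mistake and the fix. My actual code isn't shown in this thread, so the function names and the logits below are made up purely for illustration; the only point is that the argmax has to run over all four softmax outputs, not just the original three.

```python
import math

def softmax(logits):
    # Numerically stable softmax over a plain list of scores.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def predict_message(logits):
    # Correct: take the argmax over every class, including the fourth message.
    probs = softmax(logits)
    return max(range(len(probs)), key=probs.__getitem__)

def predict_message_buggy(logits, n_original=3):
    # The mistake: only the first three softmax outputs are compared,
    # so the fourth message can never be chosen.
    probs = softmax(logits)[:n_original]
    return max(range(len(probs)), key=probs.__getitem__)

# Hypothetical network output for one sample whose true class is the fourth message (index 3).
logits = [0.2, 1.0, 0.1, 2.5]
print(predict_message(logits))        # index over all four classes
print(predict_message_buggy(logits))  # index restricted to the first three
```

With the restricted version, every sample belonging to the fourth message gets counted as a wrong prediction, which is presumably why the network looked so much worse on the "noisy" data than it really was.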