Improving neural language models with black-box analysis and generalization through memorization