Word Bags vs Word Sequences for Text Classification
data:image/s3,"s3://crabby-images/72ee8/72ee8941e8cd75bf4feef0f815a1700ea7b0c782" alt=""
Sequence respecting approaches have an edge over bag-of-words implementations when the said sequence is material to classification. Long Short Term Memory (LSTM) neural nets with words sequences are evaluated against Naive Bayes with tf-idf vectors on a synthetic text corpus for classification effectiveness.