Tag Archives: Bag of words
BoW to BERT
Word vectors have evolved over the years to know the difference between “record the play” and “play the record”. They have moved from a one-hot world, where every word was orthogonal to every other word, to a place where word vectors morph to suit the context. Slapping a BoW on word vectors is the usual way to build a document vector for tasks such as classification. But BERT does not need a BoW, as the vector shooting out of the top [CLS] token is already primed for the specific classification objective.
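To make the BoW point concrete, here is a minimal sketch in plain Python. The three-word vocabulary and the 2-d embedding values are made up for illustration and do not come from any trained model. It shows that one-hot vectors are mutually orthogonal, and that a BoW-weighted sum of static word vectors gives the same document vector for “record the play” and “play the record”, since word order is thrown away.

```python
# Toy vocabulary; an assumption for illustration only.
vocab = ["play", "record", "the"]
index = {word: i for i, word in enumerate(vocab)}

# Hypothetical 2-d static word vectors (made-up numbers, not trained).
embeddings = {
    "play": [0.9, 0.1],
    "record": [0.8, 0.3],
    "the": [0.1, 0.1],
}

def one_hot(word):
    """One-hot vector: 1.0 at the word's index, 0.0 elsewhere."""
    vec = [0.0] * len(vocab)
    vec[index[word]] = 1.0
    return vec

def dot(u, v):
    """Plain dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(u, v))

def bow_counts(text):
    """Bag-of-words: count each vocabulary word, ignoring order."""
    counts = [0.0] * len(vocab)
    for token in text.lower().split():
        counts[index[token]] += 1.0
    return counts

def doc_vector(text):
    """Document vector = sum of word vectors weighted by BoW counts."""
    counts = bow_counts(text)
    dim = len(embeddings[vocab[0]])
    vec = [0.0] * dim
    for word, count in zip(vocab, counts):
        for d in range(dim):
            vec[d] += count * embeddings[word][d]
    return vec

# Distinct one-hot words have zero overlap.
orthogonal = dot(one_hot("play"), one_hot("record")) == 0.0

# Same words, different order: the BoW document vectors coincide.
same_doc_vector = doc_vector("record the play") == doc_vector("play the record")
```

Both `orthogonal` and `same_doc_vector` come out true here. A contextual model such as BERT would instead produce different vectors for “play” and “record” in the two phrases, which is the contrast the post draws.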
Word Embeddings and Document Vectors: Part 2. Classification
In the previous post, “Word Embeddings and Document Vectors: Part 1. Similarity”, we laid the groundwork for using bag-of-words based document vectors in conjunction with word embeddings (pre-trained or custom-trained) for computing document similarity, as a precursor to classification. It seemed that document+word vectors were better at picking up on similarities…
Word Embeddings and Document Vectors: Part 1. Similarity
Classification hinges on the notion of similarity. This similarity can be as simple as a categorical feature value such as the color or shape of the objects we are classifying, or a more complex function of all categorical and/or continuous feature values that these objects possess. Documents can be classified…