Category Archives: Deep Learning

BoW vs BERT: Classification

Ashok Chilakapati October 10, 2019 7 Comments

BERT yields the best F1 scores on three different repositories representing binary, multi-class, and multi-label/class situations. However, BoW with tf-idf weighted one-hot word vectors and an SVM classifier is not a bad alternative to going full bore with BERT, as it is cheap.

Attention as Adaptive Tf-Idf for Deep Learning

Ashok Chilakapati July 22, 2019 No Comments

Attention is like tf-idf for deep learning. Both attention and tf-idf boost the importance of some words over others. But while tf-idf weight vectors are static for a set of documents, attention weight vectors adapt to the particular classification objective. Attention derives larger weights for the words that influence the classification objective, thus opening a window into the decision-making process within the deep learning black box…

Reconciling Data Shapes and Parameter Counts in Keras

Ashok Chilakapati June 18, 2019 No Comments

Convolutional layers and their cousins, the pooling layers, are examined for how they modify data shapes and for their parameter counts, both as functions of layer parameters in Keras/TensorFlow…
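As a rough illustration of the kind of shape bookkeeping this post refers to, here is a minimal sketch using the standard output-length formulas for 1-D convolution and pooling with "valid" or "same" padding and no dilation. The function names and the example sizes are hypothetical and are not taken from the post.

```python
import math


def output_length(input_length, window, strides=1, padding="valid"):
    """Sequence length after a 1-D convolution or pooling layer (no dilation)."""
    if padding == "valid":
        # The window must fit entirely inside the input.
        return (input_length - window) // strides + 1
    if padding == "same":
        # The input is padded so that only the stride shrinks the length.
        return math.ceil(input_length / strides)
    raise ValueError("padding must be 'valid' or 'same'")


def conv1d_param_count(in_channels, filters, kernel_size, use_bias=True):
    """Trainable weights: one kernel per filter, plus one bias per filter."""
    return kernel_size * in_channels * filters + (filters if use_bias else 0)


# Hypothetical example: 100 time steps with 32 channels each.
seq_len, channels = 100, 32
conv_len = output_length(seq_len, window=5)               # convolution, kernel size 5
pool_len = output_length(conv_len, window=2, strides=2)   # pooling, pool size 2
conv_params = conv1d_param_count(channels, filters=64, kernel_size=5)
same_len = output_length(seq_len, window=5, padding="same")

print(conv_len, pool_len, conv_params, same_len)
```

Pooling layers have no trainable parameters, so only the convolution contributes to the count here.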

Flowing Tensors and Heaping Parameters in Deep Learning

Ashok Chilakapati June 6, 2019 1 Comment

Formulae for trainable parameter counts are developed for a few popular layers as functions of layer parameters and input characteristics. The results are then reconciled with what Keras reports upon running the model…
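The excerpt does not say which layers the post covers, so the following is only a sketch for three common ones (embedding, LSTM, and dense), using the standard parameter-count formulas. The function names and layer sizes are hypothetical choices for illustration.

```python
def embedding_params(vocab_size, embedding_dim):
    """One embedding vector per vocabulary entry."""
    return vocab_size * embedding_dim


def lstm_params(input_dim, units):
    """Four gates, each with an input kernel, a recurrent kernel, and a bias."""
    return 4 * (input_dim * units + units * units + units)


def dense_params(input_dim, units, use_bias=True):
    """A weight matrix plus an optional bias vector."""
    return input_dim * units + (units if use_bias else 0)


# Hypothetical model: Embedding(10000, 128) -> LSTM(64) -> Dense(10)
emb = embedding_params(10000, 128)
lstm = lstm_params(128, 64)
dense = dense_params(64, 10)
total = emb + lstm + dense

print(emb, lstm, dense, total)
```

For a model built with these same sizes, the per-layer numbers can be compared against the "Param #" column printed by Keras's `model.summary()`.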

Convolution Nets For Sentiment Analysis

Amit Bishnoi February 28, 2019 No Comments

In our last article, we were getting some really good results with CNN when we used a custom text corpus. But will CNN manage to hold on to its lead when it competes with SVM in the battle of sentiment analysis? Let’s find out…

Multiclass Classification with Word Bags and Word Sequences

Ashok Chilakapati February 21, 2019 18 Comments

SVM with Tf-idf vectors edges out LSTM in quality and performance for classifying the 20-newsgroups text corpus.

Sequence Based Text Classification with Convolution Nets

Amit Bishnoi January 30, 2019 No Comments

Earlier, with the bag-of-words approach, we were getting some really good text classification results. But will that hold when we take the sequence of words into consideration? There is only one way to find out. Let’s get right into the action with a head-on comparison of a traditional approach (Naive Bayes) and a modern neural-network-based one (CNN).

Word Bags vs Word Sequences for Text Classification

Ashok Chilakapati January 13, 2019 1 Comment

Sequence-respecting approaches have an edge over bag-of-words implementations when the word sequence is material to classification. Long Short-Term Memory (LSTM) neural nets with word sequences are evaluated against Naive Bayes with tf-idf vectors on a synthetic text corpus for classification effectiveness.