Monthly Archives: June 2018

Reduced Order Models for Documents

The term-document matrix  is a high-order, high-fidelity model for the document-space. High-fidelity in the sense that  will correctly shred-bag-tag it to represent it as a vector in term-space as per VSM.  has entries, with distinct terms (rows) building documents (columns). But do we need all those values to capture this shred-bag-tag effect of … Read more »