Dot products

How do we quantify the similarity between two documents in this vector space? A first attempt might consider the magnitude of the vector difference between two document vectors. This measure suffers from a drawback: two documents with very similar content can have a significant vector difference simply because one is much longer than the other. Thus the relative distributions of terms may be identical in the two documents, but the absolute term frequencies of one may be far larger.

Dot products

  • services sprite Dot products
  • services sprite Dot products
  • services sprite Dot products
  • services sprite Dot products
  • services sprite Dot products
  • services sprite Dot products
  • services sprite Dot products
  • services sprite Dot products
  • services sprite Dot products
  • services sprite Dot products
  • services sprite Dot products

Related posts:

This entry was posted in Links. Bookmark the permalink.









Comments are closed.