Posted on November 1, 2008 by Peter Turney
There is a steady trickle of visitors to my post on Why Does SVD Improve Similarity Measurement?, so I gave this question a bit more thought. In that post, I offered three hypotheses about why SVD helps — high-order co-occurrence, latent meaning, and noise reduction — and I said that I didn’t know which hypothesis [...]
Filed under: Computational Linguistics, Semantics | Tagged: data analysis, SVD, text analysis | 6 Comments »
Posted on November 15, 2007 by Peter Turney
I recently completed a technical report on tensor decomposition, Empirical Evaluation of Four Tensor Decomposition Algorithms. The tech report has two messages: (1) If you like Singular Value Decomposition, you’re going to love Tucker Decomposition. (2) Many interesting applications involve large tensors. If your application requires large tensors, the tech report includes MATLAB source code [...]
Filed under: Computer Science, Semantics | Tagged: data analysis, tensors, text analysis | 1 Comment »
Posted on September 18, 2007 by Peter Turney
Recently I’ve been experimenting with algorithms for the Singular Value Decomposition and the Tucker Decomposition, with the goal of processing large matrices (more than 105 rows and columns) and large tensors (more than 104 rows, columns, and tubes) that are relatively sparse (about 10% density). The problem with matrices and tensors of this size is [...]
Filed under: Computational Linguistics, Computer Science, Philosophy of Mind | Tagged: data analysis, SVD, tensors, text analysis | 3 Comments »
Posted on July 24, 2007 by Peter Turney
For the last several months, I’ve been playing with tensors as an approach to data and text analysis. Here are some pointers to get started on tensors.
Tensors are a generalization of matrices to higher dimensions:
order 0 tensor = scalar
order 1 tensor = vector
order 2 tensor = matrix
order n > 2 tensor = higher order tensor
PARAFAC [...]
Filed under: Computational Linguistics, Computer Science, Semantics | Tagged: data analysis, SVD, tensors, text analysis | 4 Comments »
Posted on January 24, 2007 by Peter Turney
In response to my earlier post on Effects of High-Order Co-occurrences on Word Semantic Similarity, Tom Landauer sent me the following note:
You have given me an idea. Because I have just been asked again to review papers that say that the way LSA works is by indirect associations, it seems that few have seen my [...]
Filed under: Computational Linguistics, Semantics | Tagged: data analysis, SVD, text analysis | 2 Comments »