A Uniform Approach to Analogies, Synonyms, Antonyms, and Associations

In a previous post, I wondered how Latent Semantic Analysis (LSA) and Latent Relational Analysis (LRA) could be unified. LSA can be used to recognize synonyms and LRA can be used to recognize analogies. The two algorithms are quite similar, so it seems it should be possible to design a more general algorithm that [...]

Effects of High-Order Co-occurrences on Word Semantic Similarity

Comments on:
Benoît Lemaire and Guy Denhière
Effects of High-Order Co-occurrences on Word Semantic Similarity
PMI-IR estimates the semantic similarity between a pair of words by how frequently they co-occur within a certain window of text. This simple measure of similarity is surprisingly good at recognizing synonyms: it seems that synonyms often appear close together in text. [...]

Unified Latent Analysis

In Latent Semantic Analysis, we use a large collection of text to build a matrix, in which the rows represent words and the columns represent chunks of text. A chunk can be a sentence, a paragraph, a document, or any sequence of words. The value in a cell in the matrix is based on the [...]