Polynomial filtering in latent semantic indexing for information retrieval
34 Citations · 16 References · 10 Related Papers
Abstract:
Latent Semantic Indexing (LSI) is a well-established and effective framework for conceptual information retrieval. In traditional implementations of LSI, the semantic structure of the collection is projected into the k-dimensional space derived from a rank-k approximation of the original term-by-document matrix. This paper discusses a new way to implement the LSI methodology, based on polynomial filtering. The new framework does not rely on any matrix decomposition, and therefore its computational cost and storage requirements are low relative to traditional implementations of LSI. Additionally, it can be used as an effective information filtering technique when updating LSI models based on user feedback.
Keywords:
Rank (linear algebra)
Implementation
Latent semantic analysis
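As a rough illustration of the idea in the abstract, the sketch below contrasts standard rank-k LSI scoring with a decomposition-free filter applied through matrix-vector products only. It is a minimal sketch on a toy term-by-document matrix, and the simple power filter used here is a crude stand-in for the polynomial approximations of a spectral step function that the paper actually develops.

```python
# Minimal sketch: LSI query scoring via a truncated SVD vs. a matrix-free
# polynomial filter. The toy matrix, query, and power filter are illustrative
# assumptions, not the paper's actual filter construction.
import numpy as np

# Toy term-by-document matrix A (terms x documents) and a query vector q.
A = np.array([
    [1, 0, 1, 0],
    [1, 1, 0, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 1],
    [1, 0, 0, 1],
], dtype=float)
q = np.array([1, 1, 0, 0, 0], dtype=float)

# Traditional LSI: project the query onto the rank-k dominant subspace.
k = 2
U, s, Vt = np.linalg.svd(A, full_matrices=False)
scores_svd = A.T @ (U[:, :k] @ (U[:, :k].T @ q))

# Polynomial filtering: approximate the same projection with phi(A A^T) q,
# where phi damps the small eigenvalues. Only products with A and A^T are
# needed, so no decomposition is ever computed or stored.
def poly_filter(A, q, degree=8):
    lam_max = np.linalg.norm(A, 2) ** 2   # largest eigenvalue of A A^T
    v = q.copy()
    for _ in range(degree):
        v = A @ (A.T @ v) / lam_max       # (A A^T / lam_max)^degree applied to q
    return v

scores_poly = A.T @ poly_filter(A, q)

print(np.argsort(-scores_svd))   # document ranking from truncated SVD
print(np.argsort(-scores_poly))  # ranking from the decomposition-free filter
```

On this toy example the two rankings broadly agree, since the power filter suppresses the directions that the truncated SVD discards.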
This chapter introduces an unsupervised learning method, Latent Semantic Analysis (LSA). It first describes the word vector space model and the topic vector space model, then presents the SVD algorithm for LSA and the non-negative matrix factorization (NMF) algorithm.
Latent semantic analysis
Non-negative matrix factorization
Vector space model
Semantic space
Citations (0)
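To make the two factorizations named in this chapter concrete, the sketch below runs both on a tiny corpus. It is a minimal sketch assuming scikit-learn; the corpus and component counts are placeholders.

```python
# Minimal sketch: SVD-based LSA vs. NMF on the same document-term matrix.
from sklearn.decomposition import NMF, TruncatedSVD
from sklearn.feature_extraction.text import TfidfVectorizer

corpus = [
    "the cat sat on the mat",
    "dogs and cats are pets",
    "stock markets fell sharply today",
    "investors sold shares as markets dropped",
]

X = TfidfVectorizer().fit_transform(corpus)   # document-term matrix

# LSA: truncated SVD of the tf-idf weighted matrix; factors may be negative.
lsa = TruncatedSVD(n_components=2, random_state=0)
doc_topics_lsa = lsa.fit_transform(X)

# NMF: non-negative factors, often easier to read as additive topics.
nmf = NMF(n_components=2, init="nndsvd", random_state=0)
doc_topics_nmf = nmf.fit_transform(X)

print(doc_topics_lsa.round(2))
print(doc_topics_nmf.round(2))
```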
Latent semantic analysis (LSA) is a method for analyzing text with mathematical computation, examining the relationships between terms within documents and between documents within a corpus. Various applications in intelligent information retrieval, search engines, and internet news sites require an accurate method of assessing document similarity in order to carry out classification, clustering, summarization, or search tasks. In this paper we therefore study latent semantic analysis based on singular value decomposition. The aim of latent semantic analysis is to exploit the global structure of documents; the emphasis is on finding hidden relationships in documents for a better understanding of the relationships between terms and documents in a dataset. We conducted a study using LSA to find correlations of terms in a dataset consisting of research papers on various natural language processing applications. LSA shows that singular value decomposition collapses multiple terms with the same semantics, can identify terms with multiple meanings, and represents documents in a lower-dimensional conceptual space.
Latent semantic analysis
Explicit semantic analysis
Document Clustering
Semantic compression
Citations (34)
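The sketch below illustrates the term-correlation effect this paper studies: terms that never co-occur can still end up close together in the reduced space. It is a minimal sketch with a made-up count matrix and rank; both are assumptions for illustration.

```python
# Minimal sketch: term-term similarity in a rank-k LSA concept space.
import numpy as np

terms = ["car", "automobile", "engine", "flower", "petal"]
# Rows are terms, columns are documents (toy counts).
A = np.array([
    [2, 0, 0, 0],   # car
    [0, 2, 0, 0],   # automobile
    [1, 1, 0, 0],   # engine
    [0, 0, 2, 1],   # flower
    [0, 0, 1, 2],   # petal
], dtype=float)

U, s, Vt = np.linalg.svd(A, full_matrices=False)
k = 2
term_vecs = U[:, :k] * s[:k]          # term coordinates in concept space

def cos(a, b):
    return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))

# "car" and "automobile" never co-occur in any document, yet both co-occur
# with "engine", so the reduced space places them close together: the
# synonymy effect that singular value decomposition exposes.
print(cos(term_vecs[0], term_vecs[1]))   # high (close to 1)
print(cos(term_vecs[0], term_vecs[3]))   # low  (close to 0)
```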
We seek insight into Latent Semantic Indexing by establishing a method to identify the optimal number of factors in the reduced matrix for representing a keyword. The method is demonstrated empirically by duplicating all documents containing a term t and inserting new documents in the database that replace t with t'. By examining the number of times term t is identified by a search on term t' (precision) using differing ranges of dimensions, we find that lower-ranked dimensions identify related terms while higher-ranked dimensions discriminate between the synonyms.
Latent semantic analysis
Citations (25)
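A toy version of the probe described above can be set up in a few lines: duplicate every document containing t with t replaced by an artificial synonym, then score a query on the synonym using only a slice of the SVD dimensions. This is a minimal sketch assuming scikit-learn for tokenization; the corpus, terms, and dimension ranges are illustrative, not the paper's experimental setup.

```python
# Minimal sketch: inject a synonym t2 for term t, then query on t2 over
# different SVD dimension ranges and watch how the original t-documents score.
import numpy as np
from sklearn.feature_extraction.text import CountVectorizer

docs = [
    "car engine repair",
    "car dealership prices",
    "flower garden soil",
    "garden soil compost",
]
t, t2 = "car", "auto"
docs += [d.replace(t, t2) for d in docs if t in d]  # duplicated documents

vec = CountVectorizer()
X = vec.fit_transform(docs).T.toarray().astype(float)  # terms x documents
vocab = vec.vocabulary_

U, s, Vt = np.linalg.svd(X, full_matrices=False)

def rank_docs(dim_range, query_term):
    """Score all documents using only the given slice of SVD dimensions."""
    lo, hi = dim_range
    q = np.zeros(X.shape[0])
    q[vocab[query_term]] = 1.0
    return (q @ U[:, lo:hi]) @ np.diag(s[lo:hi]) @ Vt[lo:hi, :]

# In this toy example, low dimensions tend to score the original t-documents
# highly too, while higher dimensions tend to separate t from t2.
print(rank_docs((0, 2), t2).round(2))
print(rank_docs((2, 6), t2).round(2))
```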
By using a small example, an analogy to photographic compression, and a simple visualization using heatmaps, we show that latent semantic analysis (LSA) is able to extract what appears to be semantic meaning of words from a set of documents by blurring the distinctions between the words.
Latent semantic analysis
Citations (1)
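The "photographic compression" analogy is easy to reproduce: plot a raw term-by-document matrix next to its low-rank reconstruction and the blur is visible directly. This is a minimal sketch assuming matplotlib and a toy count matrix, not the paper's actual data.

```python
# Minimal sketch: heatmaps of a count matrix and its rank-2 reconstruction.
# The reconstruction "blurs" weight onto related terms, which is the effect
# the paper visualizes.
import matplotlib.pyplot as plt
import numpy as np

A = np.array([
    [2, 1, 0, 0],
    [1, 2, 0, 0],
    [0, 1, 1, 0],
    [0, 0, 2, 1],
    [0, 0, 1, 2],
], dtype=float)

U, s, Vt = np.linalg.svd(A, full_matrices=False)
k = 2
A_k = U[:, :k] @ np.diag(s[:k]) @ Vt[:k, :]   # rank-k "blurred" matrix

fig, axes = plt.subplots(1, 2)
for ax, M, title in zip(axes, [A, A_k], ["original", f"rank-{k}"]):
    ax.imshow(M, cmap="viridis")
    ax.set_title(title)
plt.show()
```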
In this paper we compare the usefulness of statistical dimensionality-reduction techniques for improving the clustering of documents in Polish. We start with partitional and agglomerative algorithms applied to the Vector Space Model. We then investigate two transformations: Latent Semantic Analysis and Probabilistic Latent Semantic Analysis. The obtained results showed an advantage of the Latent Semantic Analysis technique over the probabilistic model. We also analyse the time and memory consumption of these transformations and present runtime details for an IBM BladeCenter HS21 machine.
Latent semantic analysis
Hierarchical clustering
Citations (7)
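The comparison pipeline has a standard shape: vectorize, optionally reduce dimensionality, then cluster with a partitional and an agglomerative algorithm. Below is a minimal sketch assuming scikit-learn; the English toy corpus and parameters are placeholders (the paper works on Polish documents and also evaluates PLSA, which scikit-learn does not provide).

```python
# Minimal sketch: cluster raw tf-idf vectors and LSA-reduced vectors with
# k-means (partitional) and agglomerative clustering.
from sklearn.cluster import AgglomerativeClustering, KMeans
from sklearn.decomposition import TruncatedSVD
from sklearn.feature_extraction.text import TfidfVectorizer

corpus = [
    "football match score goal",
    "league goal striker football",
    "parliament vote election law",
    "election campaign vote senate",
]

X = TfidfVectorizer().fit_transform(corpus)
X_lsa = TruncatedSVD(n_components=2, random_state=0).fit_transform(X)

for name, data in [("tf-idf", X.toarray()), ("LSA", X_lsa)]:
    km = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(data)
    ag = AgglomerativeClustering(n_clusters=2).fit_predict(data)
    print(name, "k-means:", km, "agglomerative:", ag)
```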
Probabilistic Latent Semantic Analysis (PLSA) is an information retrieval technique proposed to address problems found in Latent Semantic Analysis (LSA). We have applied both LSA and PLSA in our system for grading essays written in Finnish, called Automatic Essay Assessor (AEA). We report results comparing PLSA and LSA on three essay sets from various subjects. The methods were found to be almost equal in accuracy, measured by the Spearman correlation between the grades given by the system and by a human. Furthermore, we propose methods for improving the use of PLSA in essay grading.
Latent semantic analysis
Grading (engineering)
Citations (55)
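The LSA side of such an evaluation can be sketched compactly: score essays by similarity to a reference answer in a reduced space, then compare system and human grades with Spearman's rho. This is a minimal sketch assuming scikit-learn and scipy; the essays, grades, and dimensions are made up for illustration, and the actual AEA system and its PLSA variant are more involved.

```python
# Minimal sketch: LSA-based essay scoring evaluated with Spearman correlation.
from scipy.stats import spearmanr
from sklearn.decomposition import TruncatedSVD
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

essays = [
    "photosynthesis converts light energy into chemical energy",
    "plants use sunlight water and carbon dioxide to make glucose",
    "the sun is bright and plants are green",
    "glucose is produced from light water and carbon dioxide",
]
human_grades = [5, 5, 2, 4]
reference = essays[0]          # stand-in for a teacher's model answer

# Embed essays plus the reference in a small LSA space.
X = TfidfVectorizer().fit_transform(essays + [reference])
X_lsa = TruncatedSVD(n_components=2, random_state=0).fit_transform(X)

# System grade: cosine similarity of each essay to the reference.
system_scores = cosine_similarity(X_lsa[:-1], X_lsa[-1:]).ravel()
rho, _ = spearmanr(system_scores, human_grades)
print(rho)   # rank agreement between system scores and human grades
```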
Recently, improvements of latent semantic analysis (LSA), which stems from singular value decomposition to derive latent semantic classes, have been proposed; in particular, the hk-LSA model. The hk-LSA model is based on reducing the dimension of the vector space and on a probabilistic-like relationship between the document-term space and the latent-topic space. This improved model overcomes some shortcomings of standard LSA, such as processing very dense and orthogonal matrices and difficulties in parallelization. This paper deals with feasible ways to set up such a model and with statistical comparisons between the proposed setups, in order to identify a good configuration for the hk-LSA model. Case studies on this subject suggest ways to set up hk-LSA and show relationships between the standard LSA and hk-LSA models.
Latent semantic analysis
Citations (2)