Subscribe to DSC Newsletter

Information Retrieval Document Search Engine in R


In this post, we learn about building a basic search engine or document retrieval system using Vector space model. This use case is widely used in information retrieval systems. Given a set of documents and search term(s)/query we need to retrieve relevant documents that are similar to the search query. 

Problem statement:

The problem statement explained above is represented as in below image. 

Document retrieval system

High level design of document search system is shown below :

The content of the post is as follows:

  • Explaining various techniques used in Information retrieval  such as vector space models, term document matrix, similarity score calculation
  • Data description 
  • High level design of the document search system
  • Code implementation in R

Please go thorough the complete blog at below location:

Views: 2865

Tags: document, information, language, mining, natural, processing, retrieval, search, text


You need to be a member of AnalyticBridge to add comments!

Join AnalyticBridge

On Data Science Central

© 2021   TechTarget, Inc.   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service