Subscribe to DSC Newsletter

Information Retrieval Document Search Engine in R

Introduction:

In this post, we learn about building a basic search engine or document retrieval system using Vector space model. This use case is widely used in information retrieval systems. Given a set of documents and search term(s)/query we need to retrieve relevant documents that are similar to the search query. 

Problem statement:

The problem statement explained above is represented as in below image. 

Document retrieval system

High level design of document search system is shown below :

The content of the post is as follows:

  • Explaining various techniques used in Information retrieval  such as vector space models, term document matrix, similarity score calculation
  • Data description 
  • High level design of the document search system
  • Code implementation in R

Please go thorough the complete blog at below location:

http://www.dataperspective.info/2017/11/information-retrieval-docum...

Views: 621

Comment

You need to be a member of AnalyticBridge to add comments!

Join AnalyticBridge

Follow Us

On Data Science Central

On DataViz

On Hadoop

© 2017   AnalyticBridge.com is a subsidiary and dedicated channel of Data Science Central LLC   Powered by

Badges  |  Report an Issue  |  Terms of Service