A Data Science Central Community
In this post, we learn how to build a basic search engine, or document retrieval system, using the vector space model. This approach is widely used in information retrieval systems. Given a set of documents and a search query of one or more terms, we need to retrieve the documents that are most similar to the query.
The problem statement above is illustrated in the image below. …
Added by suresh kumar Gorakala on November 7, 2017 at 6:30am — No Comments
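The vector space model retrieval described in the first excerpt on this page can be illustrated with a minimal sketch. It is not taken from the original post. It assumes TF-IDF term weighting, cosine similarity as the relevance score, and a naive whitespace tokenizer. The function names (`tokenize`, `build_idf`, `tfidf_vector`, `cosine`, `search`) and the three-document corpus are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def build_idf(docs_tokens):
    """Smoothed inverse document frequency for every term in the corpus."""
    n_docs = len(docs_tokens)
    doc_freq = Counter()
    for tokens in docs_tokens:
        doc_freq.update(set(tokens))  # count each term once per document
    return {
        term: math.log((1 + n_docs) / (1 + df)) + 1.0
        for term, df in doc_freq.items()
    }


def tfidf_vector(tokens, idf):
    """Sparse TF-IDF vector as a dict; terms unseen in the corpus are dropped."""
    counts = Counter(tokens)
    return {term: count * idf[term] for term, count in counts.items() if term in idf}


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def search(query, documents):
    """Return (score, document) pairs sorted from most to least similar."""
    docs_tokens = [tokenize(doc) for doc in documents]
    idf = build_idf(docs_tokens)
    doc_vectors = [tfidf_vector(tokens, idf) for tokens in docs_tokens]
    query_vector = tfidf_vector(tokenize(query), idf)
    scored = [(cosine(query_vector, vec), doc) for vec, doc in zip(doc_vectors, documents)]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


# Made-up corpus, for illustration only.
documents = [
    "the cat sat on the mat",
    "dogs chase cats in the park",
    "search engines rank documents by relevance",
]
results = search("rank documents", documents)
for score, doc in results:
    print(f"{score:.3f}  {doc}")
```

In this sketch the query and every document are mapped into the same term space, so ranking reduces to sorting by the angle between vectors. A production system would likely add stemming, stop-word removal, and an inverted index rather than scoring every document.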
This post is the third part of the multi-part series on how to build a search engine.
Added by Vivek Kalyanarangan on December 30, 2016 at 6:00am — No Comments
This post is the second part of the multi-part series on how to build a search engine.
Added by Vivek Kalyanarangan on December 23, 2016 at 10:30am — No Comments
"Information is the oil of the 21st century, and analytics is the combustion engine." Peter Sondergaard, SVP, Gartner Research
In analytics, we retrieve information from various data sources, which can be structured or unstructured. The biggest challenge is retrieving information from unstructured data, mainly text. This is where machine learning comes into the picture. Different algorithms have been designed on different platforms…
Added by Vivek Kalyanarangan on September 9, 2016 at 8:30am — No Comments
I am pleased to announce my new book on social media analytics. The book offers concepts, tools, tutorials, and case studies to understand and analyze the seven layers of social media data, including text, actions, networks, apps, hyperlinks, search engine, and location…
Added by Dr. Gohar Khan on July 6, 2015 at 7:14pm — No Comments
By coincidence, I noticed from Google Insights for Search that there was some sort of correlation between web search interest in "American Idol" (a singing competition on US television) and "iPhone" (the Apple smartphone). Looking at the time series graph, it appears that American Idol searches are seasonal, running from January to May each year, which corresponds to the months in which the television series is broadcast. As soon as American Idol interest drops in June, searches for iPhone spike in the…
Added by Jimmy Wong on June 20, 2011 at 1:42am — No Comments
This is an excerpt from my blog post Working With Large Data Sets...
Over the past 18 months I’ve moved from working on the SMTP proxy to working on our other systems, all of which make use of the data we collect from each connection. It’s a fair amount of data, and it can be up to 2Kb in size for each connection. Our servers receive approximately 1,000 of these pieces of data per second, a rate that is fairly sustained due to our global…
Added by Phil Whelan on September 28, 2010 at 2:02pm — 1 Comment
Added by Andrzej Góralczyk on August 28, 2009 at 3:57pm — 1 Comment
© 2021 TechTarget, Inc.