A Data Science Central Community
Figure 1. Scatter plot of word embedding coordinates (coordinate #3 vs. coordinate #10). You can see that semantically related words are close to each other.
This blog post is an extract from chapter 6 of the book “From Words to Wisdom. An Introduction to Text Mining…Continue
Added by Rosaria Silipo on May 7, 2018 at 12:00am — No Comments
In this post, we learn about building a basic search engine or document retrieval system using Vector space model. This use case is widely used in information retrieval systems. Given a set of documents and search term(s)/query we need to retrieve relevant documents that are similar to the search query.
The problem statement explained above is represented as in below image. …Continue
Added by suresh kumar Gorakala on November 7, 2017 at 6:30am — No Comments
Recently I have come across a term, CRISP-DM - a data mining standard. Though this process is not a new one but I felt every analyst should know about commonly used Industry wide process. In this post I will explain about different phases involved in creating a data mining solution.
CRISP-DM, an acronym for Cross Industry Standard Process for Data Mining, is a data mining process model that includes commonly used approaches that data…
In recent years, I proposed three new kinds of data structure to represent itemsets. They are Node-list , N-list , and Nodeset . They use prefix-tree nodes to represent itemsets instead of the set of tranaction ids.
Since prefix-tree is usually high compressed, Node-list , N-list , and Nodeset  are much shorten than Tidset or diffset, which are two classical vertical representatnion of itemsets. Therefore, for frequent itemset mining, the algorithms based on Node-list,…
Added by Zhi-Hong Deng on April 10, 2014 at 8:13pm — No Comments
Maintenance, Repair and Overhaul (MRO) operations are of extreme importance for some industries. They are particularly crucial for airplanes, helicopters, trains and heavy production machinery (such as power plant equipment). In fact, some equipment has a lifespan of 20 to 25 years, and over the course of its usage period, for every euro spent to purchase the equipment, MRO costs will be 3 to 3.5 euros.
In this market, the distribution of MRO activities varies widely between…Continue
Added by Michel Bruley on March 10, 2014 at 2:35am — No Comments
Practicing Data science…
Added by Manish Bhoge on October 18, 2013 at 12:22pm — No Comments
Since February's launch of my book, Predictive Analytics: The Power to Predict Who Will Click, Buy, Lie, or Die, I have participated in a number of video interviews that explore the topic and field of predictive analytics. Here is a sampling:
Bloomberg TV – Predictive Analytics in Four Minutes:
Added by Eric Siegel on August 21, 2013 at 1:16pm — No Comments
The Financial Times reviewed my book, Predictive Analytics: The Power to Predict Who Will Click, Buy, Lie, or Die.
Click here to read the full Financial Times review (free membership required).
Excerpt from the book review:
Here's a review of my book Predictive Analytics from Robert Nisbet, Ph.D., a leading consultant, author, and predictive analytics instructor at University of California – Irvine (posted here with his permissoin).
Review of Predictive Analytics – The Power to Predict Who Will Click, Buy, Lie, or Die By Eric Siegel.
Robert Nisbet, Ph.D.
March 21, 2013
Predictions have a problem. They are viewed…Continue
Have you seen the press coverage related to my book, Predictive Analytics: The Power to Predict Who Will Click, Buy, Lie, or Die (Wiley, February 2013)?
If you're already an expert practitioner, these articles can serve to help ramp up your clients and coworkers.
If your work doesn't connect to data munching in any way, these articles (and the book) are still totally for you. This accessible book has been…Continue
Added by Eric Siegel on August 8, 2013 at 7:10am — No Comments
When you invest the time to read a book, you're investing a lot more than the $17 to buy it.
Many ask whether my book, Predictive Analytics: The Power to Predict Who Will Click, Buy, Lie, or Die, is at the right level for their needs. Is it too advanced? (Quick answer: definitely not.) Will it instruct me on how to execute on predictive analytics? (Not directly – it is an industry primer rather than a…Continue
Added by Eric Siegel on July 17, 2013 at 11:45am — No Comments
This blog post is to retract an aspect of a story about Chase Bank that I reported in my book, Predictive Analytics: The Power to Predict Who Will Click, Buy, Lie, or Die (published by Wiley, February 2013).
Although predictive analytics professionals usually tell accurate stories about their projects, sometimes the information that comes from human beings is less reliable than the data driven systems they…Continue
Added by Eric Siegel on July 8, 2013 at 12:30pm — No Comments
Who benefits by predicting your behavior? Organizations do—companies, governments, hospitals, and political campaigns. They employ predictive analytics, technology that learns from data to render per-person predictions, one individual at a time.
People have been struck by the final words in the title of my new book on this subject, Predictive Analytics: The Power to Predict Who Will Click, Buy, Lie, or Die (…
Added by Eric Siegel on April 4, 2013 at 1:53pm — No Comments
Joel Schectman at the Wall Street Journal wrote about a story broken in my new book, Predictive Analytics: The Power to Predict Who Will Click, Buy, Lie, or Die.
Wall Street Journal Article:
Book: HP Piloted Program to Predict Which Workers Would Quit
Joel Schectman, Wall Street Journal
Hewlett Packard Co. tested a predictive scoring system that attempted…Continue
Added by Eric Siegel on March 20, 2013 at 2:19pm — No Comments
What's next is what's next… Predictive analytics is where business intelligence is going.
— Rick Whiting, InformationWeek
Ten Predictions for the First Hour of 2020
Good morning. It's January 2, 2020, the first workday of the year. As you drive to the office, the only thing predictive analytics doesn't do for you is steer the car (yet that's coming soon as well).
1. Anti-theft. As you enter your car, a predictive model establishes your…
Added by Eric Siegel on February 26, 2013 at 9:34am — No Comments
The Fiscal Times is running an excerpt from my book, Predictive Analytics, about the Obama campaign's use of uplift modeling (aka net lift or persuasion modeling).
ARTICLE: The Real Story Behind Obama's Election Victory
By Eric Siegel
Elections hang by a thinner thread than you think.
By now you probably know that Barack Obama's 2012 campaign for a second term "moneyballed" the…
Added by Eric Siegel on February 14, 2013 at 9:30am — No Comments
Here is the preface for Predictive Analytics: The Power to Predict Who Will Click, Buy, Lie, or Die
By Eric Siegel, with a foreword from Tom Davenport
(Wiley, February 2013)
To order the book:…
Added by Eric Siegel on February 8, 2013 at 6:30am — No Comments
Webinar signup: Advances in Gradient Boosting: the Power of Post-Processing
December 14, 10-11 a.m., PST
Webinar Registration: http://2.salford-systems.com/gradientboosting-and-post-processing/
I. Gradient Boosting and Post-Processing:
o What is missing from Gradient Boosting?
o Why post-processing techniques are used?
Added by Lisa Solomon on November 29, 2012 at 1:31pm — No Comments
Indian School of Business, one of the premier management institution's in India and rated one among the Top 20 B-Schools in the world for their post graduate programs has collaborated with CrowdANALYTIX to initiate a contest that will help students collaborate with a large community of data scientists on a ideation problem. This is the first in a sequence of 4 including ideation and data mining…
Added by Aravind on October 24, 2012 at 12:52am — No Comments
Improving our presentation skills will make for happier clients and bosses. In this article, Tim Graettinger focuses on five crucial practices that will make your next presentation twice as good as your last.…Continue
Added by Daniel Graettinger on October 15, 2012 at 3:18pm — No Comments