Subscribe to DSC Newsletter

Featured Blog Posts – September 2010 Archive (28)

A three months trip, from Italy to China... without a driver !

VisLab's adventure on the Silk road…


Added by Vincent Granville on September 30, 2010 at 9:00pm — No Comments

Working With Large Data Sets

This is an excerpt from my blogpost Working With Large Data Sets...

For the past 18 months I’ve moved from working on the SMTP proxy to working on our other systems, all of which make use of the data we collect from each connection. It’s a fair amount of data and it can be up to 2Kb in size for each connection. Our servers receive approximately 1000 of these pieces of data per second, which is fairly sustained due to our global…


Added by Phil Whelan on September 28, 2010 at 2:02pm — 1 Comment

STATISTICA Data Miner wins the PAKDD-2010 Data Mining Competition

PAKDD 2010 Data Mining Competition In recent news about the… Continue

Added by Vincent Granville on September 27, 2010 at 2:41pm — No Comments

Data Mining Survey - Most Popular Tools Used To Solve Business Analytics Problems

510 of Analytic Bridge visitors have participated in a data mining tool survey, where the respondents were asked to make selection…


Added by Vincent Granville on September 27, 2010 at 2:39pm — No Comments

Social Media Insights from Predictive Analytics

Here is one more example on how Predictive Analytics may help professionals to make better decisions. For this post a total of 3000 Social Media title posts where analyzed to gain -hopefully- important insights for Social Media professionals. To achieve this, Text Mining was used to analyze the text of titles, identify the most important subjects (do posts…

Added by Vincent Granville on September 27, 2010 at 2:13pm — No Comments

Google's Schmidt to Colbert: Comment About Kids Changing Their Identity 'Was a Joke

Google CEO Eric Schmidt was on "The Colbert Report" for a "Google Chat" last night. As Stephen would say, "At least that's what it said when I looked it up on Bing."…


Added by Vincent Granville on September 26, 2010 at 6:50pm — No Comments

Web Analytics Salary Guide (source: IQ Workforce)

Our clients and candidates frequently ask for reliable salary guidelines for web analytics professionals. We have always directed people to the…


Added by Vincent Granville on September 26, 2010 at 6:48pm — No Comments

Twitter to Release a Real-time Analytics Solution

Twitter plans to launch a free analytics dashboard that will help its users – especially businesses – understand how others are interacting with their tweets.

Member of Twitter’s business development team Ross Hoffman has revealed at the Sports Marketing Summit that Twitter plans to launch the dashboard in the last quarter of 2010. He was speaking in the context of sports, but there’s no reason to believe the tool won’t be available to other users, too.

The team that works… Continue

Added by Vincent Granville on September 26, 2010 at 6:42pm — No Comments

IBM launches three new analytics centers in Europe


Added by Vincent Granville on September 26, 2010 at 6:38pm — No Comments


Hi All
Could anybody please let me know how to start with R-language. I want to learn it as it is an add-onn for many statistical softwares.
Manish Negi

Added by MANISH NEGI on September 25, 2010 at 8:25am — 2 Comments

Invitation to an Introduction to GNU-R

I'll be giving a talk[1] in October to introduce people to GNU-R[2] - a popular and free statistical language and computing environment.

The talk is being hosted by the Manchester Free Software group[3] and will be held at the Madlab[4] on 19/10/10 19:00-20:30.

Naturally I'll be taking questions on the day but if you can think of any particular topics that you would like me to cover then please post a comment with your suggestions.

I look forward to seeing you… Continue

Added by Robin Gower on September 24, 2010 at 7:51am — 2 Comments

How to data merging with not matching condtion.

I have two data sets data_A and data_B. data_A consists of costumer demographic information and data_B have costumer transactional information. costumer_ID is the common variable in both data set. now my question is how can i do data merging where costumer_ID not match in both data sets.

Please anyone help....

Added by Prashant on September 21, 2010 at 5:58am — 2 Comments

IBM to buy analytics firm Netezza for $1.7 billion

NEW YORK (Reuters) - IBM Corp said on Monday that it would buy data analytics company Netezza Corp for $1.7 billion to expand…


Added by Vincent Granville on September 20, 2010 at 9:31am — No Comments

New Kings of Wall Street: Math Geeks

The rapid-fire growth of high-frequency trading, HFT, has spawned a new breed of market mavens whose backgrounds are far…


Added by Vincent Granville on September 16, 2010 at 2:59pm — No Comments

Real time example for Normalization


this article, we are going to see normalization in action in a popular

web application. People who are not familiar with normalization please refer to my previous post.

We all know very well the capability of Google to exploit the… Continue

Added by Venkatesh Umaashankar on September 16, 2010 at 11:23am — No Comments

IBM Research Division Tackles Analytics, IT Services, Health Care

YORKTOWN HEIGHTS, N.Y.—"Watson" isn’t brushing up on any trivia today.…


Added by Vincent Granville on September 16, 2010 at 10:24am — No Comments

Data Preprocessing – Normalization

Data Preprocessing – Normalization


to introduction, in this article I am going to discuss “Data

Preprocessing” an important step in the knowledge discovery process, can

be even considered as a fundamental building block of data mining.

People who come from data…

Added by Venkatesh Umaashankar on September 16, 2010 at 4:06am — 1 Comment

StatSoft Polska Predicted Election Results


Added by Vincent Granville on September 14, 2010 at 5:56pm — No Comments

10,000 observations and 100,000 parameters: what to do?

Everybody learned in elementary statistical classes that when you have more parameters (in your statistical model) than observations, it is a recipe for disaster.

Here, I would like to provide two examples where more parameters than observations can successfully be handled:

  • Scoring system to detect fraud: logistic or linear regression: 500,000 binary rules (most of them with a triggering rate < 0.05%) , resulting in 500,000 regression…

Added by Vincent Granville on September 11, 2010 at 6:30pm — 8 Comments


“Modelling and Forecasting UK Mortgage Arrears and Possessions.”

JANINE ARON, Department of Economics, Oxford
JOHN MUELLBAUER, Nuffield College, Oxford
July, 2010

Abstract: This paper presents new models for aggregate UK data on mortgage possessions
(foreclosures) and mortgage arrears (payment delinquencies). The innovations include the
treatment of difficult to observe…

Added by John A Morrison on September 10, 2010 at 10:06pm — No Comments

Featured Monthly Archives














On Data Science Central

© 2020   TechTarget, Inc.   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service