VisLab's adventure on the Silk road…
Added by Vincent Granville on September 30, 2010 at 9:00pm —
This is an excerpt from my blogpost Working With Large Data Sets...
For the past 18 months I’ve moved from working on the SMTP proxy to working on our other systems, all of which make use of the data we collect from each connection. It’s a fair amount of data and it can be up to 2Kb in size for each connection. Our servers receive approximately 1000 of these pieces of data per second, which is fairly sustained due to our global… Continue
Added by Phil Whelan on September 28, 2010 at 2:02pm —
In recent news about the… Continue
Added by Vincent Granville on September 27, 2010 at 2:41pm —
510 of Analytic Bridge visitors have participated in a data mining tool survey, where the respondents were asked to make selection… Continue
Added by Vincent Granville on September 27, 2010 at 2:39pm —
Here is one more example on how Predictive Analytics may help professionals to make better decisions. For this post a total of 3000 Social Media title posts where analyzed to gain -hopefully- important insights for Social Media professionals. To achieve this, Text Mining was used to analyze the text of titles, identify the most important subjects (do posts… Continue
Added by Vincent Granville on September 27, 2010 at 2:13pm —
Google CEO Eric Schmidt was on "The Colbert Report" for a "Google Chat" last night. As Stephen would say, "At least that's what it said when I looked it up on Bing."…
Added by Vincent Granville on September 26, 2010 at 6:50pm —
Our clients and candidates frequently ask for reliable salary guidelines for web analytics professionals. We have always directed people to the… Continue
Added by Vincent Granville on September 26, 2010 at 6:48pm —
Twitter plans to launch a free analytics dashboard that will help its users – especially businesses – understand how others are interacting with their tweets.
Member of Twitter’s business development team Ross Hoffman has revealed at the Sports Marketing Summit that Twitter plans to launch the dashboard in the last quarter of 2010. He was speaking in the context of sports, but there’s no reason to believe the tool won’t be available to other users, too.
The team that works… Continue
Added by Vincent Granville on September 26, 2010 at 6:42pm —
Added by Vincent Granville on September 26, 2010 at 6:38pm —
Could anybody please let me know how to start with R-language. I want to learn it as it is an add-onn for many statistical softwares.
Added by MANISH NEGI on September 25, 2010 at 8:25am —
I'll be giving a talk in October to introduce people to GNU-R - a popular and free statistical language and computing environment.
The talk is being hosted by the Manchester Free Software group and will be held at the Madlab on 19/10/10 19:00-20:30.
Naturally I'll be taking questions on the day but if you can think of any particular topics that you would like me to cover then please post a comment with your suggestions.
I look forward to seeing you… Continue
Added by Robin Gower on September 24, 2010 at 7:51am —
I have two data sets data_A and data_B. data_A consists of costumer demographic information and data_B have costumer transactional information. costumer_ID is the common variable in both data set. now my question is how can i do data merging where costumer_ID not match in both data sets.
Please anyone help....
Added by Prashant on September 21, 2010 at 5:58am —
NEW YORK (Reuters) - IBM Corp said on Monday that it would buy data analytics company Netezza Corp for $1.7 billion to expand… Continue
Added by Vincent Granville on September 20, 2010 at 9:31am —
The rapid-fire growth of high-frequency trading, HFT, has spawned a new breed of market mavens whose backgrounds are far… Continue
Added by Vincent Granville on September 16, 2010 at 2:59pm —
this article, we are going to see normalization in action in a popular
web application. People who are not familiar with normalization please refer to my previous post.
know very well the capability of Google to exploit the… Continue
Added by Venkatesh Umaashankar on September 16, 2010 at 11:23am —
YORKTOWN HEIGHTS, N.Y.—"Watson" isn’t brushing up on any trivia today.… Continue
Added by Vincent Granville on September 16, 2010 at 10:24am —
to introduction, in this article I am going to discuss “Data
Preprocessing” an important step in the knowledge discovery process, can
be even considered as a fundamental building block of data mining.
People who come from data… Continue
Added by Venkatesh Umaashankar on September 16, 2010 at 4:06am —
Added by Vincent Granville on September 14, 2010 at 5:56pm —
Everybody learned in elementary statistical classes that when you have more parameters (in your statistical model) than observations, it is a recipe for disaster.
Here, I would like to provide two examples where more parameters than observations can successfully be handled:
- Scoring system to detect fraud: logistic or linear regression: 500,000 binary rules (most of them with a triggering rate < 0.05%) , resulting in 500,000 regression…
Added by Vincent Granville on September 11, 2010 at 6:30pm —
“Modelling and Forecasting UK Mortgage Arrears and Possessions.”
JANINE ARON, Department of Economics, Oxford
JOHN MUELLBAUER, Nuffield College, Oxford
Abstract: This paper presents new models for aggregate UK data on mortgage possessions
(foreclosures) and mortgage arrears (payment delinquencies). The innovations include the
treatment of difficult to observe… Continue
Added by John A Morrison on September 10, 2010 at 10:06pm —