May 2013 Blog Posts (22)

For The People, By The Data by Ron Vinson

RON VINSON (ART+DATA) Institute Fellow & Director of Media, Dept. of Technology for the City of San Francisco.

The city of San Francisco is one of the most technologically advanced cities in…


Added by Zach Piester on May 30, 2013 at 1:00pm — No Comments

Weekly Digest - June 3

Sponsored Announcement

  • Predictive Analytics World Chicago - June 2013. Hear how top practitioners deploy predictive modeling, and what kind of business impact it delivers. Will feature over 30 sessions with case studies across 2 tracks: 1)…

Added by Vincent Granville on May 30, 2013 at 12:00am — No Comments

Hashtag Myths, Real Time Meme Detection

Building a great list of Twitter followers and judiciously using #hashtags seem to be the holy grail for growing traffic and reaping rewards from social media. A good reference on the subject is the Mashable article How to Get the Most Out of Twitter…


Added by Vincent Granville on May 27, 2013 at 11:00pm — No Comments

New pattern to predict stock prices, multiplies return by factor 5

This pattern was found in recent price activity for the 500 stocks that are part of the S&P 500 index. For each day between 4/24 and 5/23, I looked at the companies that experienced the most extreme returns among these 500 companies, comparing that day's close with the previous day's close.

Then I looked at the daily performance the following…


Added by Vincent Granville on May 26, 2013 at 10:30pm — 6 Comments

Time Period for Analytical Positions Recruitment.

Although I have a rough estimate of how much time recruitment for each analytical position takes (i.e., from experience in the field), I thought of giving a quantitative…

Added by Dr. Pradeep Mavuluri on May 23, 2013 at 9:35pm — 6 Comments

Answer questions quickly with data - Sponsored by Tableau

Answer questions as fast as you can think of them with Tableau.

Born at Stanford and raised by a pack of wild PhDs, Tableau has best practices built right in. You get the benefit of years of research on the best way to represent data, from carefully optimized color schemes that tell clear stories, to built-in worldwide…


Added by Vincent Granville on May 22, 2013 at 3:30pm — No Comments

Cross-validation in R: a do-it-yourself and a black box approach

In my previous post, we saw that R-squared can lead to a misleading interpretation of the quality of our regression fit in terms of predictive power. One thing that R-squared offers no protection against is overfitting. Cross-validation, on the other hand, by allowing us to have cases in our testing set that are different from the cases in our training set, inherently offers protection against overfitting.

1. Do-it-yourself leave-one-out cross-validation in R.
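The contrast between training error and held-out error can be sketched in a few lines. The post itself works in R; the Python below is only an illustration under stated assumptions. The data are made up, the function names are hypothetical, and a 1-nearest-neighbor predictor is swapped in as a deliberately overfit model (it is not the regression discussed in the post).

```python
import random

def k_fold_splits(n, k, seed=0):
    """Yield (train_idx, test_idx) pairs; every case is tested exactly once."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = [j for f in folds[:i] + folds[i + 1:] for j in f]
        yield train, test

def nearest_neighbor_predict(train_x, train_y, x):
    """Deliberately overfit model: return the y of the closest training x."""
    best = min(range(len(train_x)), key=lambda i: abs(train_x[i] - x))
    return train_y[best]

def mse(actual, predicted):
    """Mean squared error between two equal-length sequences."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)

# Hypothetical data: y = 2x plus Gaussian noise.
rng = random.Random(42)
xs = [i / 2 for i in range(40)]
ys = [2 * x + rng.gauss(0, 3) for x in xs]

# Training error: each case is predicted by a model that has already seen it.
train_mse = mse(ys, [nearest_neighbor_predict(xs, ys, x) for x in xs])

# Cross-validated error: each case is predicted by a model that never saw it.
cv_actual, cv_pred = [], []
for train, test in k_fold_splits(len(xs), k=5):
    tx = [xs[i] for i in train]
    ty = [ys[i] for i in train]
    for i in test:
        cv_actual.append(ys[i])
        cv_pred.append(nearest_neighbor_predict(tx, ty, xs[i]))
cv_mse = mse(cv_actual, cv_pred)

print("training MSE:", train_mse)
print("5-fold CV MSE:", cv_mse)
```

The memorizing model reproduces its training cases exactly, so its training error says nothing about prediction; only the cross-validated figure exposes the overfit.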

In this type…


Added by Theophano Mitsa on May 22, 2013 at 8:06am — 2 Comments

When to use a view and when a table, the fuzz about it for analysis

When to use a view and when a table?

Is it an art? I doubt it. It is an art only when you cannot explain it and it depends on feeling.

Recently we have been revising our code, and the question of giving guidance on when to use a table and when to use a view came to me. I consider the following questions:

1. When is a table or view justified?

Having worked in a team setting on modeling and reporting, the key to me is readability. Leveraging what I learned about object-oriented programming, every…

Added by Jeffrey Ng on May 20, 2013 at 4:50am — 4 Comments

R (Web Server) Solutions - Amplifying Artichokes

Every month I see one or more new R-based web server solutions coming onto the market. After looking at some of them, I thought of sharing one of my old architecture maps, presented to a client back in early 2009 (it is good to see a scalable and customizable open source statistical computing tool spreading so quickly in the market).…


Added by Dr. Pradeep Mavuluri on May 19, 2013 at 10:52pm — No Comments

Weekly Digest - May 20

Selected articles, blog posts and forum questions from DSC, AnalyticBridge, BigDataNews


Added by Vincent Granville on May 17, 2013 at 11:30am — No Comments

The application of Propensity Score Matching

Propensity Score Matching is a statistical matching technique that attempts to estimate the effect of a treatment, policy or other intervention by accounting for the covariates that predict receiving the treatment. It helps to reduce bias due to confounding and can be used to estimate the counterfactual outcome.
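A minimal sketch of the idea, assuming a single covariate, a logistic model for the propensity score, and nearest-neighbor matching with replacement. All data and function names below are hypothetical; the outcomes were constructed so that the true treatment effect is 1.0 and treated units tend to have a higher covariate value, which confounds the naive comparison.

```python
import math

def fit_logistic(xs, ts, lr=0.5, steps=2000):
    """Fit P(treated | x) = sigmoid(b0 + b1*x) by plain gradient ascent."""
    b0 = b1 = 0.0
    n = len(xs)
    for _ in range(steps):
        g0 = g1 = 0.0
        for x, t in zip(xs, ts):
            p = 1 / (1 + math.exp(-(b0 + b1 * x)))
            g0 += t - p
            g1 += (t - p) * x
        b0 += lr * g0 / n
        b1 += lr * g1 / n
    return b0, b1

def mean(values):
    return sum(values) / len(values)

# Hypothetical units: (covariate x, treated flag, outcome).
# Outcome = 0.8 + 2*x + 1.0*treated, so the true effect is 1.0.
units = [
    (0.10, 0, 1.0), (0.20, 0, 1.2), (0.30, 0, 1.4), (0.35, 1, 2.5),
    (0.40, 0, 1.6), (0.45, 0, 1.7), (0.50, 1, 2.8), (0.55, 0, 1.9),
    (0.60, 1, 3.0), (0.70, 0, 2.2), (0.80, 1, 3.4), (0.90, 1, 3.6),
]

# Step 1: estimate each unit's propensity score from its covariate.
b0, b1 = fit_logistic([u[0] for u in units], [u[1] for u in units])
scored = [(1 / (1 + math.exp(-(b0 + b1 * x))), t, y) for x, t, y in units]
treated = [(s, y) for s, t, y in scored if t == 1]
control = [(s, y) for s, t, y in scored if t == 0]

# Step 2: match each treated unit to the control with the closest score.
diffs = []
for s, y in treated:
    _, matched_y = min(control, key=lambda c: abs(c[0] - s))
    diffs.append(y - matched_y)

naive = mean([y for _, y in treated]) - mean([y for _, y in control])
matched_effect = mean(diffs)

print("naive difference in means:", round(naive, 3))
print("matched estimate (effect on the treated):", round(matched_effect, 3))
```

In practice there are usually many covariates, and choices such as calipers, matching without replacement, and balance checks matter; this sketch leaves all of them out.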

For example, many of you will have been to a particular university or school and achieved a certain…


Added by Ian Morton on May 16, 2013 at 3:00am — 1 Comment

Strategy for building a “good” predictive model

By Ian Morton. Ian worked in credit risk for big banks for a number of years. He learnt about how to (and how not to) build “good” statistical models in the form of scorecards using the SAS Language.

Read original post and similar articles…


Added by Mirko Krivanek on May 15, 2013 at 10:00pm — 2 Comments

Big Data: market basket analysis and product affinities

Analytics offers a key capability here: identifying which products are sold together. That information can be used to shape targeted communication efforts, store layouts, and in-store promotions.


While looking into this topic I did some personal research and wrote up a synthesis, which helped me clarify some ideas. The attached presentation is not intended to be exhaustive, but it may bring you some useful insights.
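As a rough illustration of what "sold together" can mean in numbers, the sketch below counts how often pairs of products share a basket and reports support (the share of baskets containing both items) and lift (support divided by what independence would predict). The baskets and the function name are made up for the example; this is not taken from the attached presentation.

```python
from collections import Counter
from itertools import combinations

def pair_affinities(baskets):
    """Return support and lift for every product pair that co-occurs."""
    n = len(baskets)
    item_counts = Counter()
    pair_counts = Counter()
    for basket in baskets:
        items = sorted(set(basket))          # ignore repeated items
        item_counts.update(items)
        pair_counts.update(combinations(items, 2))
    results = {}
    for (a, b), count in pair_counts.items():
        support = count / n
        expected = (item_counts[a] / n) * (item_counts[b] / n)
        results[(a, b)] = {"support": support, "lift": support / expected}
    return results

# Hypothetical transactions.
baskets = [
    ["bread", "butter", "milk"],
    ["bread", "butter"],
    ["milk", "cereal"],
    ["bread", "butter", "cereal"],
    ["milk"],
]

affinities = pair_affinities(baskets)
for pair, stats in sorted(affinities.items(), key=lambda kv: -kv[1]["lift"]):
    print(pair, "support=%.2f" % stats["support"], "lift=%.2f" % stats["lift"])
```

A lift above 1 suggests the two products appear together more often than chance would predict; a lift below 1 suggests the opposite.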



Added by Michel Bruley on May 13, 2013 at 3:47am — No Comments

Analytical Skills Development Spending.

Challenged by the need for adaptability and agility, analytical service organizations are turning to real-world work & emergent…

Added by Dr. Pradeep Mavuluri on May 13, 2013 at 2:04am — No Comments

Key Non-ignorable Points for Agile Analytical Services.

Added by Dr. Pradeep Mavuluri on May 13, 2013 at 12:28am — No Comments

The End of Theory: The Data Deluge Makes the Scientific Method Obsolete | Wired

Here's my rebuttal to this article published in Wired in 2008.

Vincent's rebuttal:

A lot can be done with black-box pattern…


Added by Vincent Granville on May 12, 2013 at 7:30pm — 3 Comments

Use PRESS, not R squared to judge predictive power of regression

R squared, also known as coefficient of determination, is a popular measure of quality of fit in regression. However, it does not offer any significant insights into how well our regression model can predict future values. Instead, the PRESS statistic (the predicted residual sum of squares) can be used as a measure of predictive power. The PRESS statistic can be computed in the leave-one-out cross validation…
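A minimal sketch of the leave-one-out computation for a one-predictor regression, using hypothetical data and function names. Each case is dropped in turn, the line is refit on the rest, and the squared error of predicting the dropped case is added to the total. The companion quantity 1 - PRESS / SS_total is often reported as a "predicted R squared".

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x; returns (a, b)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b

def residual_ss(xs, ys):
    """Sum of squared residuals of the fit on all cases."""
    a, b = fit_line(xs, ys)
    return sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))

def total_ss(ys):
    my = sum(ys) / len(ys)
    return sum((y - my) ** 2 for y in ys)

def press(xs, ys):
    """Predicted residual sum of squares via leave-one-out refitting."""
    total = 0.0
    for i in range(len(xs)):
        a, b = fit_line(xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:])
        total += (ys[i] - (a + b * xs[i])) ** 2
    return total

# Hypothetical data with one unusual final point.
xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [1.1, 1.9, 3.2, 3.8, 5.1, 9.0]

r2 = 1 - residual_ss(xs, ys) / total_ss(ys)
predicted_r2 = 1 - press(xs, ys) / total_ss(ys)
print("R squared:", round(r2, 3))
print("PRESS:", round(press(xs, ys), 3))
print("predicted R squared:", round(predicted_r2, 3))
```

Because every prediction is made for a case the fit has not seen, PRESS is never smaller than the ordinary residual sum of squares, so the predicted R squared is never larger than R squared.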


Added by Theophano Mitsa on May 12, 2013 at 9:00am — 4 Comments

Building a good predictive model for credit risk

A colleague of mine wanted to understand how to build predictive models, and asked if I had a strategy for building them. I thought it would be useful to share this. For more details about each stage see my personal blog (My suggested strategy for building a “good” predictive model).…


Added by Ian Morton on May 9, 2013 at 10:00am — 1 Comment

Wiley's list of leading and interesting blogs to follow

Here are the top 10, in alphabetical order. Wiley's full list mentions many interesting statistical blogs.


Added by Vincent Granville on May 8, 2013 at 10:00am — 2 Comments

Predictive Analytics in Campaign Management

One of the most popular applications of predictive analytics is the optimization of marketing campaign management. I've implemented it several times for clients in four different industries, and here is my solution.

Introduction to campaign management

Marketing departments are usually responsible for managing marketing campaigns. Campaigns always have some trigger: they may be planned, or triggered by customer events and various alerts. Campaigns are run for a…


Added by Jozo Kovac on May 7, 2013 at 7:00pm — No Comments

On Data Science Central

© 2021   TechTarget, Inc.   Powered by
