Featured Blog Posts (1,455)

Outlier analysis: Chebyschev criteria vs approach based on Mutual Information

As often happens, I was doing many things at the same time, so during a break while I was working on a new post on applications of mutual information in data mining, I read the interesting paper on outlier detection suggested by Sandro Saitta on his blog (dataminingblog).

...Usually such behavior is not conducive to good results, but this time I think that the change of…


Added by Cristian Mesiano on May 23, 2012 at 2:20pm — No Comments

Web Report Studio: Adding Drill-Down Filter Based on a Date

When I was creating the Summary and Detailed reports for the SAS Global Forum paper, I was demonstrating how to link from the weekly chart to the detailed report about the week.  On my first try with my Week Filter based on the date value – it just would not work.  Eeek!  To fix the problem I made a new data item that was a character value.  This post talks about my strategy.  [You can read our paper "…


Added by Tricia Aanderud on May 22, 2012 at 4:20am — No Comments

12 selected AnalyticBridge and DataScienceCentral articles from the last 5 days

Here is our weekly selection, with several original articles from Vincent Granville (the Founder).

  • Four different ways to solve a data science problem - case study…

Added by Vincent Granville on May 18, 2012 at 4:30pm — No Comments

SAS BI: Does Your Organization Have a BI Strategy?


One of the best things about attending the SAS Global Forum is all the brilliant people you get to meet.  Guy Garrett’s presentation about planning a BI strategy was quite popular and I have to say he was very witty.  Turns out implementing a BI strategy is similar to dating – who knew?  Anyway – here’s a follow-up from Guy – I encourage you to sign up for the Achieve Intelligence monthly newsletter for more goodies.

What is…


Added by Tricia Aanderud on May 18, 2012 at 5:00am — No Comments

15 great data science articles from influential news outlets

This is our third post in our series of "great articles". For each article, click on the link after the title to read the full story.

1. How Big Data Will Disrupt the $9 Billion Music Publishing Rights Business - …


Added by Vincent Granville on May 16, 2012 at 11:30pm — No Comments

SAS Stored Processes: 3 Tips to Improve Your Prompts

Here are some usability tricks that I have learned with my SAS Stored Processes to make them more robust and harder to break.  The out-of-the-box prompts already provide a lot of functionality that really helps. That’s right – let’s build a better mousetrap!…


Added by Tricia Aanderud on May 16, 2012 at 6:01am — No Comments

Learn R - the lingua franca of data scientist!

New Courses from R Gurus


Looking to learn R, or to expand your R skills for data visualization or package development?

Here are some R courses presented by the experts you may be interested in:

June 19-20: Visualization in R with ggplot2. This course, presented by Garrett Grolemund & Dr. Winston Chang of Rice University, is also a web-based course with live presentation. This course provides instruction on data…


Added by James Peruvankal on May 16, 2012 at 2:15pm — No Comments

Understanding the Reality of Real-Time Analytics


Added by Vincent Granville on May 15, 2012 at 1:30pm — No Comments

Machine Learning in Python has never been easier

At BigML we believe that over the next few years automated, data-driven decisions and data-driven applications are going to change the world.  In fact, we think it will be the biggest shift in business efficiency since the dawn of the office calculator, when individuals had “Computer” listed as the title on their business card.  We want to help people rapidly and easily create predictive models using their datasets, no matter what size they are. Our…


Added by Jos Verwoerd on May 15, 2012 at 3:20am — No Comments

BiG DaTa & Vectorization


It has been a while since Big Data entered the market and created a buzz in the analytics world. Nowadays all analytics leaders are chanting about Big Data applications. Since I started working with Hadoop technologies and machine learning, one question has been bugging me:

Which is the greater innovation: Big Data or Machine Learning…


Added by Manish Bhoge on May 13, 2012 at 11:50pm — 1 Comment

SAS Global Forum: Here’s the Wrap Up!

SAS Global Forum 2012 was a success! After a whirlwind week of activities followed by a vacation and week of rest – I’m ready to give you some highlights.  It was a lot of fun! Tip: Click on any picture to enlarge it.

Day 1 – Saturday Ready for the Tweet-Up

The biggest drama was at the airport – our flight was delayed due to mechanical failure so I decided it might be better to take a later flight. Met…


Added by Tricia Aanderud on May 14, 2012 at 6:46am — No Comments

Email marketing: analytic tips to boost performance by 300% - case study

This post is part of our blog post series on data science case studies and success stories.

AnalyticBridge improved open rates by 300%, and dramatically improved total clicks and click-through rates using the following strategies:…


Added by Vincent Granville on May 13, 2012 at 1:00pm — No Comments

Four different ways to solve a data science problem - case study

Here we discuss four approaches to solve the following marketing problem: identify, each day, the most popular Google groups within a large list of target groups. You want to post in these groups only. The only information that is quickly available for each group is the time when the last posting occurred. Intuitively, the newer the last posting, the more active the group. There are some caveats such as groups…
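The recency intuition in this teaser can be illustrated with a minimal Python sketch. It is not one of the four approaches from the full article (which this excerpt does not show); it only ranks groups by the age of their last posting. The function name, group names, and timestamps are hypothetical and chosen purely for illustration.

```python
from datetime import datetime


def rank_groups_by_recency(last_post_times, now, top_n=3):
    """Return the top_n group names whose last posting is most recent.

    Assumption: a smaller age of the last posting means a more active
    group. This is only the naive intuition, not a validated method.
    """
    # Age of the last posting in hours for each group.
    ages = {
        name: (now - posted).total_seconds() / 3600.0
        for name, posted in last_post_times.items()
    }
    # Sort group names by ascending age and keep the first top_n.
    return sorted(ages, key=ages.get)[:top_n]


# Hypothetical group names and timestamps, for illustration only.
last_post_times = {
    "group-a": datetime(2012, 5, 12, 9, 0),
    "group-b": datetime(2012, 5, 10, 18, 30),
    "group-c": datetime(2012, 5, 12, 22, 15),
    "group-d": datetime(2012, 4, 30, 7, 45),
}
now = datetime(2012, 5, 12, 23, 30)

top_groups = rank_groups_by_recency(last_post_times, now, top_n=2)
print(top_groups)
```

A single timestamp per group is a noisy signal, which is presumably why the teaser mentions caveats; the sketch makes no attempt to handle them.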


Added by Vincent Granville on May 12, 2012 at 11:30pm — 4 Comments

The Math Behind Ticket Bargains | SeatGeek


Added by Capri on May 12, 2012 at 5:20pm — No Comments

Quickly start and optimize keyword advertising campaigns on Google in 7 days: an 11-step procedure

This is what I did, and it worked quite well. 

  1. Identify 10 top, high volume, well targeted keywords for your business. These are your seed…

Added by Vincent Granville on May 12, 2012 at 4:00pm — 2 Comments

More resources for data scientists and analytic professionals

Recently posted on DataScienceCentral and AnalyticBridge:

1. Conferences

  • Europe’s Best and Brightest come together at Analytics 2012 -…

Added by Capri on May 12, 2012 at 1:00pm — No Comments

R you ready for Big Machine Learning?

Recently, we released Python bindings for our API.  We received fantastic feedback on the related blog post from Hacker News and Twitter, so we started thinking about other languages that could benefit from a tighter integration with the…


Added by Justin Donaldson on May 11, 2012 at 11:40am — No Comments

SAS Enterprise Guide: Import the Excel Spreadsheet – Easy Peasy

One SAS Enterprise Guide feature I particularly like is the ability to import Microsoft Excel data quickly and easily.  SAS offers many ways to work with Excel spreadsheets but often I find I just want to extract data from Excel and get on with my job.  

Use a “Known Good” First Time

If you are trying this process for the first time, use a “known good” or simple spreadsheet so if any issues arise you can at least eliminate the data as the cause. When this process fails, I…


Added by Tricia Aanderud on May 10, 2012 at 10:00am — No Comments

Big Data Will Need 1.5 Million Data Scientists | Dice

How many of these jobs can be performed by bots (computer programs)? Here's the story:…


Added by Capri on May 9, 2012 at 2:46pm — No Comments

