For all the strides that data mining tools have made, using them well still requires hard work and critical thought. In this article on the nuts & bolts of DM, we review a real workhorse for data mining and analysis - the histogram. Among the histograms encountered most frequently in practice are the following: “money”, “count”, and “outlier”. We will look at each one of them in turn…. Read the… Continue
Added by Daniel Graettinger on June 19, 2012 at 7:07pm —
In this post we explore a handy technique used here at BigML for understanding numeric data.
If you are familiar with the MapReduce paradigm, you've probably seen the word count example. It's a simple, distributed way to reduce a large set of words… Continue
Added by Jos Verwoerd on June 19, 2012 at 4:11am —
We have seen few posts ago an approach to extract meta "concepts" from text based on latent semantic paradigm.
In this post we apply this approach to classify documents, and we do a comparison between this approach and the canonical bag of words.
The comparison test will be done through the ensemble method already showed in the last post.
To read the entire post click … Continue
Added by Cristian Mesiano on February 20, 2012 at 7:22am —
This is a quick entry to tell you about three things -- Continue
- My free report, "Text/Content Analytics 2011: User Perspectives on Solutions and Providers," is out. Are you looking for business value in "unstructured" social, online, or enterprise sources? My report will provide background information and "wisdom of the crowds" guidance you can use. Download the report free via altaplana.com/TA2011.
- For a deeper dive into…
Added by Seth Grimes on October 8, 2011 at 8:08pm —
There are four key data visualization techniques used by data analysis pros in the government and local law enforcement. As financial institutions, e-commerce organizations and social network analysts begin to apply data visualization more frequently, these techniques will help guide the process of uncovering meaningful insights hidden within mountains of disparate data. This post focuses on advanced data visualization using relationship graphs.
In our last post… Continue
Added by Tony Agresta on February 1, 2011 at 9:00am —
For quite some time, Intelligence Analysts in key government agencies have used these four techniques to reveal significant insights hidden beneath layers of disparate data. Today, they can be applied to your business. Just follow these guidelines.
Whether you are a fraud analyst, cyber security expert or social networking analyst, you'll reveal essential information to manage your business… Continue
Added by Tony Agresta on January 24, 2011 at 10:53am —
Businesses have a large and growing need to analyze data. This is no easy task today with the exploding volumes of data pouring in from everywhere, and the enormous pressure to turn these mountains of data into information that can be acted upon quickly.
It is no surprise that organizations spend over $15B annually on Business Intelligence (BI) and Data Mining technologies. But with all of the focus on infrastructure technologies, there is little emphasis on the… Continue
Added by Tony Agresta on January 5, 2010 at 8:44pm —
I recently read a blog post which said that ATMs would now be irrelevant! I could not disagree with it more!
For many of us, the ATM has become the true face of our bank. We hardly walk into our branch but we often use the ATM. Colin Henderson has this very interesting post which centers on the reality that with each transaction, a consumer is making a decision about their banking service.
“What has really happened is that the product experience has transferred from the… Continue
Added by Chandan Pathak on October 16, 2009 at 5:00am —
“Banks will let you overspend on your debit card in a way that is much, much more expensive than almost any credit card,” said Eric Halperin, director of the Washington office of the Center for Responsible Lending. Debit has essentially changed into a stealth form of credit, according to critics like him.
The problem is that banks charge you an overdraft fee when you spend more than what is in your account, instead of denying the purchase. Three-quarters of the largest American banks… Continue
Added by Chandan Pathak on September 17, 2009 at 6:40am —
I have submitted a post on my blog to describe the basics of monitoring brand using discourse analysis. See http://discourseweb.wordpress.com/2009/08/28/monitoring-brand/
Added by Andrzej Góralczyk on August 28, 2009 at 3:57pm —
I am happy to announce the release of Inference for R version 3.5. This is a significant release for Inference that includes several major features requested by users:
• Inference in PowerPoint: Embed and run R code in PowerPoint presentations just as you can in Word and Excel.
• Run-in-Place: Execute R code inside of an Excel, Word, or PowerPoint document and see the code results immediately in the same document – “preview” how your document will execute without having to… Continue
Added by Ben Hinchliffe on June 18, 2009 at 2:55pm —
Inference for R
places the power of R statistical analysis software inside the productivity rich environment of MS Office. With Inference for R you can combine R code, data, and text in Excel, Word, and PowerPoint documents to create reports and deploy analysis solutions.
While Excel spreadsheets are an excellent tool for simple analysis, they are ill suited for performing complex analysis… Continue
Added by Ben Hinchliffe on May 26, 2009 at 7:30am —
We’re pleased to announce a new update (Version 3.0.2) to Inference for R
. This update includes the following:
• Support for R 2.9
• Support for running Inference for R on 64-bit (x64) versions of Windows XP and Windows Vista
If you're a user of R 2.9 or 64-bit Windows, you should download a new version today. We’re offering extended trial license keys to anyone who would like to try this updated version. You may request an… Continue
Added by Ben Hinchliffe on May 22, 2009 at 11:37am —