Subscribe to DSC Newsletter

Featured Blog Posts – January 2012 Archive (18)

Document Classification: how to boost your classifier

ADaBoost.M1 tries to improve step by step the accuracy of the classifier analyzing its behavior on training set. (Of course you cannot try to improve the classifier working with the test set!!).

Here lays the problem, because if we choose as "weak algorithm" an SVM, we know that almost always it returns excellent accuracy on the training set with results closed to 100% (in term of true positive).

In this scenario, try to improve the accuracy of classifier assigning different weights…

Continue

Added by Cristian Mesiano on January 30, 2012 at 2:06pm — No Comments

Microsoft General Counsel and EVP, Brad Smith, Complains About How Hard it is to Find Analytic Talent

The complaint is also about the fact that Microsoft, located in Redmond, Washington, can't find local candidates and has to hire candidates from out-of-state (and I guess, from out-of-country), to fill big data, IT, data scientist and engineering positions. There was a long article today in the Seattle Times about how terrible Washington State is - according to Brad Smith - in terms of educating its citizens.…

Continue

Added by Vincent Granville on January 29, 2012 at 7:30pm — No Comments

Scoring technology used for credit scoring, healthcare, retail analytics and much more | Seattle Times

By CANDICE CHOI  - The Associated Press. Source: http://seattletimes.nwsource.com/avantgo/2017309601.html

Companies are getting smarter at predicting your next move.…

Continue

Added by Vincent Granville on January 29, 2012 at 7:00pm — No Comments

5 low-profile startups that could change the face of big data | Gigaom.com



 Big data is hot, but infrastructure-level platforms such as Hadoop, which focus on storage and processing, still need help to…

Continue

Added by Vincent Granville on January 29, 2012 at 3:51pm — No Comments

Copula Dependence Structure on Stock Market with Application to Risk

Copula Dependence Structure on Stock Market with Application to Risk

Shaoxuan Guan

Department of Mathematical Statistics

CHALMERS UNIVERSITY OF…

Continue

Added by John A Morrison on January 28, 2012 at 12:00am — No Comments

Operational analytics in electronic security environments

 

 

A.      INTRODUCTION

 

The science of…

Continue

Added by Gerhard Furter on January 25, 2012 at 1:48pm — No Comments

Article: MORE Frequently-Asked Questions about Data Mining

Another recent article from Discovery Corps, Inc. president Tim Graettinger:   Grab Bag 2:  More Frequently-Asked Questions about Data Mining.  

"While helping present a monthly webinar on Data Mining, I'm asked some challenging and really pivotal questions about DM and predictive analytics:  

  1. How much data do I need? 
  2. Why doesn't my predictive model perform as well on new data as it did on the training data?
  3. Are new DM algorithms needed to…
Continue

Added by Daniel Graettinger on January 23, 2012 at 1:26pm — 1 Comment

The Rapleaf Personalization Fund: $1MM available to analytic startups

Have a cool idea for a product related to data or personalization? We’ll help you build it! The Rapleaf Personalization Fund is $1 million of cash and in-kind awards (plus invaluable support from our partners) for cool startups that integrate with our data. We’re looking for awesome start-ups based on Rapleaf data that will take personalization to the next level.

Start your Application:…

Continue

Added by Vincent Granville on January 22, 2012 at 3:30pm — No Comments

Big Data is the Answer - What was the Question?

Interesting blog post by SmartAnalytics

Content:

  • Big Data, Little Data
  • Big Data Use Cases
  • E-tailing – E-Commerce – Online Retailing Use Cases (Interesting Use Case – Amazon Will Pay Shoppers $5 to Walk Out of Stores Empty-Handed)
  • Retail/Consumer Use Cases
  • Financial Services Use Cases (example: Trade Surveillance Use Case)
  • Web & Digital Media Services Use Cases
  • Government Use…
Continue

Added by Vincent Granville on January 21, 2012 at 7:12pm — No Comments

The crisis in the socio-economic continuum of modern civilization - as a historic challenge to...

The crisis in the socio-economic continuum of modern civilization - as a historic challenge to a qualitatively new role of socio-technologies for the expansion of consumption and market diversification is not so much a real possibility - but there is a historical inevitability ... .

Modern civilization has undoubtedly in many respects, the main product of the market economy and market relations. And the social foundations of the…

Continue

Added by Oleg Mazukabzov on January 21, 2012 at 10:03am — No Comments

StatSoft's STATISTICA Achieves SAP-Certified Integration With SAP NetWeaverR

StatSoft announced its STATISTICA  has achieved certified integration with the SAP NetWeaver® technology platform. STATISTICA, StatSoft’s flagship analytics software platform, works with SAP NetWeaver®  for data exchange.

Tulsa, OK – January 19, 2012 - StatSoft announced that its STATISTICA release has achieved certified integration with the SAP NetWeaver® technology platform.  STATISTICA, StatSoft’s flagship analytics software platform, works with…

Continue

Added by Vincent Granville on January 20, 2012 at 9:00am — No Comments

'Point in Time' versus 'Through the Cycle'

Point in Time (PiT) v Through the Cycle (TTC)

There used to be an argument about this; its clear to me now that this is more than an arithmetic conundrum about Ratings and RWAs its a philosophical matter applicable to wider aspects of life & thus any arguments for PiT are just lazy and myopic. Here are some good recent references; I have selected recent papers since the debate has evolved towards a conclusion since it first developed 2 or so years ago. The game has…

Continue

Added by John A Morrison on January 19, 2012 at 5:17am — No Comments

Northwestern, IBM collaborate on business analytics courses

Northwestern University and IBM are collaborating on new business and technology curricula to help students gain the latest skills in business analytics. The new courses of study, Masters of Science degree programs with analytics concentrations in the McCormick School of Engineering and Applied Science and the School of Continuing Studies, will better prepare students and current professionals who are seeking new analytics skills for today’s competitive job market.

The U.S. Bureau of…

Continue

Added by Vincent Granville on January 17, 2012 at 8:51pm — No Comments

Extract meta concepts through co-occurrences analysis and graph theory

....
So what I did is the following (be aware that is not the formal implementation of LSA!):
  1. Filter and take the base form of the words as usual.
  2. Build the multidimensional sparse matrix of the co-occurrences;
  3. I calculated for each instance the frequency to find it in the corpus;
  4. I calculated for each instance the frequency to find it in the doc;
  5. I weighted such TF-IDF considering also the distance among the…
Continue

Added by Cristian Mesiano on January 13, 2012 at 9:04am — No Comments

Latest Article: Data Mining Lessons from the Movie "Moneyball"

My company (Discovery Corps, Inc.)  president Tim Graettinger has released a new article I think you'll like:

"If you’re a data miner, there’s a good chance you saw the movie Moneyball.  For me, the movie was entertaining, but the book was really enlightening.  I found it to be a source of numerous and familiar lessons about data mining - while couched in the intriguing context of major league baseball.  In this article, I’d like to share some of…

Continue

Added by Daniel Graettinger on January 6, 2012 at 1:30pm — No Comments

Great apes think analytically | ScienceDaily

Great Apes Make Sophisticated Decisions: Research Suggests That Great Apes Are Capable of Calculating the Odds Before Taking Risks

ScienceDaily (Dec. 29, 2011) — Chimpanzees, orangutans, gorillas and bonobos make more sophisticated decisions than was previously thought. Great apes weigh their chances of success, based on what they know and the likelihood to…

Continue

Added by Vincent Granville on January 4, 2012 at 3:00pm — No Comments

Business Technology: So, What's Your Algorithm? | Wall Street Journal

We are ruined by our own biases. When making decisions, we see what we want, ignore probabilities, and minimize risks that uproot our hopes.

What's worse, "we are often confident even when we are wrong," writes Daniel Kahneman, in his masterful new book on psychology and economics called "Thinking, Fast and Slow."

An objective…

Continue

Added by Vincent Granville on January 4, 2012 at 3:00pm — No Comments

Missing Values Imputation with Bayesian Networks

A New Approach to Missing Values Processing with Bayesian Networks

Download the white paper here (39.5 MB).

With the abundance of “big data” in the field of analytics, and all the challenges today’s immense data volume is causing, it may not be particularly fashionable or pressing to discuss missing values. After all, who cares…

Continue

Added by Vincent Granville on January 4, 2012 at 2:39pm — No Comments

Featured Monthly Archives

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

On Data Science Central

© 2020   AnalyticBridge.com is a subsidiary and dedicated channel of Data Science Central LLC   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service