Subscribe to DSC Newsletter

Vincent Granville's Blog (773)

Data Mining Combined With Predictive Modeling Equal 3D Data Visualization

The interaction and cooperation between computers and the human brain is at a crossroad. There are some who believe that decision support systems should be completely automated. There are others who believe that there are many areas of business, technology, and science that have not been discovered yet, and, hence, only part of a decision support system can be automated. I subscribe to the latter proposition.



Computer science is, at its core, an attempt to replicate the processing,… Continue

Added by Vincent Granville on July 10, 2008 at 11:30pm — No Comments

Scorecards: Logistic, Ridge and Logic Regression

In the context of credit scoring, one tries to develop a predictive model using a regression formula such as Y = Σ wi Ri, where Y is the logarithm of odds ratio (fraud vs. non fraud). In a different but related framework, we are dealing with a logistic regression where Y is binary, e.g. Y = 1 means fraudulent transaction, Y = 0…

Continue

Added by Vincent Granville on July 8, 2008 at 5:00pm — No Comments

Netflix $1,000,000 Contest

The Netflix Prize seeks to substantially improve the accuracy of predictions about how much someone is going to love a movie based on their movie preferences. Improve it enough and you win one (or more) Prizes. Winning the Netflix Prize improves our ability to connect people to the movies they love.

www.netflixprize.com

Added by Vincent Granville on June 28, 2008 at 10:09am — No Comments

AT&T Invents Programming Language for Mass Surveillance

From the company that brought you the C programming language comes Hancock, a C variant developed by AT&T researchers to mine gigabytes of the company's telephone and internet records for surveillance purposes.



An AT&T research paper published in 2001 and unearthed today by Andrew Appel at Freedom to Tinker shows how the phone company uses Hancock-coded software to crunch through tens of millions of long distance phone records a night to draw up what AT&T calls… Continue

Added by Vincent Granville on June 26, 2008 at 4:07am — No Comments

New Journal: Statistical Analysis and Data Mining

Aims and Scope





Statistical Analysis and Data Mining addresses the broad area of data analysis, including data mining algorithms, statistical approaches, and practical applications. Topics include problems involving massive and complex datasets, solutions utilizing innovative data mining algorithms and/or novel statistical approaches, and the objective evaluation of analyses and solutions. Of special interest are articles that describe analytical techniques, and discuss their… Continue

Added by Vincent Granville on May 26, 2008 at 4:00pm — No Comments

Terrorism Study Drops a Bombshell on Boise

By Lyndsey Layton and Ashley Surdin

Washington Post Staff Writers

Saturday, April 5, 2008; Page A02



Quick: Name the Western U.S. city most vulnerable to a terrorist attack. Is it Los Angeles, with its crowded roads that make quick escape impossible? San Francisco and its iconic bridge? Or Seattle with its Space Needle and busy port?



Try Boise, Idaho, with its, um, potatoes.



A new study funded largely by the Department of Homeland Security ranked 132… Continue

Added by Vincent Granville on May 26, 2008 at 3:30pm — No Comments

Social Networks Not Generating Enough Revenue

While social networking is a red-hot topic at Revenue and around the online marketing space, eMarketer has revised its U.S. social network ad spend projections downward. The market researcher estimates that advertisers will spend $1.4 billion to place ads on online social networks in 2008, down from the previous projection of $1.6 billion.



U.S. online social network ad spend is now projected to reach $2.6 billion in 2012. In its last projection, made in December 2007, eMarketer… Continue

Added by Vincent Granville on May 15, 2008 at 9:00am — No Comments

Logistic Regression

130 keywords related to logistic regression

www.datashaping.com/logistic_regression.shtml

Added by Vincent Granville on May 5, 2008 at 11:30pm — No Comments

Interview with Edmund Freeman, V.P. of Direct Marketing Modeling for Washington Mutual

Bio



I was born in Michigan in 1961, went to the University of Michigan for my bachelors, and then post-graduate studies in pure mathematics at Wisconsin and Illinois. I finally got tired of working on topics that I couldn't talk to even my office mates about, so I got out with a MS in statistics. My first real job was fraud analysis for Medicare. I spent most of the 90's working for a bleeding-edge data mining firm called NeoVista and later Accrue. This century I've worked in health… Continue

Added by Vincent Granville on May 1, 2008 at 12:00am — No Comments

Interview with Ajay Ohri, Data Mining Consultant from India

Short Biography

Ajay Ohri has been working in the field of analytics since 2004 , when it was a still nascent emerging Industries in India. He has worked with the top two Indian outsourcers listed on NYSE,and with Citigroup on cross sell analytics where he helped sell an extra 50000 credit cards by cross sell analytics .He was one of the very first independent data mining consultants in India working on analytics products and domestic Indian market…
Continue

Added by Vincent Granville on April 19, 2008 at 9:15pm — No Comments

Top 10 challenging problems in data mining

  • Developing a unifying theory of data mining
  • Scaling up for high dimensional data and high speed data streams
  • Mining sequence data and time series data
  • Mining complex knowledge from complex data
  • Data mining in a network setting
  • Distributed data mining and mining multi-agent data
  • Data mining for biological and environmental problems
  • Data Mining process-related problems
  • Security, privacy and data…
Continue

Added by Vincent Granville on April 19, 2008 at 12:30pm — No Comments

Applying the Markov copulae approach to modeling credit derivatives

In the latest issue of the Journal of Credit Risk, Bielecki et al. propose a dynamic bottom-up approach by using Markov copula for pricing and hedging credit index derivatives and ratings-triggered corporate step-up bonds.



The Markov copula procedure works efficiently and is a useful step towards developing a copula-like formalism for multivariate processes, which can be applied to the modeling of credit derivatives. Read the full article and receive the current edition of the… Continue

Added by Vincent Granville on April 15, 2008 at 2:55pm — No Comments

Increasing customer loyalty using data analytics

Loyalty Marketing has become a key strategy for most companies in today's competitive marketplace. The practice is based on a very simple premise - as you develop stronger relationships with your best customers, they will stay with you longer; the longer they stay, the more profitable they become.



It costs less to retain a customer than to acquire a new one. To retain a customer involves many factors - personal relationships, product quality, customer service, price, and other brand… Continue

Added by Vincent Granville on April 6, 2008 at 12:15pm — 3 Comments

Predictive Model Markup Language - PMML

The Predictive Model Markup Language (PMML) is a mark up language for statistical and data mining models. Some sort of HTML language to handle statisical tasks such as regression, decision trees etc.



PMML is an XML-based language which provides a way for applications to define statistical and data mining models and to share models between PMML compliant applications.



PMML provides applications a vendor-independent method of defining models so that proprietary issues and… Continue

Added by Vincent Granville on April 5, 2008 at 1:30am — No Comments

Nonparametric regression: the LOESS procedure

PROC LOESS implements a nonparametric method for estimating local regression surfaces pioneered by Cleveland (1979); also refer to Cleveland et al. (1988) and Cleveland and Grosse (1991). This method is commonly referred to as loess, which is short for local regression.



PROC LOESS allows greater flexibility than traditional modeling tools because you can use it for situations in which you do not know a suitable parametric form of the regression surface. Furthermore, PROC LOESS is… Continue

Added by Vincent Granville on March 31, 2008 at 11:00pm — No Comments

Upcoming Data Mining Conferences - Submission Deadlines

ISIPS: Interdisciplinary Studies in Information Privacy and Security, due Mar 30 (extended)

International Conference on Advanced Intelligence (ICAI-08), due Apr 1

The Fifth Conference On Email and Anti-Spam (CEAS 2008), due Apr 3

Intelligent Techniques for Web Personalization & Recommender Systems, due Apr 7

EMAIL-2008: AAAI 2008 Workshop On Enhanced Messaging, due Apr 7

Inference and Estimation in Probabilistic Time-Series Models, due Apr 11

ICDM '08 workshop… Continue

Added by Vincent Granville on March 25, 2008 at 7:30am — No Comments

New Ideas to Forecast Stock Market Trends

My interest has mostly been in trading QQQQ and other major indexes, hoping to build a small portfolio of contrarian (negatively correlated) indexes. I am interested in two types of strategies:



1. A strategy where some cash is dormant on a trading account for most of the time, yielding a return of less than 4% a year when in "dormant mode". Once in a while (it could be every two or three years, sometimes every six months), when large movements occur on the stock market, I step in as… Continue

Added by Vincent Granville on March 22, 2008 at 11:36am — 2 Comments

Salary survey for Data Warehouse and Business Intelligence Professionals

The purpose of this report is to gain a better sense of the people and teams who built and maintained business intelligence (BI) and data warehousing (DW) solutions during the 2007 calendar year. This report uses the term “BI” to refer to both business intelligence and data warehousing initiatives, and the term “BI professionals” to the individuals who deliver these initiatives. Specifically, the report looks at…

Continue

Added by Vincent Granville on March 21, 2008 at 2:00am — No Comments

What is Six Sigma?

The concepts surrounding the drive to Six Sigma quality are essentially those of statistics and probability. In simple language, these concepts boil down to, “How confident can I be that what I planned to happen actually will happen?” Basically, the concept of Six Sigma deals with measuring and improving how close we come to delivering on what we planned to do.



Anything we do varies, even if only slightly, from the plan. Since no result can exactly match our intention, we usually… Continue

Added by Vincent Granville on March 15, 2008 at 9:09am — No Comments

Analyticcircle, Analytictunnel and Other Analytic Domains

The two domains AnalyticCircle.com and AnalyticTunnel.com both currently point to AnalyticBridge. Feel free to use them when you invite contacts to join our network. Other related domains include



Continue

Added by Vincent Granville on March 8, 2008 at 10:00pm — No Comments

Monthly Archives

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

On Data Science Central

© 2019   AnalyticBridge.com is a subsidiary and dedicated channel of Data Science Central LLC   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service