All Blog Posts (2,055)

Great Friday Reading

Here is our list of newly featured articles and resources:

Continue

Added by Vincent Granville on August 18, 2017 at 8:50pm — No Comments

Curious Mathematical Object: Hyperlogarithms

Logarithms turn a product of numbers into a sum of numbers: log(xy) = log(x) + log(y). Hyperlogarithms generalize the concept as follows: Hlog(XY) = Hlog(X) + Hlog(Y), where X and Y are any kind of objects, and the product and sum are replaced by operators in some arbitrary space. …
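To make the identity concrete, here is a minimal sketch with one familiar map that has this property. The choice is an assumption made purely for illustration and is not taken from the article: the objects are invertible 2x2 matrices, the product is matrix multiplication, and the hypothetical `hlog` is the logarithm of the absolute determinant.

```python
import math

def matmul2(a, b):
    """Product of two 2x2 matrices given as nested tuples."""
    return (
        (a[0][0] * b[0][0] + a[0][1] * b[1][0], a[0][0] * b[0][1] + a[0][1] * b[1][1]),
        (a[1][0] * b[0][0] + a[1][1] * b[1][0], a[1][0] * b[0][1] + a[1][1] * b[1][1]),
    )

def hlog(m):
    """Illustrative stand-in for Hlog: log|det(m)| for an invertible 2x2 matrix."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return math.log(abs(det))

# Two example matrices (arbitrary values chosen for the sketch).
X = ((2.0, 1.0), (0.0, 3.0))
Y = ((1.0, 4.0), (2.0, 1.0))

lhs = hlog(matmul2(X, Y))   # Hlog(XY)
rhs = hlog(X) + hlog(Y)     # Hlog(X) + Hlog(Y)
print(math.isclose(lhs, rhs))
```

The equality holds because the determinant of a product is the product of the determinants, so the ordinary logarithm finishes the job.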

Continue

Added by Vincent Granville on August 16, 2017 at 12:00pm — 1 Comment

Nice Generalization of the K-NN Clustering Algorithm -- Also Useful for Data Reduction

I describe here an interesting and intuitive clustering algorithm (that can be used for data reduction as well) offering several advantages over traditional classifiers:

  • Greater robustness against outliers and erroneous data
  • Much faster execution
  • Generalization of well-known algorithms

You don't need to know K-NN to understand this article -- but click here if you want to…

Continue

Added by Vincent Granville on August 15, 2017 at 12:00pm — No Comments

Great Saturday Reading

Here is our selection of featured articles and resources posted over the last few days:

Continue

Added by Vincent Granville on August 12, 2017 at 5:00pm — No Comments

Type I and Type II Errors in One Picture

This picture says more than words can. It explains the concepts of false positives and false negatives, that is, what statisticians refer to as Type I and Type II errors.

Other great pictures summarizing data science and statistical concepts can be found…
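The false positive versus false negative distinction can also be shown with a few lines of code. This is a minimal sketch on made-up labels, assuming the usual convention that the null hypothesis corresponds to "negative", so a Type I error is a false positive and a Type II error is a false negative. The function name `error_counts` and the data are hypothetical.

```python
def error_counts(actual, predicted):
    """Return (false positives, false negatives) for paired boolean labels."""
    pairs = list(zip(actual, predicted))
    false_positives = sum(1 for a, p in pairs if not a and p)   # Type I errors
    false_negatives = sum(1 for a, p in pairs if a and not p)   # Type II errors
    return false_positives, false_negatives

# Made-up example: True means the condition is really present / was flagged.
actual    = [False, False, True, True,  False, True]
predicted = [True,  False, True, False, False, True]

type_i, type_ii = error_counts(actual, predicted)
print(type_i, type_ii)
```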

Continue

Added by Vincent Granville on August 10, 2017 at 5:17pm — No Comments

Fighting eCommerce fraud with graph technology



ECommerce fraud is growing quickly, creating new challenges in terms of prevention and detection. As merchants gather more and more information about customers and their behaviors, the key element in the fight against fraud is now to draw on the connections within the data collected to uncover fraudulent behaviors. In this post we explain why and how graph technologies are crucial in the detection of eCommerce fraud.…

Continue

Added by Elise Devaux on August 9, 2017 at 9:30am — No Comments

Great Sunday Reading

Here is our selection of new featured articles for today:

Continue

Added by Vincent Granville on August 6, 2017 at 11:33am — No Comments

Data Science Simplified: Principles and Process

In 2006, Clive Humby, UK mathematician and architect of Tesco’s Clubcard, coined the phrase “Data is the new oil.” He said the following:

Data is the new oil. It’s valuable, but if unrefined it cannot be used. It has to be…

Continue

Added by Vincent Granville on August 3, 2017 at 4:30pm — No Comments

Introducing User Behavioral Analysis in the Risk Process

Many years ago, when I was entering the intelligence community, I attended a class in Virginia where the instructor opened the session with a test that I will never forget and that I have applied to almost every analytic task in my career. At the beginning of the class, we were shown a ten-minute video of Grand Central Station at rush hour, with tens of thousands of people, and were asked if we could find a single pickpocket in the crowd by the end of the video. At the end of ten minutes no…

Continue

Added by Andrew Marane on July 31, 2017 at 11:30am — No Comments

Capturing Low-Probability, High-Impact Events 'Black Swans' in Economic and Financial Models

Jamilu Auwalu Adamu, Lecturer, Nigeria

Incorporation of Fat-Tailed Effects of the Underlying Assets Probability Distribution using Advanced Stressed Methods.

Capturing the effects of Low-Probability, High-Impact "Black Swans" in the existing stochastic and deterministic models is tremendously…

Continue

Added by Jamilu Auwalu Adamu on July 31, 2017 at 8:30am — No Comments

Great Saturday Reading

Here is our list of featured articles and resources recently published on DSC:

Continue

Added by Vincent Granville on July 22, 2017 at 12:41pm — No Comments

Open sourcing 'spot the difference'

Capital One UK’s Data Science team has been focused on moving from proprietary (paid-for) software to open source for some time now.

There are several key benefits to making this change. Open source software is prevalent in academia, which makes it much easier for our new starters to hit the ground running, building models and analysing data on day one with the company (the switch has also been a terrific development opportunity for my team to learn new skills). Our team now has greater…

Continue

Added by Dan Kellett on July 21, 2017 at 1:30am — No Comments

Data Scientists Automated and Unemployed by 2025 - Update

Summary:  A year ago we wrote about the emergence of fully automated predictive analytic platforms including some with true One-Click Data-In Model-Out capability.  We revisited the five contenders from last year with one new addition and found the automation movement continues to move forward.  We also observed some players from last year have now gone in different directions.  …

Continue

Added by Vincent Granville on July 18, 2017 at 11:37am — No Comments

Great Saturday Reading

Here is our list of featured articles and resources added today:

Continue

Added by Vincent Granville on July 15, 2017 at 1:40pm — No Comments

How to Detect if Numbers are Random or Not

In this article, you will learn some modern techniques to detect whether a sequence appears random or not, whether it satisfies the central limit theorem (CLT) or not -- and what the limiting distribution is if the CLT does not apply -- as well as some tricks to detect abnormalities. Detecting lack of randomness is also referred to as signal versus noise detection, or pattern recognition.
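As a rough illustration of what a randomness check can look like, the sketch below computes the lag-1 autocorrelation of a sequence. This is a simple classical diagnostic chosen here as an assumption; it is not necessarily one of the techniques covered in the article, and the function name and test sequences are hypothetical.

```python
import random

def lag1_autocorrelation(xs):
    """Sample lag-1 autocorrelation: near 0 for independent noise, near 1 for a smooth trend."""
    n = len(xs)
    mean = sum(xs) / n
    var = sum((x - mean) ** 2 for x in xs)
    cov = sum((xs[i] - mean) * (xs[i + 1] - mean) for i in range(n - 1))
    return cov / var

rng = random.Random(0)                      # fixed seed so the sketch is repeatable
noise = [rng.random() for _ in range(5000)] # pseudo-random uniform values
ramp = [i / 5000 for i in range(5000)]      # fully predictable increasing sequence

r_noise = lag1_autocorrelation(noise)
r_ramp = lag1_autocorrelation(ramp)
print(round(r_noise, 3), round(r_ramp, 3))
```

A value close to zero is consistent with randomness but does not prove it; a sequence can pass this check and still fail others that look at longer lags or higher-order structure.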

It leads to the exploration of time series with massive, large-scale (long term) auto-correlation…

Continue

Added by Vincent Granville on July 10, 2017 at 12:00am — 4 Comments

Great Sunday Reading

Here is our selection of articles and resources posted in the last few days.

Continue

Added by Vincent Granville on July 9, 2017 at 10:07am — No Comments

Text Clustering: Get Quick Insights from Unstructured Data

In this two-part series, we will explore text clustering and how to get insights from unstructured data. The approach is meant to be powerful and industrial-strength. The first part will focus on the motivation. The second part will be about implementation.

This post is the first part of the two-part series on how to get insights from unstructured data using text clustering. We will build this in a very modular way so that it can be applied to any dataset. Moreover, we will also focus…

Continue

Added by Vivek Kalyanarangan on July 5, 2017 at 9:30pm — No Comments

Understanding the Changing Position Roles in Data Science

Is everyone a ‘data scientist’? What about ‘data engineers’ and the junior-versus-senior or skill-level distinctions? We do seem to need some agreement about titling. Data Scientist is still the prestige title, but there are some folks lobbying to take that title away. Click here to read more.

New blog post: 7 Great Articles About TensorFlow

  • Google + open-source = TensorFlow 
  • Deep Learning with TensorFlow in…

Continue

Added by Vincent Granville on July 5, 2017 at 6:23pm — No Comments

Great Friday Reading

Here is our selection of most recent featured articles and resources:

Continue

Added by Vincent Granville on June 30, 2017 at 1:51pm — No Comments

Data Science and Machine Learning Without Mathematics

There is a set of techniques covering all aspects of machine learning (the statistical engine behind data science) that does not use any mathematics or statistical theory beyond high school level. So when you hear that some serious mathematical knowledge is required to become a data scientist, this should be taken with a grain of salt.…

Continue

Added by Vincent Granville on June 26, 2017 at 4:54pm — No Comments

© 2017 AnalyticBridge.com is a subsidiary and dedicated channel of Data Science Central LLC