Subscribe to DSC Newsletter

Featured Blog Posts (1,513)

2020 Enterprise Analytics Trends: Deep Learning Delivers a Competitive Advantage

By 2022, Gartner predicts that 90% of corporate strategies will explicitly mention information as a critical enterprise asset and analytics as an essential competency.



“Increasingly, leading and thriving organizations in every segment are wielding data and analytics as a competitive weapon, operational accelerant, and innovation catalyst,” notes analysts in …

Continue

Added by Tricia Morris on February 19, 2020 at 8:26am — No Comments

New Books in AI, Machine Learning, and Data Science

We are in the process of writing and adding new material (compact eBooks) exclusively available to our members, and written in simple English, by world leading experts in AI, data science, and machine learning. In the upcoming months, the following will be added:

  • The Machine Learning Coding Book
  • Off-the-beaten-path Statistics and Machine Learning Techniques 
  • Encyclopedia of Statistical Science
  • Original Math, Stat and Probability Problems - with…
Continue

Added by Vincent Granville on December 1, 2018 at 6:26pm — No Comments

Which data protection techniques do you need to guarantee privacy?

This story was initially published here



The economics, legal, and corporate implications of data privacy are now too strong to be ignored. In the last decades,…

Continue

Added by Elise Devaux on September 30, 2020 at 10:30am — No Comments

Data-driven innovation in healthcare: synthetical clinical data

This articles discusses some of the data challenges that the healthcare industry faces. It also revisits how Statice's collaboration with the leading health organization Roche to test the use of synthetic medical data for clinical research and what opportunities we see from this.

The data challenges in the healthcare industry 

Maybe more than for other industries, research and innovation…

Continue

Added by Elise Devaux on September 14, 2020 at 4:12am — No Comments

Introduction to privacy-preserving synthetic data

This blog takes a closer look at the concept of privacy-preserving synthetic data. It answers the question “what is synthetic data” and looks at the origin of synthetic data in the context of data privacy. It also presents one way of generating privacy-preserving synthetic data and its benefits for organizations.…

Continue

Added by Elise Devaux on July 2, 2020 at 11:30am — No Comments

How “anonymous” is anonymous data?

This post discusses what actually makes data anonymous, share about the misconception we have of it and describe the problems it raises.

In the beginning, there was data…

Continue

Added by Elise Devaux on May 23, 2020 at 1:00pm — No Comments

Use the Data Insights Iceberg to Manage Stakeholder Expectations

One of the main challenges in data science projects is managing stakeholder expectations. Often those in the business will have little idea of the complexity and timescales of seemingly simple tasks.

Sourcing Data

Consider sourcing data. In some organisations, with a non-collaborative culture, something as simple as getting a file of data from IT can take weeks. Add on time to check the data, spend time with someone to explain it, handle revisions and…

Continue

Added by Andrew Watson on May 1, 2020 at 7:00am — No Comments

Python for Automating Your Quality Analysis

Analyzing the quality of your software is crucial to any business. The process appears towards the end of your software development lifecycle but indeed decides the fate of it. In other words, quality analysis demonstrates a process in which the actual output of the software is tested with its expected output. There are a variety of test inputs that are used in the process of quality analysis so that the product sheds light on the actual truth of where it…

Continue

Added by Divyesh Aegis on November 7, 2019 at 11:00pm — No Comments

40+ Modern Tutorials Covering All Aspects of Machine Learning

This list of lists contains books, notebooks, presentations, cheat sheets, and tutorials covering all aspects of data science, machine learning, deep learning, statistics, math, and more, with most documents featuring Python or R code and numerous illustrations or case studies. All this material is available for free, and consists of content mostly created in 2019 and 2018, by various top experts in their respective fields. A few of these documents are available on LinkedIn: see last…

Continue

Added by Vincent Granville on October 13, 2019 at 11:00am — No Comments

Python as a tool benefiting data scientists in many ways

Being extremely versatile general purpose, professional programming language, Python offers plenty of applications. Python language is user-friendly and simple to grasp and this made it popular throughout the world. Python plays a critical role for data scientists to find out lucrative job opportunities. 

Today, Python has become the most in-demand programming language in the data science world. Python offers an extensive range…

Continue

Added by Divyesh Aegis on September 5, 2019 at 12:00am — No Comments

Different Ways to Incorporate Data in Business Strategy for Security

In the data-driven enterprise system, Spark has become a popular name that is easy to use, offer speed and versatility. The data can be understood at fast speed allowing one to make faster decisions. The Big Data has a huge benefit with the faster data processing of Spark. This clustering of large datasets works with a framework in open source that helps in analyzing. The codes are done in the Scala that has made it possible and easier for data processing that gives a certain boost to the…

Continue

Added by Divyesh Aegis on August 13, 2019 at 12:51am — No Comments

The Power of Machine Learning Models

Properly implemented Machine Learning (ML) models can have a positive effect on organizational efficiency. It is first necessary to understand how these models are created, how they function, and how they are put into production.

The Definition of a Machine Learning Model

When a computer is presented with questions within a particular domain, a machine learning model will run an algorithm that will enable it to resolve those questions. These algorithms are not…

Continue

Added by Arash Aghlara on August 7, 2019 at 3:30am — 1 Comment

Is Python Completely Object Oriented?

Python was introduced in 1991 by Guido Van Rossum as a high level, general purpose language. Even today, it supports multiple programming paradigms including procedural, object oriented and functional. Soon, it became one of the most popular languages in the industry, and in fact is the very language that influence Ruby and Swift. Even TIOBE Index reports mentions python as the third most popular…

Continue

Added by Divyesh Aegis on July 16, 2019 at 12:55am — No Comments

Questions To Answer And Factors To Consider For Web Analytics

It will be unwise to expect you will generate lot of sales if you have significant amount of web traffic. It alone cannot be of much help in this matter. You will need to track the website metrics properly in order to take necessary measure to convert the traffic into your business prospects. You will need to analyze your website from time to time to ensure that it is not only accessible to the users but also provides all necessary guidance to show them the right way to make a…

Continue

Added by Jenny Richards on June 6, 2019 at 1:30am — No Comments

7 Simple Tricks to Handle Complex Machine Learning Issues

We propose simple solutions to important problems that all data scientists face almost every day. In short, a toolbox for the handyman, useful to busy professionals in any field.

1. Eliminating sample size effectsMany statistics, such as correlations or R-squared, depend on the sample size, making it difficult to…

Continue

Added by Vincent Granville on June 4, 2019 at 12:00pm — No Comments

Re-sampling: Amazing Results and Applications

This crash course features a new fundamental statistics theorem -- even more important than the central limit theorem -- and a new set of statistical rules and recipes. We discuss concepts related to determining the optimum sample size, the optimum k in k-fold cross-validation, bootstrapping, new re-sampling techniques, simulations, tests of hypotheses, confidence intervals, and statistical inference using a unified, robust, simple…

Continue

Added by Vincent Granville on May 4, 2019 at 12:30pm — No Comments

The graph visualization landscape 2019

Graph are meant to be seen



The third layer of graph technology that we discuss in this article is the front-end layer, the graph visualization one. The visualization of information has been the support of many types of analysis, including 
Social Network Analysis. For decades, visual representations have helped researchers,…

Continue

Added by Elise Devaux on April 9, 2019 at 4:00am — No Comments

The importance of Alternative Data in Credit Risk Management

The emergence of alternative data as a key enabler in expanding credit delivery and financial inclusion is unmistakable.

The saying that the only thing that is constant is change, is attributed to Heraclitus, the Greek Philosopher. This is so very relevant today in the way lenders use technology and scoring solutions to understand the credit worthiness of applicants. Credit Risk Management has come a long way from the days when banks used just one credit score cut off to…

Continue

Added by Naagesh Padmanaban on March 25, 2019 at 11:15pm — No Comments

Fascinating Developments in the Theory of Randomness

I present here some innovative results from my most recent research on stochastic processes. chaos modeling, and dynamical systems, with applications to Fintech, cryptography, number theory, and random number generators. While covering advanced topics, this article is accessible to professionals with limited knowledge in statistical or mathematical theory. It introduces new material not covered in my recent book (available …

Continue

Added by Vincent Granville on March 21, 2019 at 7:30am — No Comments

The graph analytics landscape 2019

Read part 1 - The graph database landscape

The graph analytics landscape 2019

Graph analytics frameworks consist of a set of tools and methods developed to extract knowledge…

Continue

Added by Elise Devaux on February 27, 2019 at 5:00am — No Comments

Featured Monthly Archives

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

On Data Science Central

© 2020   TechTarget, Inc.   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service