A Data Science Central Community
We study the properties of a typical chaotic system to derive general insights that apply to a large class of unusual statistical distributions. The purpose is to create a unified theory of these systems. These systems can be deterministic or random, yet due to their gentle chaotic nature, they exhibit the same behavior in both cases. They lead to new models with numerous applications in Fintech, cryptography, simulation and benchmarking tests of statistical hypotheses. They are also…Continue
Added by Vincent Granville on November 29, 2019 at 2:30am — No Comments
Summary: 99% of our application of NLP has to do with chatbots or translation. This is a very interesting story about expanding the bounds of NLP and feature creation to predict bestselling novels. The authors created over 20,000 NLP features, about 2,700 of which proved to be predictive with a 90% accuracy rate in predicting NYT bestsellers.…Continue
Added by Vincent Granville on November 28, 2019 at 10:00pm — No Comments
In this article, we explore a new type of generalized univariate normal distributions that satisfies useful statistical properties, with interesting applications. This new class of distributions is defined by its characteristic function, and applications are discussed in the last section. These distributions are semi-stable (we define what this means below). In short it is a much wider class than the stable distributions (the only stable distribution with a finite variance…Continue
Added by Vincent Granville on November 27, 2019 at 11:14pm — No Comments
Machine learning is a hot topic in research and industry, with new methodologies developed all the time. The speed and complexity of the field makes keeping up with new techniques difficult even for experts — and potentially overwhelming for beginners.
To demystify machine learning and to offer a learning path for those who are new to the core…Continue
Added by Vincent Granville on November 27, 2019 at 10:58am — No Comments
This article is by Jorge Castañón, Ph.D., Senior Data Scientist at the IBM Machine Learning Hub.
Data visualization plays two key roles:
1. Communicating results clearly to a general audience.
Added by Vincent Granville on November 12, 2019 at 10:00am — No Comments
Analyzing the quality of your software is crucial to any business. The process appears towards the end of your software development lifecycle but indeed decides the fate of it. In other words, quality analysis demonstrates a process in which the actual output of the software is tested with its expected output. There are a variety of test inputs that are used in the process of quality analysis so that the product sheds light on the actual truth of where it…Continue
Added by Divyesh Aegis on November 7, 2019 at 11:00pm — No Comments