Subscribe to DSC Newsletter

All Blog Posts Tagged 'science' (17)

Book on Computer Programming

Data scientists use a range of tools in their work and some of these eventually require programming. This book, titled The Art and Craft of Computer Programming, is a guide to computer programming. It does not focus on a specific programming language, but instead contains the essential material from a first year Computer Science course. The book is available from Amazon.com.…

Continue

Added by Mark McIlroy on October 19, 2017 at 8:00pm — 1 Comment

Life is better when you're meeting new data people

One of my favorite things over the year was starting a personal blog. (You can find my website here if you are curious.) How did it happen? Well, I was reading an article and one quote in particular really struck me: "it's not what you know it's who you know"

That quote really resonates with me. Throughout my life I’ve learned a lot, and…

Continue

Added by Olga on November 2, 2016 at 9:03pm — No Comments

Data Science is Cooking!

Most people think data science is smart people doing very smart stuff. Well that’s not it. Data science is just another subject involving its own bit of subtle complexities that has to be handled with knowledge and an innovative approach. JUST LIKE COOKING.

data-science-is-cooking

Getting the Data

Cooking is art and science. So is Analytics. Both start from getting the right ingredients. No matter how many spices and cooking techniques you apply, the dish won’t…

Continue

Added by Vivek Kalyanarangan on November 1, 2016 at 10:00am — No Comments

Building an end-end search engine

"Information is the oil of the 21st century, and analytics is the combustion engine" Peter Sondergaard, SVP, Gartner Research

In analytics, we retrieve information from various data sources; it can be structured or unstructured. The biggest challenge here is to retrieve information from unstructured data mainly texts. Here machine learning comes into the picture to overcome this challenge. Different algorithms have been designed in different platforms…

Continue

Added by Vivek Kalyanarangan on September 9, 2016 at 8:30am — No Comments

Making data science accessible – HDFS

By Dan Kellett, Director of Data Science, Capital One UK

 

Disclaimer: This is my attempt to explain some of the ‘Big Data’ concepts using basic analogies. There are inevitably nuances my analogy misses.

 

What is HDFS?

When people talk about ‘Hadoop’ they are usually referring to either the efficient storing or processing of large amounts of data. MapReduce is a framework for efficient processing using a parallel, distributed algorithm…

Continue

Added by Dan Kellett on July 21, 2016 at 2:00am — No Comments

Ideas for improving already good models

You’re working on the MAIN MODEL. The one that leverages half the company’s assets, and on which your paycheck and that of many others depends. You’ve already run through a stepwise, forward, and backward search of the variables, their interactions, and possible curvatures. What are the most productive things to do next?

 

Here are a couple of ideas…

Continue

Added by David G. Young on April 27, 2016 at 8:07am — No Comments

Data Science with R

As R programming language becoming popular more and more among data science group, industries, researchers, companies embracing R, going forward I will be writing posts on learning Data science using R. The tutorial course will include topics on data types of R, handling data using R, probability theory, Machine Learning, Supervised – unSupervised learning, Data Visualization using R, etc. Before going further, let’s just see some stats and tidbits on data science and…

Continue

Added by suresh kumar Gorakala on December 29, 2015 at 9:30am — 1 Comment

Cross Industry Standard for Data Mining

Recently I have come across a term, CRISP-DM - a data mining standard. Though this process is not a new one but I felt every analyst should know about commonly used Industry wide process. In this post I will explain about different phases involved in creating a data mining solution. 



CRISP-DM, an acronym for Cross Industry Standard Process for Data Mining, is a data mining process model that includes commonly used approaches that data…

Continue

Added by suresh kumar Gorakala on October 22, 2015 at 10:59am — 2 Comments

Is opensource enough? The Need for an Open Architecture Analytics

The job of a data analyst nowadays has become very extensive, in its need to cover a number of different and ever-changing tasks.

A data analyst must query a variety of internal and external data sources, each with a different access protocol and format; integrate these data with results from REST and web services queried over the Internet, such as Google API or any social media channel; exchange information with business analysts, who, while lacking the deep mathematical background,…

Continue

Added by Rosaria Silipo on July 14, 2015 at 2:27am — No Comments

Random Forest in Python

Random Forest is a machine learning algorithm used for classification, regression, and feature selection. It's an ensemble technique, meaning it combines the output of one weaker technique in order to get a stronger result.

The weaker technique in this case is a decision tree. Decision trees work by splitting the and re-splitting the data by…

Continue

Added by Alex Woods on July 4, 2015 at 8:30am — No Comments

Data scientists are wasting their time

We all know that time is money, especially when you're paying a data scientist. But the New York Times reports that... 

"Data scientistsaccording to interviews and expert estimates, spend 50 percent to 80 percent of their time mired in [the] mundane labor of collecting and preparing unruly digital data, before it can be explored for useful nuggets."…

Continue

Added by Jennifer Methvin on June 25, 2015 at 4:30am — 4 Comments

Accessing Big Data with KNIME

Continue

Added by Rosaria Silipo on May 6, 2015 at 12:30am — No Comments

Data Scientist, The Magician?

Isn't it true? Isn't this what most of the business folks and CXO's expect from their data science

teams? Yes in fact, this is what we have been told in nearly all of conferences we attend. Thumping case studies, feverish pitch makes one believe in the story and you walk out with a true sense of achieving same…
Continue

Added by Vikas Kamra on December 1, 2014 at 4:30am — No Comments

Build basic recommendation engine using R

In our day to day life, we come across a large number of Recommendation engines like Facebook Recommendation Engine for Friends’ suggestions, and suggestions of similar Like Pages, Youtube recommendation engine suggesting videos similar to our previous searches/preferences. In today’s blog post I will explain how to build a basic recommender System.…

Continue

Added by suresh kumar Gorakala on June 5, 2014 at 10:55pm — No Comments

Read this tutorial before you use Proc Corr

All of us at some point in the process of examining…

Continue

Added by aatash shah on February 27, 2014 at 5:36am — No Comments

Warm-up exercise before data science.

Practicing Data science…

Continue

Added by Manish Bhoge on October 18, 2013 at 12:22pm — No Comments

Which are the data products one can curate for AIRLINE industry ?

Here is a crisp info graphic which communicates the top 5 data products which can be $ denting in the Airline industry

http://blog.fluturasolutions.com/2013/06/which-data-products-can-make-dent-in.html

1. Property reccomender

2. Word of mouth modeler

3, Funnel friction Spotter

4. Traveller churn scorer

5. Sentiment Analyzer

Added by derick on June 21, 2013 at 12:43pm — 1 Comment

Monthly Archives

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

On Data Science Central

© 2019   AnalyticBridge.com is a subsidiary and dedicated channel of Data Science Central LLC   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service