Where: Sentiment Analysis Symposium,
New York, NY
Date: March 5, 2014
Time: 1:30 – 5:00 p.m. …
A Data Science Central Community
In this post, I've tried to capture some of the common aspects of working in the analytics industry. While we occasionally hear about India growing fast into this space, there are a lot of things happening in India that might transform this field further. While some of these aspects are specific to what I've observed in India, a lot of them are generic.
As in previous posts, I try to classify these aspects under different heads:
Added by Amogh Borkar on August 13, 2014 at 1:32am — No Comments
While having a many-to-many relationship may sound steamy to some of you, when it comes to the data in your tables it can get complicated. That is, unless you understand the different scenarios, their resolutions, and how to build a good relationship with your data. Just as a general overview, a relationship specifies the logic used to combine data from one or more tables. You create relationships by connecting fields between two…Continue
Added by Elana Roth on July 10, 2014 at 11:00am — No Comments
We’ve just wrapped up another eventful season of the annual phenomenon that is the Indian Premier League. Records were broken, trophies and caps won, and careers and fortunes made in a span of 6 high-octane weeks. Fans, being the lifeblood of any successful sporting event, were given extra-special attention this time around. The millions of tweets that social-media savvy fans across India tweeted during the tournament formed a veritable growing corpus of social content that was used through…Continue
Added by Farid Jalal on June 27, 2014 at 12:42am — No Comments
Words Like Data Lake or Big data are taking on the world these days. Let’s try to understand what they actually means
Data Lake - A data lake is a storage repository that holds a vast amount of raw data in its native format. A data lake uses a flat architecture to store data. Each data element in a lake is assigned a unique identifier and tagged with a set of extended metadata tags. When a business question arises, the…Continue
Added by Vishal Sharma on June 16, 2014 at 6:00am — No Comments
In our day to day life, we come across a large number of Recommendation engines like Facebook Recommendation Engine for Friends’ suggestions, and suggestions of similar Like Pages, Youtube recommendation engine suggesting videos similar to our previous searches/preferences. In today’s blog post I will explain how to build a basic recommender System.…
Added by suresh kumar Gorakala on June 5, 2014 at 10:55pm — No Comments
What separate a Big data from a huge data? I have seen the abuse of Big Data being adopted by the database and BI vendors. It lacks the spirit. Merely calculating faster or storing more should not qualify for a Big Data thinking. It is purely technological breakthrough - just like having a faster CPU. Will you call an intel CPU i7 a change of era from i5? Here are my observations and principles to be Big Data:
1. Collect the impossible
With the emergence of e-government, more…Continue
All studies show that, with globalization, large companies are faced with an increasing number of decisions and that these decisions are more complex and involve greater quantities of data. In today’s world, clearly one must have access to strategic and operational intelligence throughout the entire company and be able to take actions that are appropriate for the dynamics of the business…
Added by Michel Bruley on February 11, 2014 at 1:32am — No Comments
Both R & Python should be measured based on their effectiveness in advanced analytics & data science. Initially, as a new comer in data science field we spend good amount of time to understand the pros and cons of these two. I too carried out this study solely for “self” to decide which tool should i pick to get in depth of data science. Eventually, i have started realizing that both (R & Python) has its space of mastery along with their broad support to data science. Here some…Continue
Added by Manish Bhoge on February 7, 2014 at 11:22pm — No Comments
Where: Sentiment Analysis Symposium,
New York, NY
Date: March 5, 2014
Time: 1:30 – 5:00 p.m. …
Added by Steven Ramirez on February 3, 2014 at 2:30pm — No Comments
For a company, having "a presence" in social media is worthless unless it allows it to achieve some objectives. However, beyond the control of the company, social media talk about the company, under these conditions is it reasonable to ignore this space frequented by customers, prospects, partners and competitors?
Social media influences reputation, image, and…
Added by Michel Bruley on January 7, 2014 at 2:07am — No Comments
The term "Data Science" has been evolving not only as a niche skill but as a niche process as well. It is interesting to study "how" the Big data analytics/Data Science/Analytics can be efficiently implemented into the enterprise. So, along with my typical study of analytics viz. Big data analytics I have been also exploring the methodologies to bring the term "Data Science" into mainstream of existing enterprise data analysis, which we conventionally know as "Datawarehouse & BI". This…Continue
It is a known fact that development of sports activities is not a top priority in the national budget of most of the developing countries. It can also be established that sports activities are not an active part of the eductional system of most developing countries. I’ll not be wrong if I say that the investment into sports in developed countries is much higher than in the developing countries. Given this backdrop, a natural question that comes to mind is: “Will developed countries reap all…Continue
Added by Ashish Soni on November 13, 2013 at 3:38am — No Comments
The latest buzz words, ‘data analytics’, has swept most of the Industries, as far as I have researched, and sports industry is no exception. Just like other industries, data analytics has found wide variety of applications within the sports industry. This includes players’ performance analysis, opposition strength/weakness analysis, in-game strategy analysis and a lot more. In this article, I have tried to cover some…Continue
Added by Ashish Soni on November 8, 2013 at 10:02pm — No Comments
In very simple words, the process of predicting the probability of an event using mathematical models is defined as predictive modelling. In different fields there has been a wide-spread application of predictive modelling. This is mainly for the purpose of decision making. Recently, the technological advancements have made it possible for predictive modelling to expands its reach within the healthcare sector.…Continue
With the development of web technologies, an increasing amount of opinions are published online every day!!
People rely on the reviews more than before to help determine the quality of product in which they are interested!!
We use the chatter data from Twitter.com to forecast box-office revenues for movies!!
The data that is being generated on the social networks as a result of human activities is commonly referred as Social Data. This is one of the largest streams of…Continue
Added by Ashish Soni on November 2, 2013 at 2:07am — No Comments
Added by Patrick C. Walker, MPA on November 1, 2013 at 10:17am — No Comments
Today, analytics-driven best practices dictate that, content collecting portals and other data-centric projects undergo the process of data cleaning and enrichment because analytic-oriented failures (i.e. descriptive, predictive, and prescriptive analytics models or reports) are often linked to the quality of data. To halt the perpetuation of bad data across the enterprise, initial “clean and enrich” initiatives have been effective including gathering missing or incomplete data, identifying…Continue
Added by Patrick C. Walker, MPA on October 27, 2013 at 8:00pm — No Comments
I have a 5-20K USD budget to get a tool (w/wo dedicated hardware), including training for 2 people, to prepare (clean, merge, etc...) and summarize (samples, grouping/aggregations, etc) large data sets (over 5GBs, over 4 million rows).
I need an efficient solution (fast data processing and fast to learn), and definitely easy to use. I would have done it with excel if excel could handle such large data files.
After preparing and summarizing the data, I'll use excel pivot tables…Continue
Currently, high schools have some of the biggest challenges determining the individual paths of students that enter the school yard each day. In fact, both institutions and industries would like to know, for example, which student will graduate from a particular course program and which student will need assistance in order to graduate on time. Which graduates are most receptive to what kind of high education institution? Which non-graduates are most receptive to what kind of intervention to…Continue
Added by Patrick C. Walker, MPA on October 11, 2013 at 9:30am — No Comments
Historically, Leadership and Organizational assessments were recommended or promoted as means to determine if enterprises were effective and efficient. Often determined when all elements of the organization were aligned properly, resulting in superior performance. Today, some professionals advise their clients in ways to improve performance by conducting alleged unique assessment models similarly to the traditional form (e.g. gut-feeling approaches). The purpose of these assessment models…Continue
Added by Patrick C. Walker, MPA on October 3, 2013 at 11:00am — No Comments