A Data Science Central Community
We discuss here a large class of big data problems where MapReduce can't be used - not in a straightforward way at least - and we propose a rather simple analytic, statistical solution.
MapReduce is a technique that splits big data sets into many smaller ones, process each small data set separately (but simultaneously) on…Continue
I am trying to determine the rate of cannibalization of product sales for A with product B. I am using ~ 2 years of daily sales data for product A and then ~8 months of data for product B. That is, product B launched 8 month ago. I am using the longer series for A to capture trend effects in the business (natural or industry rate of decline or growth). I also control for seasonality by using a weekday variable (= day of week 1-7) and a week number variable (1-52).
If I use the log-log…Continue
This survey is aimed at analytic practitioners. If you are looking for a career change, it will also help you identify the skills and expertise in high demand.
As a token of our appreciation, everyone who completes the survey and provides their contact details will be entered into our drawing for a chance to win an iPad…Continue
Added by Vincent Granville on January 31, 2013 at 12:00pm — No Comments
The Big Data Analytics revolution is underway. This revolution is an historic and game-changing expansion of the role that information plays in business, government and consumer realms. To harness the power of this data revolution, a paradigm shift is required. Organizations must be able to do more than query their Big Data stores; search is no longer enough. We announced the…Continue
Added by Radhika Subramanian on January 31, 2013 at 9:15am — No Comments
Also check the Additional Resources section at the bottom.
It is so good to see more and more people sharing what can be done via Analytics and who can do what. What I do not see, again, is the art of implementing Analytics. Analytics is not done by a simple act of software installation or model churning.
What I am working on includes: exciting people about their future state of works. A must-have responsibility of any Analytics include regular and routine reporting. However, what they really want is the vision of bringing inspiration to…Continue
Added by Jeffrey Ng on January 25, 2013 at 11:30pm — No Comments
Today’s need to leverage unprecedented amounts of available information has resulted in a flood of tools, services and models claiming to surface insights from Big Data. One model in particular, visualization, has received a lot of attention lately because of its abilities to organize and present information. However, visualization is actually one of the biggest barriers to insight because it places the burden of discovery on the user, and any tool that places the burden on the…Continue
Added by Radhika Subramanian on January 25, 2013 at 2:00pm — No Comments
Our skyscraper and smaller banner ads do not show up (although our leader board banner ad usually does) if you…Continue
Added by Vincent Granville on January 23, 2013 at 5:08pm — No Comments
Added by Simon Rees on January 23, 2013 at 6:27am — No Comments
A NoSQL database from MarkLogic provides the Royal Society of Chemistry (RSC) with the ability to unlock a treasure trove of assets. Now the RSC can publish three times as many journals and four times as many articles. It also gave the Society the ability to develop new educational applications to make chemistry accessible to a wider audience.
Modern approaches to information products replete with full text search have the power to…
Added by Tony Agresta on January 18, 2013 at 7:58am — No Comments
Your relationship with data and your attitude towards it will be one of the biggest predictors of the health of your business in 2013 and beyond. Why? Because the data revolution is upon us, and organizations are reaching the limit of possible improvements based on older methods. The transformational change occurring in organizations today is dependent upon adopting insight from data for visibility and profit improvement.
Some business sectors have a natural affinity for data.…Continue
Added by Radhika Subramanian on January 17, 2013 at 9:31am — No Comments
Facing these questions I have make a personal research, and realize a synthesis, which has help me to clarify some ideas.
The attached presentation does not intent to be exhaustive on the subject, but could perhaps bring you some useful insights:…Continue
Added by Michel Bruley on January 14, 2013 at 2:04am — No Comments
Added by Sandeep Raut on January 12, 2013 at 10:42pm — No Comments
Added by yanping on January 11, 2013 at 6:30pm — No Comments
Have you ever wondered about the logistics of what it takes to pull off transporting you and your bags from your home to your destination and back again? Have you ever imagined what it might be like to work for the airlines and be in charge of scheduling flights? Believe it or not, the computational complexity of airline travel planning is some of the most complicated in the world. Every traveler understands the criticality of time in the airline business. And if your business is…Continue
Added by Radhika Subramanian on January 10, 2013 at 8:07am — No Comments
This seminal article highlights the dangers of reckless applications and scaling of data science techniques that have worked well for small, medium-size and large data. We illustrate the problem with flaws in big data trading, and propose solutions. Also, we believe expert data scientists are more abundant (but very different) than what hiring companies claim: read our "related articles" section at the bottom for…Continue
With vast amounts of data about consumer usage, cost effective advertising, churn reduction and effective cross sell seem well suited to be addressed with big data technology in the telco space. Now that data streams can be captured in real time as data is pushed to communication service providers, this data can be connected to data pulled from other databases to create vivid usage patterns. When anonymized and profiled…
Added by Tony Agresta on January 4, 2013 at 10:21am — No Comments
Taming the analytics hydra with an Analytics Center of Excellence (ACE)
by Scott Mongeau
Managing analytics, organizationally, has the aspect of a hydra, the ancient Greek mythological dragon which Hercules slew. For each head Hercules dispatched,…Continue
Will analytics help amusement park dinosaurs survive the 21st century? My recent experience at Disneyland in California makes me think that a lot of simple things can be done to improve revenue and user experience. Here are a few starting points: