A Data Science Central Community
The big data blast has given rise to a host of information technology software and tools…Continue
Added by marry tho on June 17, 2016 at 6:00am — No Comments
The extended annotated version of the "Big Data A to Z Glossary of my Favorite Data Science Things" is now live at: http://bit.ly/1g5NcBt
However, the original…Continue
Added by Kirk Borne on March 20, 2014 at 3:00pm — No Comments
Intellipaat will Start a new Hadoop Developer Batch From 29th june 2013. Interested Candidates Drop an Email at sales(@)intellipaat(dot)com.
sales Team Intellipaat
In a recent post of mine I had mentioned about the partnership between MapR and Canonical towards an initiative to make Hadoop available with Ubuntu natively through Ubuntu Partner Archive. Since, the package has been released now, I thought of showing how to get it done. Trust me it's really cool to…Continue
Added by Mohammad Tariq Iqbal on April 30, 2013 at 7:17pm — No Comments
8 years ago not even Doug Cutting would have thought that the tool which he's naming after the name of his kid's soft toy would so soon become a rage and change the way people and organizations look at their data. Today Hadoop and BigData have almost become synonyms to each other. But Hadoop is not just Hadoop now. Over the time it has evolved into one big…Continue
Added by Mohammad Tariq Iqbal on April 25, 2013 at 6:55pm — No Comments
Most of Datawarehouse folks are very much accustomed with the term "Capacity Planning", Read Inmon. This is widely used process for DBA's and Datawarehouse Architects. In an typical project of data management and warehouse wide variety of audience is involved to drive the capacity planning. It involves everyone from Business Analyst to Architect to…Continue
Added by Manish Bhoge on February 15, 2013 at 2:46am — No Comments
Hadoop is an open source platform, completely written in Java and distributed under Apache's licence, that allows us to store, manage and process gigantic amounts of data in a highly parallel manner on clusters of commodity machines. It is most suitable for batch processing. People and organizations use Hadoop to build their ETL tools, to perform their BI operations, to do analytics etc.
You can find countless posts on Hadoop over the internet. And most of them are really good. But…Continue
Added by Mohammad Tariq Iqbal on February 14, 2013 at 5:40pm — No Comments
Big Data holds a big promise. But has that promise paid out already? Or are you heading for Big Dollar Disaster? Many take inventory of their data and find out they have terabytes of data lying around. Surely something should be done with that, so here’s how we see a lot of companies going about implementing ‘something’ for their Big Data.
Added by Jos Verwoerd on November 13, 2012 at 4:04am — No Comments
Hadoop (MapReduce where code is turned into map and reduce jobs, and Hadoop runs the jobs) is the most well known technology used for "Big Data" because it allows an organization to store huge quantities of data at very…Continue
Added by Michael Walker on November 7, 2012 at 5:55pm — No Comments
The Hadoop stack includes more than a dozen components, or subprojects, that are complex to deploy and manage. Installation, configuration and production deployment at scale is challenging.
The main components…Continue
Added by Michael Walker on August 22, 2012 at 1:00pm — No Comments
Hadoop (MapReduce where code is turned into map and reduce jobs, and Hadoop runs the jobs) is great at crunching data yet inefficient for analyzing data because each time you add, change or manipulate data you must stream over the entire dataset.
In most organizations, data is always…Continue
Added by Michael Walker on August 12, 2012 at 12:49pm — No Comments
Ron Fredericks writes: The Bay Area useR group event had a full house. The video presented here attempts to capture both the content and the excitement surrounding this event:…Continue
Added by Ron Fredericks on February 14, 2011 at 8:00pm — No Comments