All Blog Posts Tagged 'Hadoop' (12)

Hadoop Security Issues and Best Practices

The big data blast has given rise to a host of information technology software and tools…

Added by marry tho on June 17, 2016 at 6:00am — No Comments

Big Data A to Z – The Annotated Glossary of my Favorite Data Science Things

The extended annotated version of the "Big Data A to Z Glossary of my Favorite Data Science Things" is now live at: http://bit.ly/1g5NcBt

However, the original…

Added by Kirk Borne on March 20, 2014 at 3:00pm — No Comments

Intellipaat Hadoop Online Training

Hi all,

Intellipaat will start a new Hadoop Developer batch on 29 June 2013. Interested candidates can drop an email to sales(@)intellipaat(dot)com.

Regards,

Sales Team, Intellipaat

9019368913

Added by raja singh on June 24, 2013 at 4:30am — 1 Comment

How to install MapR M3 on Ubuntu through Ubuntu Partner Archive.

In a recent post I mentioned the partnership between MapR and Canonical, an initiative to make Hadoop natively available on Ubuntu through the Ubuntu Partner Archive. Now that the package has been released, I thought I would show how to get it done. Trust me, it's really cool to…

Added by Mohammad Tariq Iqbal on April 30, 2013 at 7:17pm — No Comments

Hadoop Herd : When to use What...

Eight years ago, not even Doug Cutting would have thought that the tool he was naming after his kid's soft toy would so soon become a rage and change the way people and organizations look at their data. Today Hadoop and Big Data have almost become synonymous. But Hadoop is not just Hadoop now. Over time it has evolved into one big…

Added by Mohammad Tariq Iqbal on April 25, 2013 at 6:55pm — No Comments

Big Data : How do you run capacity planning ?

Most data warehouse folks are well accustomed to the term "Capacity Planning" (read Inmon). It is a widely used process among DBAs and data warehouse architects. In a typical data management and warehousing project, a wide variety of people are involved in driving capacity planning. It involves everyone from Business Analyst to Architect to…

Added by Manish Bhoge on February 15, 2013 at 2:46am — No Comments

How to begin with Hadoop

Hadoop is an open source platform, written almost entirely in Java and distributed under the Apache License, that allows us to store, manage and process gigantic amounts of data in a highly parallel manner on clusters of commodity machines. It is most suitable for batch processing. People and organizations use Hadoop to build their ETL tools, to perform their BI operations, to do analytics, etc.

You can find countless posts on Hadoop on the internet, and most of them are really good. But…

Added by Mohammad Tariq Iqbal on February 14, 2013 at 5:40pm — No Comments

Digesting Big Data

Big Data holds a big promise. But has that promise paid out already? Or are you heading for Big Dollar Disaster? Many take inventory of their data and find out they have terabytes of data lying around. Surely something should be done with that, so here’s how we see a lot of companies going about implementing ‘something’ for their Big Data.

  • Buy hardware, set up a cluster and install some flavor of Hadoop.
  • Break their brains to figure out how to run map-reduce…

Added by Jos Verwoerd on November 13, 2012 at 4:04am — No Comments

R + Hadoop = Data Analytics Heaven

Hadoop (MapReduce, where code is turned into map and reduce jobs that Hadoop then runs) is the most well-known technology used for "Big Data" because it allows an organization to store huge quantities of data at very…

Added by Michael Walker on November 7, 2012 at 5:55pm — No Comments

Hadoop Technology Stack

The Hadoop stack includes more than a dozen components, or subprojects, that are complex to deploy and manage. Installation, configuration and production deployment at scale are challenging.

The main components…

Added by Michael Walker on August 22, 2012 at 1:00pm — No Comments

Percolator, Dremel and Pregel: Alternatives to Hadoop

Hadoop (MapReduce, where code is turned into map and reduce jobs that Hadoop then runs) is great at crunching data yet inefficient for analyzing data, because each time you add, change or manipulate data you must stream over the entire dataset.
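
To make the map and reduce idea concrete, here is a minimal single-process sketch of a word count in plain Python. It is not Hadoop code and uses no Hadoop API; the function names (`map_phase`, `shuffle`, `reduce_phase`, `run_job`) are hypothetical and chosen only for illustration. It assumes the whole input fits in memory, which a real cluster job would not.

```python
from collections import defaultdict


def map_phase(record):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    for word in record.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group all emitted values by key, as the framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: collapse all values for one key into a single count."""
    return key, sum(values)


def run_job(records):
    """Run map, shuffle and reduce over every record in the dataset."""
    pairs = (pair for record in records for pair in map_phase(record))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    lines = ["big data big promise", "big cluster"]
    print(run_job(lines))
    # Adding one record means running the job over the full dataset again.
    print(run_job(lines + ["big data"]))
```

Note that `run_job` has no way to update an earlier result: after a new line arrives, the sketch simply scans every record again, which is the batch behaviour described above.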

In most organizations, data is always…

Added by Michael Walker on August 12, 2012 at 12:49pm — No Comments

RHIPE: An Interface Between Hadoop and R for Large and Complex Data Analysis

Ron Fredericks writes: The Bay Area useR group event had a full house. The video presented here attempts to capture both the content and the excitement surrounding this event:…

Added by Ron Fredericks on February 14, 2011 at 8:00pm — No Comments

© 2019 AnalyticBridge.com is a subsidiary and dedicated channel of Data Science Central LLC
