Hadoop hits milestone 1.0 release



Alex Handy
January 4, 2012 —  The big-data world hit a major milestone over the holiday break: the Apache Hadoop project 1.0 release went out today.

Hadoop has grown to become one of the most active projects at the Apache Foundation, with dozens of sub-projects attached to it. The project as a whole fulfills several big-data goals: It gives developers cluster-management tools, a place to put petabytes of data for analysis, an implementation of the map/reduce algorithm for distributing jobs across that data, and a host of analysis tools for querying and processing all of that data.
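For readers new to map/reduce, the idea can be sketched in a few lines of Python. This is a single-process illustration of the pattern only, not Hadoop's implementation or API: the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are made up for this example, and a real cluster would run the map and reduce steps in parallel on the nodes that hold the data.

```python
from collections import defaultdict


def map_phase(record):
    """Map step: emit a (word, 1) pair for every word in one input record."""
    for word in record.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group all emitted values by key, as the framework would between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: collapse all values for one key into a single result."""
    return key, sum(values)


def word_count(records):
    """Run map, shuffle and reduce in sequence over an in-memory list of records."""
    pairs = (pair for record in records for pair in map_phase(record))
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data", "big clusters"]))
```

Because each record is mapped independently and each key is reduced independently, both steps can be spread across many machines, which is what makes the pattern a fit for cluster-scale data.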

One of the primary points of focus in the 1.0 release is the HBase database, which lets administrators store large, non-relational database tables on top of the Hadoop Distributed File System (HDFS). On its own, that file system is rudimentary and suited only to the simple storage of massive amounts of data across cluster nodes, but with HBase, Hadoop administrators can serve live data from their Hadoop clusters. Popular social bookmarking site StumbleUpon uses HBase as the live database for its website, as opposed to MySQL or Oracle.

In this release, HBase was moved up to a top-level project under Hadoop and received numerous performance improvements, with the end goal of removing the performance barriers that keep HBase from being a fully viable MySQL replacement for Hadoop users.

Another new addition to Hadoop 1.0 is WebHDFS, which is a RESTful API for inserting data directly into Hadoop. Previously, ingress of data into a Hadoop cluster required very specific tooling, and could not be performed via REST calls.
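As a rough sketch of what REST ingress can look like, the Python below builds WebHDFS-style URLs and performs the two-step file creation that the WebHDFS documentation describes: a PUT to the NameNode, which answers with a redirect to a DataNode, followed by a second PUT that carries the file bytes. The host name, port 50070, the `user.name` value and the helper names `webhdfs_url`, `_put` and `upload` are assumptions for illustration, not details from the release announcement.

```python
import http.client
from urllib.parse import quote, urlencode, urlsplit


def webhdfs_url(host, port, hdfs_path, op, params=None):
    """Build a WebHDFS REST URL for an absolute HDFS path (hypothetical helper)."""
    if not hdfs_path.startswith("/"):
        raise ValueError("hdfs_path must be absolute")
    query = urlencode({"op": op, **(params or {})})
    return f"http://{host}:{port}/webhdfs/v1{quote(hdfs_path)}?{query}"


def _put(url, body=None):
    """Send one HTTP PUT and return (status code, Location header or None)."""
    parts = urlsplit(url)
    conn = http.client.HTTPConnection(parts.hostname, parts.port)
    try:
        conn.request("PUT", f"{parts.path}?{parts.query}", body=body)
        resp = conn.getresponse()
        resp.read()  # drain the response before closing the connection
        return resp.status, resp.getheader("Location")
    finally:
        conn.close()


def upload(host, port, hdfs_path, data, user):
    """Create a file over WebHDFS; needs a reachable cluster, so it is not run here."""
    # Step 1: ask the NameNode where to write. No file data is sent yet.
    url = webhdfs_url(host, port, hdfs_path, "CREATE", {"user.name": user})
    status, location = _put(url)
    if status != 307 or not location:
        raise RuntimeError(f"expected a redirect from the NameNode, got HTTP {status}")
    # Step 2: send the bytes to the DataNode URL returned in the redirect.
    status, _ = _put(location, body=data)
    return status


if __name__ == "__main__":
    # Assumed host and port, shown only to illustrate the URL shape.
    print(webhdfs_url("namenode.example.com", 50070, "/logs/app.log", "CREATE"))
```

On a cluster with WebHDFS enabled, a call such as `upload("namenode.example.com", 50070, "/logs/app.log", b"hello", "alice")` would be expected to return a success status if the file is created; the exact port and authentication settings depend on how the cluster is configured.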

As Hadoop has begun to pop up inside enterprises, the project has placed a greater focus on security. With the release of version 1.0, Kerberos authentication has been implemented across nodes.

Merv Adrian, research vice president at Gartner, said that "Gartner is seeing a steady increase in interest in Apache Hadoop and related 'big data' technologies, as measured by substantial growth in client inquiries, dramatic rises in attendance at industry events, increasing financial investments and the introduction of products from leading data management and data integration software vendors. The 1.0 release of Apache Hadoop marks a major milestone for this open-source offering as enterprises across multiple industries begin to integrate it into their technology architecture plans."



