Apache moves forward with machine-learning project



November 25, 2009
When Bradford Cross and the FlightCaster team launched their startup earlier this year, they had already spent months building systems that could learn from FAA and airline data. Elsewhere, Netflix awarded a million dollars to an outside team to build a better recommendation engine for its movie rental service. Both of these systems rely on machine learning, a branch of artificial intelligence, but both were built from scratch.

AI has been around since the early days of software development, but only recently has machine learning become a viable business tool. With that in mind, a group of Apache Lucene developers decided earlier this year to build out the tools needed to create competent machine-learning systems on top of the big data systems being created at Apache.

In mid-November, the Apache Mahout project reached version 0.2, and with it come some of the first building blocks for creating systems like the Netflix recommendation engine. While this is only the second release for the project, and even its creators say it is still in its infancy, Mahout is functional and can be put to use now.

Mahout is an attempt to build libraries and algorithms useful to developers attempting to create machine-learning applications. It is typically used on top of Apache Hadoop, where large data sets can be stored and analyzed, and where the actual trained machine-learning routines can be executed across that data.
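For readers unfamiliar with what a recommendation engine actually computes, the core idea can be sketched in a few lines. The example below is plain Python and does not use Mahout or its API; the function names, the sample users, and the ratings are all hypothetical, and the technique shown (user-based collaborative filtering with cosine similarity) is only one simple approach among many. It is an in-memory illustration, not the distributed kind of processing that Hadoop is meant for.

```python
from math import sqrt


def cosine_similarity(a, b):
    """Cosine similarity computed over the items both users have rated."""
    shared = set(a) & set(b)
    if not shared:
        return 0.0
    dot = sum(a[i] * b[i] for i in shared)
    norm_a = sqrt(sum(a[i] ** 2 for i in shared))
    norm_b = sqrt(sum(b[i] ** 2 for i in shared))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def recommend(ratings, user, top_n=3):
    """Rank items the user has not rated by the similarity-weighted
    average of other users' ratings."""
    totals, weights = {}, {}
    for other, other_ratings in ratings.items():
        if other == user:
            continue
        sim = cosine_similarity(ratings[user], other_ratings)
        if sim <= 0:
            continue
        for item, score in other_ratings.items():
            if item in ratings[user]:
                continue  # only suggest items the user has not seen
            totals[item] = totals.get(item, 0.0) + sim * score
            weights[item] = weights.get(item, 0.0) + sim
    ranked = sorted(
        ((totals[i] / weights[i], i) for i in totals), reverse=True
    )
    return [item for _, item in ranked[:top_n]]


# Hypothetical sample data: user -> {movie title: rating on a 1-5 scale}
ratings = {
    "alice": {"Alien": 5, "Brazil": 4},
    "bob": {"Alien": 5, "Brazil": 4, "Casablanca": 5, "Dune": 2},
    "carol": {"Alien": 2, "Brazil": 5, "Casablanca": 1, "Dune": 3},
}

print(recommend(ratings, "alice"))
```

In this sample, bob's tastes match alice's more closely than carol's do, so his ratings count for more when ranking the movies alice has not yet seen.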

Grant Ingersoll is one of the cofounders of the Mahout project. He's also cofounder of Lucid Imagination, a company founded at the beginning of this year and dedicated to providing commercial support and service to the Apache Lucene project.

He said that Mahout "grew out of some frustrations all of us had had with various other machine-learning packages. There are others, and some of them are quite good, but we didn't feel they all address the issues important to us."

Those important factors were the existence of a vibrant community, the scalability of the solution, and the licensing of the tools under a business-friendly license.


