Posts Tagged Hadoop
Our friend Matt Asay, who oversees business development for streaming Big Data analytics player Nodeable (read hear about Nodeable’s recent shift in business model), penned a column today sizing up the Hadoop distribution competition. Asay narrows the competitors to two – Hortonworks and Cloudera – and proceeds under the premise that only one of the two can and will survive.
theCUBE Summer Tour continues next week when we roll into San Jose June 13th and 14th with live coverage of Hadoop Summit 2012. Presented by Hortonworks, Hadoop Summit brings together an incredible array of players in the Big Data market – both new school Hadoop providers like Hortonworks, Cloudera, Hadapt, MapR and Datameer, as well as tried-and-true IT vendors from IBM and Microsoft to Oracle and HP.
There are also a slew of Hadoop practitioners on-hand at the event, many of whom will join myself and SiliconANGLE’s John Furrier in theCUBE to share their experiences and advice with colleagues just getting started with Big Data. They include Intuit, Adobe, Klout, Twitter and Sears. As always, the show will be streamed in live HD at SiliconANGLE.tv.
Wikibon research is based upon open source sharing amongst peers and SiliconANGLE’s live video broadcasts with theCUBE are a great enabler of this mission. The upcoming schedule of shows highlight the biggest megatrends of IT including cloud computing, big data, converged infrastructure and more. Wikibon will share its latest research findings and go in-depth with IT practitioners on-air. Come join the Wikibon community on-site or streaming in the comfort of your home or office. All of the details for the summer tour:
I’m back from Strata Conference and after three days, 16 keynote presentations, countless sessions, 20+ hours of live coverage via theCUBE, and two very long flights from Boston to Silicon Valley and back, these things I’m sure of:
- Big Data, namely Hadoop, is for real.
- It still has some maturing to do.
You know a technology is headed to the mainstream when the two “Elite” sponsors of the premier event designed to showcase that technology are Microsoft and EMC. Neither company is known for adopting and promoting emerging open source technologies, to put it mildly. But there they both were at Strata Conference, the event dedicated to open source Big Data approaches like Hadoop and NoSQL, topping the list of event sponsors. They were followed not far behind by fellow IT giants and Strata “Impact” sponsors IBM and Oracle.
The Wikibon / SiliconANGLE team is excited to be broadcasting live coverage of this year’s Strata Conference in Santa Clara, California, February 28th through March 1st of this week. Co-hosts John Furrier and Dave Vellante will be bringing coverage live on #theCUBE, the flagship telecast from SiliconANGLE.tv, with original content and analysis developed exclusive by the SiliconANGLE and Wikibon teams.
With a full schedule of guests and in-depth coverage of the key moments at this year’s Strata Conference, the Wikibon / SiliconANGLE team will be covering all of the angles.
EMC announced today it has integrated the Isilon scale-out network attached storage (NAS) platform with an Apache-based Greenplum Hadoop distribution to make the Big Data framework more palatable to enterprises with strict SLA requirements.
The move is designed to address a number of Hadoop’s enterprise-level shortcomings by applying Isilon’s backup and recovery capabilities and more efficient storage to the open source Big Data framework, according to EMC.
EMC will make Isilon an optional module of its Greenplum Data Computing Appliance, which also includes an Apache-based Hadoop distribution called Greenplum HD, the standard or high-capacity Greenplum database, and the Greenplum Data Integration Accelerator.
Oracle added a twist to this morning’s announcement regarding the general availability of its Big Data Appliance and related Big Data connectors. Rather than shipping the appliance with its own Hadoop distribution or the vanilla Apache distribution, Oracle has partnered with Cloudera to include its Hadoop distribution and management software instead.
Originally announced at Open World in October, the Oracle Big Data Appliance is a preconfigured hardware-software bundle running Oracle Linux. It is available in a full rack configuration of 18 Oracle Sun servers and includes the community edition of Oracle’s NoSQL database, an open source distribution of R, and Oracle HotSpot Java Virtual Machine for running MapReduce jobs, in addition to CDH and Cloudera Manager.
- 2012 Will Be the Year of Big Data Applications. Thanks to the intense competition between The Big Three distribution vendors, Hadoop developed rapidly in 2011 and is, by most accounts, enterprise-ready (there are always areas for improvement, of course, notably around Hadoop’s single point of failure issue.) This, along with readily available capital, will result in significant innovation from both existing and new start-up Big Data Application vendors now confident that Hadoop is for real. Expect to see new vertical Hadoop-based Big Data Applications for healthcare, retail, financial services and manufacturing in the year ahead, as well as horizontal applications focused on human capital management and enterprise resource planning. Adoption will start slow, but for traditional enterprises, Big Data Applications are the key to realizing impactful business value from Hadoop. 2012 should be a good year on this front.
At the start of 2011, there was only one commercial Hadoop distribution vendor on the market, virtually no Big Data application vendors with products ready for primetime, and Data Scientists were considered little more than propeller-heads working on some wacky experiments.
Well, a lot can happen in a year. As of December 2011, there are three viable commercial Hadoop distribution vendors doing battle for market supremacy, a slew of start-ups as well as stalwart software vendors getting into the Big Data application game, and Data Scientists are the new rock stars of the IT world.