Archive for category Big Data
The Hadoop ecosystem is an eclectic mishmash of start-ups, mid-sized vendors and IT heavyweights with products and services up and down the Big Data stack. Inevitably the ecosystem will consolidate and thin itself out through mergers, acquisitions and – unfortunately for some of these start-ups – bankruptcies.
Consolidation is part of the natural evolution of any given technology market after an initial period of frenzied innovation, and the Big Data market is no exception. I believe we are witnessing the start of this consolidation today. It will take several years to play out, but the first phase of consolidation is manifesting itself in the form of strategic technical partnerships between vendors that play in different segments of the Hadoop market.
IBM’s annual revenue last year dropped below $100 billion for the first time since 2010. The company’s fourth quarter results were particularly weak, coming in 5.5% below expectations. This was due in large part to IBM’s struggling hardware business, with revenue dropping a staggering 27%.
I’ve already laid out my predictions for Big Data in 2014, but I also wanted to let the Wikibon community know how my colleagues and I plan to cover Big Data in the year ahead. We’ve organized our research agenda into three major buckets.
Technology. Clearly the technologies and products that collectively make up Big Data – including Hadoop, NoSQL data stores, analytic databases, data visualization tools and more – are maturing at a rapid pace (much faster, for example, than relational databases did in the 1980s.) Big Data is also applicable across industries, meaning these technologies are inevitably and increasingly intersecting with adjacent technology movements, namely the cloud, mobile computing and social media. As we have for the last several years, Wikibon will devote significant coverage to these developments with an eye on putting technology innovations in context for enterprise Big Data practitioners (both technology practitioners and line-of-business practitioners.)
After a (very) brief respite, #theCUBE is back on the conference circuit. Starting tomorrow at 10:20am ET, theCUBE is live for two days at the HP Vertica Big Data Conference (#HPBigData2013) in Boston. Unlike some Big Data shows that focus more on vendors, the Vertica show is heavy on customers and real Big Data end-users.
Everyone knows that the shift from traditional print and broadcast advertising to digital advertising is taking a huge toll on the media industry. Newspapers can’t charge as much for online ads as they do for print ads, and revenues are shrinking precipitously.
But the impact of this shift is being felt not just in the media industry but in the advertising industry itself. Advertising agencies, whose bread and butter is building creative ad campaigns and negotiating print and broadcast ad placement, are likewise under pressure to make the transition to a data-based marketplace.
There’s an old saying in the data management world: garbage in, garbage out, or GIGO. It means that the results of any data analysis project are only as good as the quality of the data being analyzed. Data quality is of critical importance when data sets are relatively small and structured. If you only have a small sample of data on which to perform your analysis, it better be good data. Otherwise, the resulting insights aren’t insights at all.
The show is a mix of both technical and business-centered Hadoop content. There will be a number of major focuses. One such focus is YARN. As I wrote on SiliconANGLE earlier today:
YARN is as a true Hadoop resource manager, allowing multiple applications – MapReduce, SQL, streaming analysis, etc. – to run on a single cluster of machines simultaneously while maintaining high performance levels. With YARN Hadoop is a true multi-application platform that can serve an entire enterprise.
The benefits of Big Data are often spoken of in the future tense. As in, “Big Data will someday provide enterprises of all types critical insights that allow for increased profitability, improved efficiency and other untold riches.”
Same goes for the technology. Hadoop, some say, will be the foundation of data storage and analytics in the enterprise once it’s proven enterprise-ready.
The reality, however, is that Big Data is today – here and now – delivering on its many promises. We at Wikibon have been documenting Big Data in the Real World for the last two years, including publishing a series of vertical-specific research notes highlighting how enterprises in retail, banking, media, utilities and pharma are leveraging Big Data analytics to drive performance.
Our friends at Forbes.com have put together a fantastic new infographic leveraging data from Wikibon’s Big Data Vendor Revenue and Market Forecast, 2012 – 2017 report. It provides a compelling view of the Big Data universe and illustrates the real revenue vendors are deriving from Big Data. They range from the mega-planets (if you’ll go with me on this analogy) IBM, HP and EMC to the smaller but powerful emerging planets like Hortonworks, 10gen and DataStax.
Ok, not the greatest analogy but still a great infographic:
Then-CEO Sam Palmisano launched IBM’s Smarter Planet initiative five years ago during a speech at the Council on Foreign Relations. IBM would focus its energies, Palmisano said, on helping governments and companies understand and analyze the voluminous data streaming off connected devices and industrial equipment to improve operational efficiencies and deliver better services to citizens and customers.
Since then, IBM has largely had the Industrial Internet, as the concept of has come to be called, to itself. The company’s Smarter Planet division has played a key role in making IBM the biggest Big Data company on the planet and was a lone bright spot in IBM’s otherwise disappointing Q1 2013 results.