Posts Tagged Big Data
Former Republican congressman-turned-TV pundit Joe Scarborough doesn’t buy Nate Silver’s numbers. For Scarborough, they just don’t add up.
Speaking on Morning Joe on Oct. 29, when Silver’s FiveThirtyEight blog put Obama’s chances at reelection somewhere around 75%, Scarborough declared: “Both sides understand [the presidential election] is close, it could go either way, and anybody that thinks this race is anything but a tossup right now is such an ideologue they should be kept away from typewriters, computers, laptops and microphones for the next ten days because they’re jokes.”
The fear (or is it disdain?) is sometimes justified. No developer wants to get locked in to a platform that dictates which tools she can use, which data sources she can integrate, which hardware she must deploy or that makes switching to a competing platform too costly to justify.
Next week theCUBE is back in action, this time covering two Big Data conferences in one. Strata Conference + Hadoop World on theCUBE kicks off live Wednesday (10/24) morning at 10 am ET on SiliconANGLE.tv. We’re broadcasting all day Wednesday and all day Thursday (10/25) from New York City with virtually non-stop live interviews with the smartest nodes at the conference.
We’ve identified the most compelling news and trends that will be developing at the show and programed our coverage to flesh them out in great detail. Among other trending topics, you’ll get full coverage and analysis of the emerging Big Data application development market, the state of real-time analytics in Hadoop environments, and new ecosystem partnerships, as well as some great advice for Big Data practitioners from Big Data practitioners.
It’s Oracle OpenWorld this week and that means more colorful if factually questionable statements from everybody’s favorite egomaniacal billionaire CEO. And, not surprisingly, Larry Ellison’s target was archrival SAP.
“SAP has an in-memory machine, you know, that’s a little bit smaller than what we offer,” Ellison said at OpenWorld yesterday, referring to SAP HANA and Oracle’s own all in-memory database Exadata X3, which debuted this week. “We have 26 terabytes of memory; [SAP offers] 0.5 terabytes of memory.”
In case you missed his point, Ellison put it as succinctly as he could: “The HANA in-memory machine is, like, really small.” (Hat Tip to eWeek)
If you’re interested in Big Data and you find yourself in Boston tomorrow you owe it to yourself to head over to the Hyatt Regency in the Financial District. That’s where The IE Group is putting on its Big Data Innovation Summit and the list of speakers and sessions is impressive.
Here’s just a small sampling of the speakers:
- Facebook’s Mohammad Sabah, a Data Scientist with previous experience at Netflix, will discuss “applying scalable machine learning algorithms for applications ranging from ranking to search to matching.” Sabah, who joined us on theCUBE at Hadoop Summit, will further explore why Hadoop is particularly effective on large and otherwise inaccessible data sets.
If you’ve been unable to keep up with all the competing NoSQL databases that have hit the market over the last several years, you’re not alone. To name just a few, there’s HBase, Cassandra, MongoDB, Riak, CouchDB, Redis, and Neo4J.
To that list you can add Accumulo, an open source database originally developed at the National Security Agency. You may be wondering why the world needs yet another database to handle large volumes of multi-structured data. The answer is, of course, that no one of these NoSQL databases has yet checked all the feature/functionality boxes that most enterprises require before deploying a new technology.
Our friend Matt Asay, who oversees business development for streaming Big Data analytics player Nodeable (read hear about Nodeable’s recent shift in business model), penned a column today sizing up the Hadoop distribution competition. Asay narrows the competitors to two – Hortonworks and Cloudera – and proceeds under the premise that only one of the two can and will survive.
The team at Wikibon is pretty excited about the future of Big Data. From Big Data infographics to revenue forecasts and funding reports, to our own Big Data Manifesto, the market is literally exploding with innovation and development. But the sheer magnitude of the numbers we’re analyzing around Big Data are tremendous in and of themselves.
Don’t believe us? Here are over thirty significant Big Data statistics to consider, broken out by the current environment, the growth of unstructured (user generated) data, the marketplace, and business issues related to Big Data.
The Wikibon community is not the only organization interested in Big Data. Several other companies and publishers are getting into the discussion and visualization (infographics) is a powerful way to present the impact and significance of Big Data.
Here is our “Big List” of the Big Data infographics that have been created in recent months (including ours front and center). Let us know your feedback and what if we’re missing anything (click each infographic thumbnail for the original blog posts).
One of the challenges to understanding cloud computing is that it’s not easy to visualize what the solution really looks like. Before heading to HP Discover, I had the opportunity to tour the SwitchNAP facility in Las Vegas. There are dozens of cloud solutions (including HP, EMC, Joyent, Nirvanix, VMware) hosted in the 407,000 square foot co-location facility, and there’s strong (e.g., guys with guns) cloud security. Taking the tour is a geek paradise – it’s like a James Bond villain stronghold: employees dressed in black, metal desks, red and blue LED lighting, and the most technologically advanced data center that I’ve seen. Switch is not only a showcase for the scalable, dense and efficient power and cooling of cloud solutions, but also has extra capabilities of a networking buying consortium and the US Cloud inter-cloud exchangeto enable lots of interesting cloud deployments.