Are You Making Money With Your Object Storage?

An IT industry analyst article published by Infostor.

by Mike Matchett, Sr. Analyst and Consultant
article_are-you-making-money-with-your-object-storage-1
Object storage has long been pigeon-holed as a necessary overhead expense for long-term archive storage—a data purgatory one step before tape or deletion. We have seen many IT shops view object storage as something exotic they have to implement to meet government regulations rather than as a competitive strategic asset that can help their businesses make money.

Normally, when companies invest in high-end IT assets like enterprise-class storage, they hope to recoup those investments in big ways. For example, they might accelerate the performance of market competitive applications or efficiently consolidate data centers. Maybe they are even starting to analyze big data to find better ways to run the business.

These kinds of “money-making” initiatives have been mainly associated with file and block types of storage—the primary storage commonly used to power databases, host office productivity applications, and build pools of shared resources for virtualization projects.

But that’s about to change.

If you’ve intentionally dismissed or just overlooked object storage, it is time to take deeper look. Today’s object storage provides brilliant capabilities for enhancing productivity, creating global platforms and developing new revenue streams.

…(read the complete as-published article there)

It’s Big Data Warfare – Good versus Evil

(Excerpt from original post on the Taneja Group News Blog)

Whether your data hacker worldview is in terms of “black hat” v.s. “white hat” (or Gandalf’s early grey), “us” v.s. “them”, or, as in the military, “blue” v.s. “red” (the red guys are the ones you aim at), the fact is that exploitable data sets are growing larger and more available day by day. Big data techniques are maturing, ripe for easier adoption, and because they are based on commoditized infrastructure with plenty of open source tooling, both the good guys and the bad guys are fast becoming enabled to take advantage of formerly unavailable information.

…(read the full post)

Enterprise IT Will Dive Into Big Data Solutions in 2013

(Excerpt from original post on the Taneja Group News Blog)

If you are in IT, 2013 is going to be the year that you will want to dive into the “big data” pool if you haven’t been pushed in already. But don’t worry – it’s no longer sink or swim. For one, we’ll be here to help coach you through it all. And while the concepts, terminology and hype have been all over the place, once you start floating around you’ll find that under the surface much of what fills the big data pool is familiar IT infrastructure, data management, and services re-cast around a few easy-to-grasp innovations.

For example, if you are in IT and asked to pick a Hadoop distro to stand up, you’d probably start with evaluating the three main vendor distributions of Hadoop (rather than getting it straight off Apache) followed by other downstream OEM’d and pre-integrated versions.  The main supported distros are from Cloudera, Hortonworks, and MapR.  I didn’t really appreciate the differences until talking with all three individually (at 2012 NY Strata, see below).  

What really struck me was that Hadoop, from an IT perspective, is really a set of distributed data storage services at heart. Sure, you can talk about Hadoop as an architecture based on parallel functional programming that uses map/reduce tasks to move compute process to distributed data sets, or as an analytics platform for huge volumes of unstructured data, or as a commoditized/democratized scale-out data mining solution, but from an IT viewpoint, it looks and smells a lot like a way to cost-effectively store and access large volumes of data.  It’s an innovative twist on the age-old IT data storage problem when faced with ever growing amounts of data – how to support your business cost-effectively deriving value out of growing data stores.

In an nutshell, Hortonworks takes the high-road regarding the open source Hadoop and related Apache projects, and focuses on 100% open source support. There is some good argument that by sticking with open source, you are assured future open source benefits.  If you start paying for bastardized distros and versions, you will, well, have to continue paying for it.

Cloudera is more commercially focused to help business deploy productive big data solutions. While offering a free Cloudera packaged distro version (CDH), they have a premium enterprise management offering when you get over 50 nodes. CDH is the most widely used Hadoop distro today. For example, you can buy a Dell Cloudera pre-integrated cluster, ready-to-go.

MapR thought as along as they were going to focus on helping businesses compete, they shouldn’t be constrained by open source and instead aggressively optimize the whole thing including the core parts for high performance and scalability. MapR also has a free community edition (M3), but the real enterprise solutions are M5/M7 which add sophisticated enterprise storage features directly to Hadoop including mirroring, snapshots, HA, and data placement controls (MapR optimizes other parts too, including now a highly customized Hbase). In other words, at the fundamental level MapR enterprise versions look like a hard-core “big” data storage services (EMC chose MapR for its Greenplum distro).

We’ll dive into more big data topics soon, but wanted to say I thoroughly enjoyed Strata (and the combined Hadoop World) in NY a few months ago. It was sold out to the “walls”. However, folks walking in just to see even the vendor exhibition floor were turned away due to already packing the place to the maximum allowed by fire codes. The Strata/Hadoop co-sponsors O’Reilly and Cloudera should think about stepping up to a much larger venue like the Jarvitz next time. Let vendors have bigger booths and welcome all who are interested drop in. Maybe even make the vendor show floor “free” to IT folks? I’m sure the vendors would love the traffic and attention.

I am looking forward to the future Strata events – one is coming up on the West coast next month (Feb 26-28) and I’d encourage IT folks in the area to check it out. The program web page indicates they recognize the IT impact big data is having with “Big Data for Enterprise IT” themed presentations. Hope to see you there!

…(read the full post)