Zetta Scalabytes Blog

In this blog, hear from Zetta’s founders and leaders about cloud computing, storage and data management best practices and Zetta Enterprise Cloud Storage technology.

Archive for March, 2010

Chris Schin

March 31, 2010

Hosting Primary, Unstructured Enterprise Data in the Cloud – Part 8: Administrative Transparency and Control

Chris Schin, VP Products, is responsible for coordinating all Zetta product-related initiatives including product strategy, direction, and marketing, as well as business model and go-to-market process definition. Prior to joining Zetta, Chris was acting GM and Senior Director for Symantec Protection Network, Symantec's Software as a Service platform.

Hi. This blog series covers the concepts we used to design the Zetta storage solution, based on feedback from enterprise IT professionals about their needs.

 

Here is an outline of this series and hyperlinks to previous posts:

 

This post discusses how a service provider can engender trust from customers through transparent access to administration tools and system information.

 

A good software user interface enables quick, easy access both to information about the functioning of the system (monitor) and to the features available to the user (manage). For an IT storage professional, such a UI should provide:

 

    [Screenshot: Zetta Storage]

  • An intuitive interface, one that behaves like existing filer controls and enables rapid navigation to trending information and features

     

  • A robust control framework designed for IT professionals, one that enables access management, access logging, and controls for things like snapshots and replication

     

  • Transparent visibility into storage solution behavior: both good and bad events should be surfaced, so that users can be confident they see every event relevant to their data set

     

  • Instant access to support and knowledge, in the form of online ticketing and a maintained knowledgebase

     

    [Screenshot: Zetta Events]

  • Both actionable alerts to respond to and automated self-healing capabilities; in effect, a notification framework with some auto-corrective actions

     

  • The ability to delegate administration based on granular roles and permissions, leveraging existing LDAP permissions

     

  • Access from anywhere (i.e., Web-based)

     

This may not seem like a long or onerous list, but if you have any experience with the UIs of either enterprise NAS filers or cloud storage providers, you’ll have noticed that many of these seemingly simple requirements often go unfulfilled.
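To illustrate the item in the list above about alerts and self-healing, here is a minimal sketch of a notification framework with auto-corrective actions. This is not Zetta's implementation; the class names, event kinds, and the always-successful "healer" are all hypothetical. It only shows the shape of the idea: every event is logged for transparency, events with a working automated fix are corrected, and everything else becomes an alert for an administrator.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Event:
    kind: str    # hypothetical event name, e.g. "replica_lag"
    detail: str


@dataclass
class EventBus:
    # Maps an event kind to an automated fix that returns True on success.
    healers: Dict[str, Callable[[Event], bool]] = field(default_factory=dict)
    log: List[str] = field(default_factory=list)       # every event, good or bad
    alerts: List[Event] = field(default_factory=list)  # events needing a human

    def publish(self, event: Event) -> None:
        healer = self.healers.get(event.kind)
        if healer is not None and healer(event):
            # Self-healed events are still surfaced in the log.
            self.log.append(f"auto-corrected {event.kind}: {event.detail}")
            return
        self.log.append(f"alert {event.kind}: {event.detail}")
        self.alerts.append(event)


# Assumed example: replica lag has an automated fix, quota overruns do not.
bus = EventBus(healers={"replica_lag": lambda e: True})
bus.publish(Event("replica_lag", "volume vol1 is 30s behind"))
bus.publish(Event("quota_exceeded", "volume vol2 is over quota"))
```

After these two events, the log holds both entries, while only the quota event is waiting as an actionable alert.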

Chris Schin

March 03, 2010

Hosting Primary, Unstructured Enterprise Data in the Cloud – Part 7: Non-blocking Performance

Hello again and welcome back to my blog series outlining what our customers told us they wanted to see in a cloud storage solution before they would put primary copies of their enterprise data in the cloud. Again, it is important to note that these requirements drove the design and development of the solution we have in market today.

 

This is the outline of the series and hyperlinks to previous posts:

 

This post discusses how a service provider must create a storage solution architecture that can ensure “non-blocking” performance, enabling it to adapt to multiple customer access patterns simultaneously.

 

There is no question that innovations have allowed today’s traditional arrays to scale to huge capacity: hundreds of terabytes per array. But the core array architecture has changed little over time, and this architecture can limit the amount of additional capacity that can be added, and can even prevent existing capacity from being fully utilized. A massive-scale, multi-tenant architecture requires a fundamentally different design, one that borrows heavily from distributed systems design principles.

 

There are effectively three components to any storage solution: the network, the controller, and disk. In a traditional array, purchase-time decisions are made that determine the ratios of each of these to the others, and those decisions are very difficult to alter once the array has been deployed. Unfortunately, circumstances change, and one of these three components almost always becomes the bottleneck, preventing full utilization of the other components. For example, if the workload winds up being more controller-intensive than expected, the disks won’t ever be filled.
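To make the bottleneck effect concrete, here is a small worked example in Python. The array figures and the per-terabyte workload numbers are invented for illustration, and `usable_capacity` is a hypothetical helper; the only point is that the scarcest component caps how much of the other two can be used.

```python
def usable_capacity(array, per_tb):
    """Return (limiting component, TB that can be served, utilization of each)."""
    # How many TB of this workload each component could sustain on its own.
    limits = {
        "network": array["network_mbps"] / per_tb["mbps"],
        "controller": array["controller_iops"] / per_tb["iops"],
        "disk": array["disk_tb"],
    }
    limiting = min(limits, key=limits.get)
    served_tb = limits[limiting]
    utilization = {name: served_tb / limit for name, limit in limits.items()}
    return limiting, served_tb, utilization


# Assumed purchase-time configuration of one array.
array = {"network_mbps": 1000.0, "controller_iops": 50000.0, "disk_tb": 200.0}
# Assumed workload: each stored TB needs 2 MB/s and 500 IOPS.
workload = {"mbps": 2.0, "iops": 500.0}

limiting, served_tb, utilization = usable_capacity(array, workload)
```

With these numbers the controller runs out at 100 TB, so the 200 TB of disk is only half filled and the network sits at 20% utilization.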

 

A service provider who tries to construct a storage service using a series of high-priced, traditional arrays will fall prey to this dynamic in a very acute way: installing multiple arrays doesn’t remove the problem, it multiplies it. This is compounded by the fact that a cloud storage service provider cannot plan in advance for customer behavior, because the customers aren’t even identified when the arrays are purchased.

 

A cloud service provider shouldn’t attempt to use traditional vendor-produced arrays to create a storage service — the costs don’t add up, any single customer’s access pattern could negatively impact others, and the fundamental array architecture is in conflict with the notion of a storage service.

 

Instead, a storage service must be architected using Internet-centric distributed computing principles. Each tier of the architecture (throughput, IOPS, and density) should be able to scale independently of the others, allowing the service provider to adapt to customer behavior, both individual and aggregate, so that every customer gets adequate performance and system resources are well utilized overall.
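As a rough sketch of what independent scaling means in practice, the snippet below sizes each tier separately from its own demand. The per-node capabilities and demand figures are hypothetical, as is the `nodes_needed` helper; a real service would use measured numbers.

```python
import math

# Hypothetical per-node capabilities for each independently scalable tier.
PER_NODE = {"throughput_mbps": 1000, "iops": 25000, "capacity_tb": 100}


def nodes_needed(demand):
    """Size each tier on its own: nodes = ceil(demand / per-node capability)."""
    return {tier: math.ceil(demand[tier] / PER_NODE[tier]) for tier in PER_NODE}


today = {"throughput_mbps": 4500, "iops": 120000, "capacity_tb": 900}
# Suppose customers become more IOPS-hungry while everything else stays flat.
later = dict(today, iops=300000)

before = nodes_needed(today)
after = nodes_needed(later)
grown = {tier: after[tier] - before[tier] for tier in PER_NODE}
```

In this example only the IOPS tier grows (from 5 nodes to 12); the throughput and capacity tiers stay at 5 and 9 nodes, which is exactly the flexibility a fixed-ratio array cannot offer.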

 

One additional best practice to mention: unlike computer processors, disks are mechanical devices that spin at a fixed maximum rate. As a result, if enough I/O requests hit a disk at the same time, the disk can become saturated and its throughput can fall off a cliff. Since both IOPS and density are determined by the disk, a storage service should provide a QoS engine, similar to a computer’s scheduler, to ensure that disks never reach a point of no return under load, where I/O performance degrades sharply.
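One simple way to picture such a QoS engine is an admission controller that caps the number of outstanding I/Os per disk and queues the rest. The sketch below is an illustration only, not a description of Zetta's engine; the `DiskQos` class and the cap of two in-flight requests are assumptions chosen to keep the example short.

```python
from collections import deque


class DiskQos:
    """Caps outstanding I/Os on one disk; excess requests wait in FIFO order."""

    def __init__(self, max_in_flight):
        self.max_in_flight = max_in_flight
        self.in_flight = 0
        self.waiting = deque()

    def submit(self, request_id):
        """Return True if dispatched to the disk now, False if queued."""
        if self.in_flight < self.max_in_flight:
            self.in_flight += 1
            return True
        self.waiting.append(request_id)
        return False

    def complete(self):
        """Mark one I/O finished; return the queued request dispatched next, if any."""
        if self.in_flight == 0:
            raise RuntimeError("no I/O in flight")
        self.in_flight -= 1
        if self.waiting:
            self.in_flight += 1
            return self.waiting.popleft()
        return None


qos = DiskQos(max_in_flight=2)
results = [qos.submit(i) for i in range(4)]  # requests 0 and 1 run, 2 and 3 wait
next_up = qos.complete()                     # frees a slot, dispatches request 2
```

The disk never sees more than two concurrent requests, so a burst from one customer turns into a short queue rather than a collapse in throughput for everyone sharing that disk.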
