Enterprise-grade Object Storage For Advanced Analytics Applications in VMware Tanzu Greenplum Environments
Modern analytic applications, including AI/ML, can require petabyte-scale data sets. With the amount of data consumed and generated by enterprises accelerating at an unprecedented pace, these analytic applications need to capture, store, and analyze data rapidly and at scale.
VMware Tanzu Greenplum, a massively parallel processing (MPP) data warehouse platform, integrates with Cloudian HyperStore, which provides a highly scalable, secure, and cost-effective object storage tier.
This VMware-certified solution enables new efficiencies and savings and is well suited to creating and deploying advanced analytics models for complex enterprise applications.
Figure 1. VMware Tanzu Greenplum | Cloudian HyperStore solution
Cloudian integrates with VMware Tanzu Greenplum to provide highly scalable, secure, and cost-effective data storage that supports the creation and deployment of advanced analytics models for complex enterprise applications at scale.
Challenges
Analytics use cases have expanded dramatically across virtually all industries and topics, wherever there is data. At the same time, analytics data sets have become massive in size. With this increasing volume of data comes a growing variety of data types, including traditional enterprise database data; log and security data; and web, mobile, and clickstream data. Add to this video and voice data; IoT data; and JSON, XML, geospatial, and graph data, among other data types. All this can easily overwhelm today’s data professionals.
The need for a data analytics platform solution that is affordable, manageable, and scalable has never been greater. Key to this solution is data storage that accommodates the varied data types, can expand to petabyte scale as use cases and operational requirements demand, and supports multi-cluster, multi-cloud, geo-distributed architectures.
SOLUTION BENEFITS
- Enterprise-grade object storage software combined with the proven VMware Tanzu Greenplum platform
- Single data analytics platform that can scale as needs evolve
- Start small and expand without downtime
- Military-grade data storage security
- Hybrid and multi-cloud ready
- Shared storage with 60% TCO savings
- VMware certified for trouble-free integration
- Flexible deployment options: bare metal, VM, and container
- Save time by managing a modern database infrastructure
USE CASES
- Storing database backups
- Staging files for loading and unloading file data
- Enabling federated queries over HyperStore-stored data via the VMware Tanzu Greenplum Platform Extension Framework (PXF)
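The federated-query use case above can be illustrated with a minimal sketch. It assumes a PXF server configuration has already been set up for the object store; the server name `hyperstore`, the bucket, the path, and the column list are hypothetical placeholders, not values from this brief. The helper only builds the DDL text for a readable PXF external table over delimited text files; it does not connect to a database.

```python
def pxf_external_table_ddl(table, columns, bucket, path, server,
                           profile="s3:text", delimiter=","):
    """Build CREATE EXTERNAL TABLE DDL for a readable PXF external table.

    All arguments are illustrative. Inputs are interpolated as-is, so this
    sketch is only suitable for trusted, hand-written values.
    """
    # Column definitions, e.g. "id int, amount numeric".
    column_sql = ", ".join(f"{name} {sql_type}" for name, sql_type in columns)
    # PXF location URI: bucket and path, then the profile and the named
    # server configuration (assumed to exist already).
    location = f"pxf://{bucket}/{path}?PROFILE={profile}&SERVER={server}"
    return (
        f"CREATE EXTERNAL TABLE {table} ({column_sql})\n"
        f"LOCATION ('{location}')\n"
        f"FORMAT 'TEXT' (DELIMITER '{delimiter}');"
    )


if __name__ == "__main__":
    # Hypothetical table, bucket, and server names.
    print(pxf_external_table_ddl(
        table="sales_ext",
        columns=[("id", "int"), ("amount", "numeric")],
        bucket="analytics",
        path="sales/2024/",
        server="hyperstore",
    ))
```

Once such a table exists, it can be queried and joined with regular Greenplum tables using ordinary SQL.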
VMware Tanzu Greenplum
VMware Tanzu Greenplum is a massively parallel processing (MPP), fully featured data platform designed to run the full gamut of analytical workloads, from BI to AI. Greenplum integrates with Cloudian HyperStore, which provides extensible data lake capability for Greenplum data.
As parallel Postgres for enterprise analytics at scale, Greenplum can rapidly create and deploy models for complex, mission-critical applications in fraud detection, cybersecurity, predictive maintenance, risk management, and many other areas. Greenplum is a modern platform that reduces data silos by enabling the consolidation of more workloads in a single, scale-out environment, including support for converging analytic and operational workloads, like streaming ingestion.
Greenplum users can execute point queries, fast data ingestion, data science exploration, and long-running reporting queries with greater scale and concurrency. Data professionals can test diverse models in parallel on multi-structured data sets (including machine learning, text, graph, and geospatial data) in this single environment at petabyte scale for deep analytical insights.
Cloudian HyperStore
Cloudian HyperStore is a VMware-certified, exabyte-scale, enterprise-grade object storage platform providing a cost-effective solution for your Tanzu Greenplum environment. With scale-out capacity, multi-site support, and a single global namespace, HyperStore capacity can be expanded non-disruptively as your needs evolve, while maintaining complete visibility and control of your data.
Data storage management is simplified with fine-grained, bucket-level storage policies. HyperStore is designed to provide the right level of data protection with both erasure coding and replication options that can be applied according to data type. Data durability of up to 14 nines helps ensure that your data remains protected even if one storage node or an entire data center fails.
Policy-driven data replication between sites means archived data is automatically backed up and available where it is needed, without manual asset management.
Cloudian’s automatic data verification and self-healing functions provide reliability and resilience against hardware failures, while its data encryption in flight and at rest and other security features safeguard valuable assets against threats of deletion or theft via malware.
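The trade-off between the replication and erasure coding options mentioned above comes down to simple arithmetic, sketched below. The specific schemes used (3-way replication and a 4+2 erasure code) are illustrative assumptions, not HyperStore defaults or recommendations, and the class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProtectionScheme:
    """A data protection layout: data fragments plus redundant fragments."""
    name: str
    data_fragments: int
    redundancy_fragments: int

    @property
    def raw_per_usable(self) -> float:
        # Raw capacity consumed for each unit of usable capacity.
        total = self.data_fragments + self.redundancy_fragments
        return total / self.data_fragments

    @property
    def tolerated_losses(self) -> int:
        # Fragments (or copies) that can be lost without losing data.
        return self.redundancy_fragments


def replication(copies: int) -> ProtectionScheme:
    # N-way replication is one data copy plus N-1 redundant copies.
    return ProtectionScheme(f"{copies}x replication", 1, copies - 1)


def erasure_coding(k: int, m: int) -> ProtectionScheme:
    # A k+m erasure code stores k data fragments and m parity fragments.
    return ProtectionScheme(f"EC {k}+{m}", k, m)


if __name__ == "__main__":
    for scheme in (replication(3), erasure_coding(4, 2)):
        print(f"{scheme.name}: {scheme.raw_per_usable:.2f}x raw capacity, "
              f"tolerates {scheme.tolerated_losses} lost fragments")
```

In this illustration both layouts survive two lost fragments, but the erasure code needs half the raw capacity of 3-way replication. Replication, in turn, typically offers simpler and faster reads of small objects, which is one reason to assign policies per data type.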
As a fully S3 API-compliant, multi-tenant, multi-data center hybrid cloud storage system, HyperStore can also be used for other storage use cases. HyperStore is designed for simplicity and durability to ease day-to-day operations.
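Because the storage tier speaks the S3 API, generic S3 client libraries can be pointed at it by overriding the endpoint. The sketch below uses the third-party boto3 library to upload a database backup file. The endpoint URL, credentials, bucket name, and key layout are all hypothetical placeholders, and this is a generic S3 upload rather than a description of any Greenplum backup tooling.

```python
def s3_client_kwargs(endpoint_url, access_key, secret_key):
    """Keyword arguments for boto3.client("s3", ...) against a custom endpoint."""
    return {
        "endpoint_url": endpoint_url,  # hypothetical S3-compatible endpoint
        "aws_access_key_id": access_key,
        "aws_secret_access_key": secret_key,
    }


def backup_object_key(database, backup_timestamp, filename):
    """Illustrative key layout: backups/<database>/<timestamp>/<file>."""
    return f"backups/{database}/{backup_timestamp}/{filename}"


def upload_backup(local_path, bucket, key, **client_kwargs):
    """Upload one local file to the bucket using the S3 API."""
    import boto3  # third-party; imported lazily so the helpers above work without it

    client = boto3.client("s3", **client_kwargs)
    client.upload_file(local_path, bucket, key)


if __name__ == "__main__":
    # Placeholder values only; nothing is uploaded unless upload_backup is called.
    kwargs = s3_client_kwargs("https://s3.example.internal", "ACCESS_KEY", "SECRET_KEY")
    key = backup_object_key("sales", "20240101", "dump.gz")
    print(kwargs["endpoint_url"], key)
    # upload_backup("/tmp/dump.gz", "greenplum-backups", key, **kwargs)
```

Keeping the endpoint and credentials in one helper makes it straightforward to switch between sites or tenants without touching the upload logic.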