Systems | Development | Analytics | API | Testing

Delivering High Performance for Cloudera Data Platform Operational Database (HBase) When Using S3

CDP Operational Database (COD) is a real-time auto-scaling operational database powered by Apache HBase and Apache Phoenix. It is one of the main Data Services that runs on Cloudera Data Platform (CDP) Public Cloud. You can access COD right from your CDP console. With COD, application developers can now leverage the power of HBase and Phoenix without the overheads related to deployment and management.

How Hybrid and Cloud-Based Architectures are Unlocking the Power of Data

It takes vision, purpose, and skill to unlock the power of data. It also takes the right strategy. For ExxonMobil, Ares Trading (Merck), and the University of California San Diego (UCSD), the right strategy is taking full advantage of the cloud. All three organizations have partnered with Cloudera, leveraging a hybrid or cloud-based architecture to improve the lives of the people who depend on their organizations’ data.

In AI we trust? Why we Need to Talk About Ethics and Governance (part 2 of 2)

In part 1 of this blog post, we discussed the need to be mindful of data bias and the resulting consequences when certain parameters are skewed. Surely there are ways to comb through the data to minimise the risks from spiralling out of control. We need to get to the root of the problem. In 2019, the Gradient institute published a white paper outlining the practical challenges for Ethical AI.

Create your Private Data Warehousing Environment Using Azure Kubernetes Service

For Cloudera ensuring data security is critical because we have large customers in highly regulated industries like financial services and healthcare, where security is paramount. Also, for other industries like retail, telecom or public sector that deal with large amounts of customer data and operate multi-tenant environments, sometimes with end users who are outside of their company, securing all the data may be a very time intensive process.

The Cloudera Enterprise Data Cloud Maturity Report: Uncovering progressive steps towards a hybrid future

Global survey of IT and business executives reveals how a mature data strategy relates to business performance and resilience. Organizations fall under one of four categories when it comes to enterprise data maturity, and they need to be aware of how to address multi-dimensional challenges of a hybrid future.

In AI we Trust? Why we Need to Talk about Ethics and Governance (part 1 of 2)

Advances in the performance and capability of Artificial Intelligence (AI) algorithms has led to a significant increase in adoption in recent years. In a February 2021 report by IDC, they estimate that world-wide revenues from AI will grow by 16.4% in 2021 to USD $327 billion. Furthermore, AI adoption is becoming increasingly widespread and not just concentrated within a small number of organisations.

Empowering Digital Innovation Through Data and the Public Cloud Together with Amazon Web Services

As data continues to grow at an exponential rate, our customers are increasingly looking to advance and scale operations through digital transformation and the cloud. These modern digital businesses are also dealing with unprecedented rates of data volume, which is exploding from terabytes to petabytes and even exabytes which could prove difficult to manage.

Getting Started with Cloudera Data Platform Operational Database (COD)

Operational Database is a relational and non-relational database built on Apache HBase and is designed to support OLTP applications, which use big data. The operational database in Cloudera Data Platform has the following components: Atlas provides open metadata management and governance capabilities to build a catalog of all assets, and also classify and govern these assets. The SDX layer of CDP leverages the full spectrum of Atlas to automatically track and control all data assets.

Addressing the Three Scalability Challenges in Modern Data Platforms

In legacy analytical systems such as enterprise data warehouses, the scalability challenges of a system were primarily associated with computational scalability, i.e., the ability of a data platform to handle larger volumes of data in an agile and cost-efficient way.