Systems | Development | Analytics | API | Testing

November 2021

The Cloudera Enterprise Data Cloud Maturity Report: Uncovering progressive steps towards a hybrid future

Global survey of IT and business executives reveals how a mature data strategy relates to business performance and resilience. Organizations fall under one of four categories when it comes to enterprise data maturity, and they need to be aware of how to address multi-dimensional challenges of a hybrid future.

In AI we Trust? Why we Need to Talk about Ethics and Governance (part 1 of 2)

Advances in the performance and capability of Artificial Intelligence (AI) algorithms has led to a significant increase in adoption in recent years. In a February 2021 report by IDC, they estimate that world-wide revenues from AI will grow by 16.4% in 2021 to USD $327 billion. Furthermore, AI adoption is becoming increasingly widespread and not just concentrated within a small number of organisations.

Empowering Digital Innovation Through Data and the Public Cloud Together with Amazon Web Services

As data continues to grow at an exponential rate, our customers are increasingly looking to advance and scale operations through digital transformation and the cloud. These modern digital businesses are also dealing with unprecedented rates of data volume, which is exploding from terabytes to petabytes and even exabytes which could prove difficult to manage.

Getting Started with Cloudera Data Platform Operational Database (COD)

Operational Database is a relational and non-relational database built on Apache HBase and is designed to support OLTP applications, which use big data. The operational database in Cloudera Data Platform has the following components: Atlas provides open metadata management and governance capabilities to build a catalog of all assets, and also classify and govern these assets. The SDX layer of CDP leverages the full spectrum of Atlas to automatically track and control all data assets.

Addressing the Three Scalability Challenges in Modern Data Platforms

In legacy analytical systems such as enterprise data warehouses, the scalability challenges of a system were primarily associated with computational scalability, i.e., the ability of a data platform to handle larger volumes of data in an agile and cost-efficient way.

Make Your Models Matter: What It Takes to Maximize Business Value from Your Machine Learning Initiatives

We are excited by the endless possibilities of machine learning (ML). We recognise that experimentation is an important component of any enterprise machine learning practice. But, we also know that experimentation alone doesn’t yield business value. Organizations need to usher their ML models out of the lab (i.e., the proof-of-concept phase) and into deployment, which is otherwise known as being “in production”.

New Applied ML Prototypes Now Available in Cloudera Machine Learning

It’s no secret that Data Scientists have a difficult job. It feels like a lifetime ago that everyone was talking about data science as the sexiest job of the 21st century. Heck, it was so long ago that people were still meeting in person! Today, the sexy is starting to lose its shine. There’s recognition that it’s nearly impossible to find the unicorn data scientist that was the apple of every CEO’s eye in 2012.

NiFi as a Function in DataFlow Service

With the general availability of Cloudera DataFlow for the Public Cloud (CDF-PC), our customers can now self-serve deployments of Apache NiFi data flows on Kubernetes clusters in a cost effective way providing auto scaling, resource isolation and monitoring with KPI-based alerting. You can find more information in this release announcement blog post and in this technical deep dive blog post. Any customer willing to run NiFi flows efficiently at scale should now consider adopting CDF-PC.

The Rise of Unstructured Data

The word “data” is ubiquitous in narratives of the modern world. And data, the thing itself, is vital to the functioning of that world. This blog discusses quantifications, types, and implications of data. If you’ve ever wondered how much data there is in the world, what types there are and what that means for AI and businesses, then keep reading!

Defining Simplicity for Enterprise Software as "a 10 Year Old Can Demo it"

Arjun (my son) sat next to me at my desk. He was a bit nervous but we had practiced 3 times before he was ‘on stage’ in front of hundreds of people and the zoom meeting turned to him. My ten year old began to demonstrate how to deploy an Operational Database in AWS, showcasing how auto-scaling worked and how to set up replication. All of the sales team and my colleagues were quite impressed with him, and I am very proud of him.

Introducing Cloudera DataFlow for the Public Cloud

With the rise of streaming data (or, data-in-motion), companies must figure out how to deliver high-scale data ingestion, transformation, and management. In this session, you’ll see how Cloudera Data Platform’s (CDP) new DataFlow service provides real-time data movement capabilities to address hybrid cloud use cases.

Sentry to Ranger - A Concise Guide

Cloudera Data Platform (CDP) brings many improvements to customers by merging technologies from the two legacy platforms, Cloudera Enterprise Data Hub (CDH) and Hortonworks Data Platform (HDP). CDP includes new functionalities as well as superior alternatives to some previously existing functionalities in security and governance. One such major change for CDH users is the replacement of Sentry with Ranger for authorization and access control.

Cloudera Addresses Executive Order on Improving U.S. Cybersecurity with Data Analytics

SANTA CLARA, Calif., Nov. 9, 2021 – Cloudera, the enterprise data cloud company, today announced Cloudera Data Platform capabilities available to help federal agencies meet requirements of the Biden Administration’s Executive Order on improving the Nation’s cybersecurity. Cloudera is committed to supporting the federal government in adhering to this executive order with the company’s technology and special government rates.

A Fresh Squeeze on Data

Guest Author Roozbeh Aliabadi is CEO at ReadyAI Our children have the right to be AI-educated so they can thrive intellectually, emotionally, and morally alongside AI. In the next decade or so, for most children, AI will be their co-workers, drivers, insurance agents, customer service reps, bank tellers, receptionists, radiologists, in short, a natural part of their lives.

Switching from CPUs to GPUs for NYC Taxi Fare Predictions with NVIDIA RAPIDS

Have you ever asked a data scientist if they wanted their code to run faster? You would probably get a more varied response asking if the earth is flat. It really isn’t any different from anything else in tech, faster is almost always better. One of the best ways to make a substantial improvement in processing time is to, if you haven’t already, switched from CPUs to GPUs.

Accelerate Insight with Proactive Data Governance Practices

Becoming a data-driven organization is not exactly getting any easier. Businesses are flooded with ever more data. Although it is true that more data enables more insight, the effort needed to separate the wheat from the chaff grows exponentially. Doing so and truly understanding the data is more important than ever, especially when data privacy regulations are tightening.