Latest Posts

Cloudera's RHEL-volution: Powering the Cloud with Red Hat

Mar 22, 2024 By Blake Tow In Cloudera

As enterprise AI technologies rapidly reshape our digital environment, the foundation of your cloud infrastructure is more critical than ever. That’s why Cloudera and Red Hat, renowned for their open-source solutions, have teamed up to bring Red Hat Enterprise Linux (RHEL) to Cloudera on public cloud as the operating system for all of our public cloud platform images. Let’s dive into what this means and why it’s a game-changer for our customers.

Read Post

Cloudera

Read more about Cloudera's RHEL-volution: Powering the Cloud with Red Hat

A Closer Look at The Next Phase of Cloudera's Hybrid Data Lakehouse

Mar 5, 2024 By Wim Stoop In Cloudera

Artificial Intelligence (AI) is primed to reshape the way just about every business operates. Cloudera research projected that more than one third (36%) of organizations in the U.S. are in the early stages of exploring the potential for AI implementation. But even with its rise, AI is still a struggle for some enterprises. AI, and any analytics for that matter, are only as good as the data upon which they are based. And that’s where the rub is.

Read Post

Cloudera

Read more about A Closer Look at The Next Phase of Cloudera's Hybrid Data Lakehouse

Metadata Management & Data Governance with Cloudera SDX

Mar 4, 2024 By Pablo Quinones In Cloudera

In this article, we will walk you through the process of implementing fine grained access control for the data governance framework within the Cloudera platform. This will allow a data office to implement access policies over metadata management assets like tags or classifications, business glossaries, and data catalog entities, laying the foundation for comprehensive data access control.

Read Post

Cloudera

Read more about Metadata Management & Data Governance with Cloudera SDX

Using Streams Replication Manager Prefixless Replication for Kafka Topic Aggregation

Feb 28, 2024 By Tamas Barnabas Egyed In Cloudera

Businesses often need to aggregate topics because it is essential for organizing, simplifying, and optimizing the processing of streaming data. It enables efficient analysis, facilitates modular development, and enhances the overall effectiveness of streaming applications. For example, if there are separate clusters, and there are topics with the same purpose in the different clusters, then it is useful to aggregate the content into one topic.

Read Post

Cloudera

Read more about Using Streams Replication Manager Prefixless Replication for Kafka Topic Aggregation

Back to the Financial Regulatory Future

Feb 15, 2024 By Joe Rodriguez In Cloudera

It’s hard to believe it’s been 15 years since the global financial crisis of 2007/2008. While this might be a blast from the past we’d rather leave in the proverbial rear-view mirror, in March of 2023 we were back to the future with the collapse of Silicon Valley Bank (SVB), the largest US bank to fail since 2008.

Read Post

Cloudera

Read more about Back to the Financial Regulatory Future

Optimization Strategies for Iceberg Tables

Feb 14, 2024 By Srinivas Rishindra Pothireddi In Cloudera

Apache Iceberg has recently grown in popularity because it adds data warehouse-like capabilities to your data lake making it easier to analyze all your data—structured and unstructured. It offers several benefits such as schema evolution, hidden partitioning, time travel, and more that improve the productivity of data engineers and data analysts. However, you need to regularly maintain Iceberg tables to keep them in a healthy state so that read queries can perform faster.

Read Post

Cloudera

Read more about Optimization Strategies for Iceberg Tables

High Availability (Multi-AZ) for Cloudera Operational Database

Feb 13, 2024 By Andor Molnar In Cloudera

In the previous blog post we covered the high availability feature of Cloudera Operational Database (COD) in Amazon AWS. Cloudera recently released a new version of COD, which adds HA support to Microsoft Azure-based databases in the Cloud. In this post, we’ll perform a similar test to validate that the feature works as expected in Azure, too.

Read Post

Cloudera

Read more about High Availability (Multi-AZ) for Cloudera Operational Database

DNS Zone Setup Best Practices on Azure

Feb 12, 2024 By Dongkai Yu In Cloudera

In Cloudera deployments on public cloud, one of the key configuration elements is the DNS. Get it wrong and your deployment may become wholly unusable with users unable to access and use the Cloudera data services. If the DNS is set up less ideal than it could be, connectivity and performance issues may arise. In this blog, we’ll take you through our tried and tested best practices for setting up your DNS for use with Cloudera on Azure.

Read Post

Cloudera

Read more about DNS Zone Setup Best Practices on Azure

Accelerating Queries on Iceberg Tables with Materialized Views

Feb 8, 2024 By Aman Sinha In Cloudera

This blog post describes support for materialized views for the Iceberg table format in Cloudera Data Warehouse. Apache Iceberg is a high-performance open table format for petabyte-scale analytic datasets. It has been designed and developed as an open community standard to ensure compatibility across languages and implementations.

Read Post

Cloudera

Read more about Accelerating Queries on Iceberg Tables with Materialized Views

Health Care Outside of the Box

Feb 7, 2024 By Monique Hesseling In Cloudera

How enterprise-grade data management creates better and more efficient care. In the last few years, the acceptance of telehealth has become more widespread as patients and providers found they could maintain continuity through phone and video collaboration, instead of in-person visits. In many cases, a level of care that once required a drive to the clinic or hospital could be delivered over a mobile phone or laptop, with no travel and no waiting room.

Read Post

Cloudera

Read more about Health Care Outside of the Box

Systems | Development | Analytics | API | Testing

Latest Posts

Cloudera's RHEL-volution: Powering the Cloud with Red Hat

A Closer Look at The Next Phase of Cloudera's Hybrid Data Lakehouse

Metadata Management & Data Governance with Cloudera SDX

Using Streams Replication Manager Prefixless Replication for Kafka Topic Aggregation

Back to the Financial Regulatory Future

Optimization Strategies for Iceberg Tables

High Availability (Multi-AZ) for Cloudera Operational Database

DNS Zone Setup Best Practices on Azure

Accelerating Queries on Iceberg Tables with Materialized Views

Health Care Outside of the Box

Monthly Archive

Follow Us