Cloudera

CDP Private Cloud ends the battle between agility & control in the data center

Jul 13, 2020 By Tom Deane In Cloudera

As a BI Analyst, have you ever encountered a dashboard that wouldn’t refresh because other teams were using it? As a data scientist, have you ever had to wait 6 months before you could access the latest version of Spark? As an application architect, have you ever been asked to wait 12 weeks before you could get hardware to onboard a new application?

Read Post

Cloudera

Read more about CDP Private Cloud ends the battle between agility & control in the data center

Apache Hadoop YARN in CDP Data Center 7.1: What's new and how to upgrade

Jul 10, 2020 By Szilard Nemeth In Cloudera

This blogpost will cover how customers can migrate clusters and workloads to the new Cloudera Data Platform – Data Center 7.1 (CDP DC 7.1 onwards) plus highlights of this new release. CDP DC 7.1 is the on-premises version of Cloudera Data Platform.

Read Post

Cloudera

Read more about Apache Hadoop YARN in CDP Data Center 7.1: What's new and how to upgrade

Overview of the Operational Database performance in CDP

Jul 9, 2020 By Liliana Kadar In Cloudera

This article gives you an overview of Cloudera’s Operational Database (OpDB) performance optimization techniques. Cloudera’s Operational Database can support high-speed transactions of up to 185K/second per table and a high of 440K/second per table. On average, the recorded transaction speed is about 100K-300K/second per node. This article provides you an overview of how you can optimize your OpDB deployment in either Cloudera Data Platform (CDP) Public Cloud or Data Center.

Read Post

Cloudera

Read more about Overview of the Operational Database performance in CDP

Eliminate the pitfalls on your path to public cloud

Jul 8, 2020 By Wim Stoop In Cloudera

As organizations look to get smarter and more agile in how they gain value and insight from their data, they are now able to take advantage of a fundamental shift in architecture. In the last decade, as an industry, we have gone from monolithic machines with direct-attached storage to VMs to cloud. The main attraction of cloud is due to its separation of compute and storage – a major architectural shift in the infrastructure layer that changes the way data can be stored and processed.

Read Post

Cloudera

Read more about Eliminate the pitfalls on your path to public cloud

How to run queries periodically in Apache Hive

Jul 8, 2020 By Zoltan Haindrich In Cloudera

In the lifecycle of a data warehouse in production, there are a variety of tasks that need to be executed on a recurring basis. To name a few concrete examples, scheduled tasks can be related to data ingestion (inserting data from a stream into a transactional table every 10 minutes), query performance (refreshing a materialized view used for BI reporting every hour), or warehouse maintenance (executing replication from one cluster to another on a daily basis).

Read Post

Cloudera

Read more about How to run queries periodically in Apache Hive

Introducing FlinkSQL in Cloudera Streaming Analytics

Jul 7, 2020 By Marton Balassi In Cloudera

Our 1.2.0.0 release of Cloudera Streaming Analytics Powered by Apache Flink brings a wide range of new functionality, including support for lineage and metadata tracking via Apache Atlas, support for connecting to Apache Kudu and the first iteration of the much-awaited FlinkSQL API. Flink’s SQL interface democratizes stream processing, as it caters to a much larger community than the currently widely used Java and Scala APIs focusing on the Data Engineering crowd.

Read Post

Cloudera

Read more about Introducing FlinkSQL in Cloudera Streaming Analytics

Are you prepared to mature to 'ready-made' data management?

Jun 30, 2020 By Wim Stoop In Cloudera

When it comes to furnishing our living spaces, it seems we go through phases. When I was just setting out and leaving home, IKEA was my preferred furniture store. You make your choice, collect all the flat-pack boxes, lug them home, and after some hex key gymnastics: voilà. You’ve truly made it! Since then, I’ve drifted from the “some assembly required” phase to the “ready-made” one.

Read Post

Cloudera

Read more about Are you prepared to mature to 'ready-made' data management?

CDP Private Cloud ends the battle between agility & control in the data center

Jun 25, 2020 By Tom Deane In Cloudera

As a BI Analyst, have you ever encountered a dashboard that wouldn’t refresh because other teams were using it? As a data scientist, have you ever had to wait 6 months before you could access the latest version of Spark? As an application architect, have you ever been asked to wait 12 weeks before you could get hardware to onboard a new application?

Read Post

Cloudera

Read more about CDP Private Cloud ends the battle between agility & control in the data center

Why an integrated analytics platform is the right choice

Jun 25, 2020 By Wim Stoop In Cloudera

Companies realize that in order to grow, connect products and services, or protect their business, they need to become data-driven. In selecting the tools to realize these goals, organizations effectively have two choices: a self-selected combination of analytics tools and applications or a unified platform that handles all. In this blog we will discuss the challenges of the former choice that will provide justification for the latter.

Read Post

Cloudera

Read more about Why an integrated analytics platform is the right choice

Multi-Raft - Boost up write performance for Apache Hadoop-Ozone

Jun 24, 2020 By Guest Author In Cloudera

Apache Hadoop-Ozone is a new-era object storage solution for Big Data platform. It is scalable with strong consistency. Ozone uses Raft protocol, implemented by Apache Ratis (Incubating), to achieve high availability in its distributed system. My team in Tencent started to introduce Ozone as a backend object storage in production a few months ago and we’re onboarding more and more data warehouse users.

Read Post

Cloudera

Read more about Multi-Raft - Boost up write performance for Apache Hadoop-Ozone

Systems | Development | Analytics | API | Testing

Cloudera

CDP Private Cloud ends the battle between agility & control in the data center

Apache Hadoop YARN in CDP Data Center 7.1: What's new and how to upgrade

Overview of the Operational Database performance in CDP

Eliminate the pitfalls on your path to public cloud

How to run queries periodically in Apache Hive

Introducing FlinkSQL in Cloudera Streaming Analytics

Are you prepared to mature to 'ready-made' data management?

CDP Private Cloud ends the battle between agility & control in the data center

Why an integrated analytics platform is the right choice

Multi-Raft - Boost up write performance for Apache Hadoop-Ozone

Monthly Archive

Follow Us