February 2020

Take Control of Your Destiny, Leave Retail Laggards in the Dust

Feb 25, 2020 By Brent Biddulph In Cloudera

Ongoing reports of the “Retail Apocalypse” were fueled once again in 2019 with more than a dozen well-known retail brands closing their doors forever. On the flip side, a “Retail Renaissance” is well underway – and signs indicate that retail leaders that have already invested in their digital transformation journey will continue to reap rewards well into the future.

Read Post

Cloudera

Read more about Take Control of Your Destiny, Leave Retail Laggards in the Dust

Why Data Chain of Custody is Essential to Reducing Product Liability Risks

Feb 24, 2020 By Michael Ger In Cloudera

When a market grows as quickly as implantable medical devices, set to top a staggering $153.8 billion by 2026, the potential risk to patients can rise as well. As implantable medical devices proliferate, so do the number of costly, life-threatening, and reputation-tarnishing recalls. A single large recall can account for millions of device units.

Read Post

Cloudera

Read more about Why Data Chain of Custody is Essential to Reducing Product Liability Risks

Real-time log aggregation with Apache Flink Part 2

Feb 20, 2020 By Gyula Fora In Cloudera

We are continuing our blog series about implementing real-time log aggregation with the help of Flink. In the first part of the series we reviewed why it is important to gather and analyze logs from long-running distributed jobs in real-time. We also looked at a fairly simple solution for storing logs in Kafka using configurable appenders only. As a reminder let’s review our pipeline again

Read Post

Cloudera

Read more about Real-time log aggregation with Apache Flink Part 2

Day in the Life of a Cloudera Data Platform Admin

Feb 18, 2020 By Cloudera In Cloudera

Cloudera Data Platform (CDP) on Public Cloud makes being an admin for a big data platform even easier thanks to SDX. Watch me spend a day at a temp position for Aperture Cybertronics as their Data Admin. I'll quickly deploy clusters, grants users access, and change performance settings such as autoscaling for the Aperture Cybertornics' staff.

View Video

Cloudera

Analytics
BI

Read more about Day in the Life of a Cloudera Data Platform Admin

Benchmarking Ozone: Cloudera's next-generation Storage for CDP

Feb 14, 2020 By Istvan Fajth In Cloudera

Apache Hadoop Ozone was designed to address the scale limitation of HDFS with respect to small files and the total number of file system objects. On current data center hardware, HDFS has a limit of about 350 million files and 700 million file system objects. Ozone’s architecture addresses these limitations[4]. This article compares the performance of Ozone with HDFS, the de-facto big data file system.

Read Post

Cloudera

Read more about Benchmarking Ozone: Cloudera's next-generation Storage for CDP

Searcher Seismic is utilizing seismic data for the oil and gas industry providing a map to de-risk exploration

Feb 11, 2020 By Diana Yanez-Pastor In Cloudera

In today’s age of technology, the processing of seismic data requires powerful computers, talented researchers, software, and skills. For the Oil and Gas Industry, its paramount to making strategic business decisions. Seismic data accurately helps to plan for wells, reduce the need for further exploration, and minimizes the impact on the environment.

Read Post

Cloudera

Read more about Searcher Seismic is utilizing seismic data for the oil and gas industry providing a map to de-risk exploration

Disk and Datanode Size in HDFS

Feb 6, 2020 By Lokesh Jain In Cloudera

This blog discusses answers to questions like what is the right disk size in datanode and what is the right capacity for a datanode. A few of our customers have asked us about using dense storage nodes. It is certainly possible to use dense nodes for archival storage because IO bandwidth requirements are usually lower for cold data. However the decision to use denser nodes for hot data must be evaluated carefully as it can have an impact on the performance of the cluster.

Read Post

Cloudera

Read more about Disk and Datanode Size in HDFS

How Florida State University is Boosting Student Success and Addressing Data Challenges

Feb 3, 2020 By Matt Spillar In Cloudera

For public universities, metrics such as retention rate and graduation rate are important indicators for standing out in the competitive landscape. These success metrics are paramount to bringing in more students, making them successful, and continuing to grow a strong alumni network.

Read Post

Cloudera

Read more about How Florida State University is Boosting Student Success and Addressing Data Challenges

Systems | Development | Analytics | API | Testing

February 2020

Take Control of Your Destiny, Leave Retail Laggards in the Dust

Why Data Chain of Custody is Essential to Reducing Product Liability Risks

Real-time log aggregation with Apache Flink Part 2

Day in the Life of a Cloudera Data Platform Admin

Benchmarking Ozone: Cloudera's next-generation Storage for CDP

Searcher Seismic is utilizing seismic data for the oil and gas industry providing a map to de-risk exploration

Disk and Datanode Size in HDFS

How Florida State University is Boosting Student Success and Addressing Data Challenges

Monthly Archive

Follow Us