Systems | Development | Analytics | API | Testing

Minimizing Supply Chain Disruptions with Advanced Analytics

January 2020 is a distant memory, but for most, the early days of the pandemic was a time that will be ingrained in memories for decades, if not generations. Over the last 18 months, supply chain issues have dominated our nightly news, social feeds and family conversations at the dinner table. Some but not all have stemmed from the pandemic.

Replace and Boost your Apache Storm Topologies with Apache NiFi Flows

Recently, I worked with a large fortune 500 customer on their migration from Apache Storm to Apache NiFi. If you’re asking yourself, “Isn’t Storm for complex event processing and NiFi for simple event processing?”, you’re correct. A few customers chose a complex event engine like Apache Storm for their simple event processing, even when Apache NiFi is the more practical choice, cutting drastically down on SDLC (software development lifecycle) time.

Five Strategies to Accelerate Data Product Development

With this first article of the two-part series on data product strategies, I am presenting some of the emerging themes in data product development and how they inform the prerequisites and foundational capabilities of an Enterprise data platform that would serve as the backbone for developing successful data product strategies.

Beginner's Guide to Cloudera Operational Database

My name is Shanmukha Kota and I am a recent graduate from University at Buffalo. I interned with Cloudera last summer and joined Cloudera as a software engineer a couple of weeks ago and this is my first experience with CDP and CDP Operational Database. For a new hire college graduate in the industry with only academic experience with HBase, I can only say it is very simple and easy to set up and work with CDP Operational Database.

Accelerate Offloading to Cloudera Data Warehouse (CDW) with Procedural SQL Support

Did you know Cloudera customers, such as SMG and Geisinger, offloaded their legacy DW environment to Cloudera Data Warehouse (CDW) to take advantage of CDW’s modern architecture and best-in-class performance? In addition to substantial cost savings upon moving to CDW, Geisinger is also able to search through hundreds of million patient note records in seconds providing better treatment to their patients.

Future of Data Meetup: Hello, Kafka! (An Introduction to Apache Kafka)

Our “Hello, “ series of introductory “Big Data” topic-focused meetups returns to Boston in July as we deliver our fifth event. This meetup will introduce you to Apache Kafka without assuming you’ve heard anything about the Apache development project, the problems that Kafka was designed to solve or the role it currently plays in modern enterprise data architectures.

A Reference Architecture for the Cloudera Private Cloud Base Data Platform

The release of Cloudera Data Platform (CDP) Private Cloud Base edition provides customers with a next generation hybrid cloud architecture. This blog post provides an overview of best practice for the design and deployment of clusters incorporating hardware and operating system configuration, along with guidance for networking and security as well as integration with existing enterprise infrastructure.