Systems | Development | Analytics | API | Testing

Latest Posts

Announcing support for Apache Flink with the GA of Cloudera Streaming Analytics

We cannot hold our excitement anymore! For the last few months, our Data-in-Motion engineering teams have been working hard to deliver a compelling and critical part of our Cloudera DataFlow (CDF) story. To enhance our Stream Processing and Analytics narrative within the overall Data-in-Motion platform, we give you support for Apache Flink with the general availability of Cloudera Streaming Analytics (CSA).

Placing the Emphasis on Data in the Federal Data Strategy

In mid-June of 2019, the White House Office of Management and Budget (OMB) released the Draft 2019-2020 Federal Data Strategy Action Plan. The plan outlines a series of steps and principles targeting effective governance, responsibilities and best practices for federal agencies’ use of citizen data. When put into place, these action items will allow government agencies to maximize data, improve security and better serve constituents.

How Scania is Driving Logistical Efficiency and Sustainability with Big Data

Organizations in the transportation and manufacturing industries are applying Industrial IoT concepts and technology to transform product development, supply chains, and manufacturing operations. Scania is driving logistical efficiency and sustainability with big data. Scania is a world-leading provider of transport solutions and is leading the shift towards sustainable transport systems. In 2018 it delivered 88,000 trucks, 8,500 buses as well as 12,800 industrial and marine engines to customers.

Introducing Apache Spark on Docker on top of Apache YARN with CDP DataCenter release

Bringing your own libraries to run a Spark job on a shared YARN cluster can be a huge pain. In the past, you had to install the dependencies independently on each host or use different Python package management softwares. Nowadays Docker provides a much simpler way of packaging and managing dependencies so users can easily share a cluster without running into each other, or waiting for central IT to install packages on every node.

Three Trends in Cloud Computing to Expect in 2020

A new year is upon us and that means it’s time to look ahead to what’s coming next. In cloud computing, organizations are going to be making adjustments in 2020 – to accommodate overstrained budgets, new regulations, and shifting technologies. It will be a year of identifying what’s not working and moving toward the right solutions. Let’s take a look at three trends that will impact cloud computing across all industries in the coming year.