Systems | Development | Analytics | API | Testing

Introducing FlinkSQL in Cloudera Streaming Analytics

Our 1.2.0.0 release of Cloudera Streaming Analytics Powered by Apache Flink brings a wide range of new functionality, including support for lineage and metadata tracking via Apache Atlas, support for connecting to Apache Kudu and the first iteration of the much-awaited FlinkSQL API. Flink’s SQL interface democratizes stream processing, as it caters to a much larger community than the currently widely used Java and Scala APIs focusing on the Data Engineering crowd.

Are you prepared to mature to 'ready-made' data management?

When it comes to furnishing our living spaces, it seems we go through phases. When I was just setting out and leaving home, IKEA was my preferred furniture store. You make your choice, collect all the flat-pack boxes, lug them home, and after some hex key gymnastics: voilà. You’ve truly made it! Since then, I’ve drifted from the “some assembly required” phase to the “ready-made” one.

Why an integrated analytics platform is the right choice

Companies realize that in order to grow, connect products and services, or protect their business, they need to become data-driven. In selecting the tools to realize these goals, organizations effectively have two choices: a self-selected combination of analytics tools and applications or a unified platform that handles all. In this blog we will discuss the challenges of the former choice that will provide justification for the latter.

CDP Private Cloud ends the battle between agility & control in the data center

As a BI Analyst, have you ever encountered a dashboard that wouldn’t refresh because other teams were using it? As a data scientist, have you ever had to wait 6 months before you could access the latest version of Spark? As an application architect, have you ever been asked to wait 12 weeks before you could get hardware to onboard a new application?

Multi-Raft - Boost up write performance for Apache Hadoop-Ozone

Apache Hadoop-Ozone is a new-era object storage solution for Big Data platform. It is scalable with strong consistency. Ozone uses Raft protocol, implemented by Apache Ratis (Incubating), to achieve high availability in its distributed system. My team in Tencent started to introduce Ozone as a backend object storage in production a few months ago and we’re onboarding more and more data warehouse users.

The Rise Of Connected Manufacturing And How Data Is Driving Innovation, Part I

This interview was conducted by Cindy Maike, VP Industry Solutions The shift towards Industry 4.0 is improving manufacturing efficiency and the factory of the future will increasingly be driven by technology like the Internet of Things (IoT), Automation, Artificial Intelligence (AI), and Cloud Computing.

Auto-TLS in Cloudera Data Platform Data Center

Wire encryption protects data in motion, and Transport Layer Security (TLS) is the most widely used security protocol for wire encryption. TLS provides authentication, privacy and data integrity between applications communicating over a network by encrypting the packets transmitted between endpoints. Users interact with Hadoop clusters via browser or command line tools, while applications use REST APIs or Thrift.

Build on your investment by Migrating or Upgrading to CDP Data Center

Cloudera Data Platform (CDP) Data Center(DC) is the on-premises release of Cloudera Data Platform. CDP DC combines the best services and components from Cloudera Enterprise Data Hub and Hortonworks Data Platform Enterprise along with new features and enhancements across the stack to deliver the premier on-premises enterprise data platform . This unified distribution is a scalable and customizable platform where you can securely run many types of workloads.

Are Your Machine Learning Models Wrong?

In addition to the very real negative impact on every person around the world, the COVID-19 pandemic is driving business disruptions and closures at an unprecedented scale. Enormous government stimulus programs are resulting in explosions in fiscal deficits, regulators are relaxing capital constraints on banks and central banks are supporting economic stability with a range of interest rate cuts and other stimulus measures.

Enterprise Data Platforms - Should Organizations Build or Buy?

“Build vs Buy” is an important decision every technology strategist has to make. With the rise of open source and the wealth of freely available software, organizations have the flexibility to build custom solutions when off-the-shelf solutions don’t directly address their needs. In the domain of enterprise data platforms, many organizations have leveraged the open-source ecosystem to build tailored solutions, expending a lot of resources in the process.