Palo Alto, CA, USA
Nov 18, 2022   |  By Jimit Patel
Stream processing is about creating business value by applying logic to your data while it is in motion. Many times that involves combining data sources to enrich a data stream. Flink SQL does this and directs the results of whatever functions you apply to the data into a sink.
Nov 16, 2022   |  By Sandra Horn
I recently had the privilege of attending the CDAO event in Boston hosted by Corinium. Tracks represented financial services, insurance, retail and consumer packaged goods, and healthcare. Overall, it struck me that while data science is not new, most firms are still defining the mission of the data office and data officer. It’s clear firms seek to leverage data and embrace its potential insights, but most are forging ahead in largely uncharted territory.
Nov 15, 2022   |  By Wellington Chevreuil
CDP Operational Database (COD) is a real-time auto-scaling operational database powered by Apache HBase and Apache Phoenix. It is one of the main data services that run on Cloudera Data Platform (CDP) Public Cloud. You can access COD from your CDP console.
Nov 8, 2022   |  By Tsz Sze
Cloudera has been working on Apache Ozone, an open-source project to develop a highly scalable, highly available, strongly consistent distributed object store. Ozone is able to scale to billions of objects and hundreds of petabytes of data. It enables cloud-native applications to store and process massive amounts of data in a hybrid multi-cloud environment and on premises.
Nov 1, 2022   |  By Tej Tenmattam
It’s no secret that IT modernization is a top priority for the US federal government. A quick trip in the congressional time machine to revisit 2017’s Modernizing Government Technology Act surfaces some of the most salient points regarding agencies’ challenges: In the private sector, excluding highly regulated industries like financial services, the migration to the public cloud was the answer to most IT modernization woes, especially those around data, analytics, and storage.
Oct 28, 2022   |  By Brian Lachance
A recent headline in Wired magazine read “Uber Hack’s Devastation Is Just Starting to Reveal Itself.” No corporation wants that headline or the reputational damage and financial loss it may cause. In the case of Uber, it was a relatively simple attack using an approach called multi-factor authentication (MFA) fatigue, in which an attacker takes advantage of authentication systems that require account owners to approve a login.
Oct 27, 2022   |  By Máté Szalay-Bekő
The Apache Solr cluster is available in CDP Public Cloud, using the “Data exploration and analytics” data hub template. In this article we will investigate how to connect to the Solr REST API running in the Public Cloud, and highlight the performance impact of session cookie configurations when Apache Knox Gateway is used to proxy the traffic to Solr servers. Information in this blog post can be useful for engineers developing Apache Solr client applications.
Oct 26, 2022   |  By Paul Codding
It’s no secret that advancements like AI and machine learning (ML) can have a major impact on business operations. In Cloudera’s recent report Limitless: The Positive Power of AI, we found that 87% of business decision makers are achieving success through existing ML programs. Among the top benefits of ML, 59% of decision makers cite time savings, 54% cite cost savings, and 42% believe ML enables employees to focus on innovation as opposed to manual tasks.
Oct 25, 2022   |  By Brian Lachance
Enabling data and analytics in the cloud allows you to have infinite scale and unlimited possibilities to gain faster insights and make better decisions with data. The data lakehouse is gaining in popularity because it enables a single platform for all your enterprise data with the flexibility to run any analytic and machine learning (ML) use case. Cloud data lakehouses provide significant scaling, agility, and cost advantages compared to cloud data lakes and cloud data warehouses.
Oct 24, 2022   |  By Abhas Ricky
Demand for both entry-level and highly skilled tech talent is at an all-time high, and companies across industries and geographies are struggling to find qualified employees. And, with 1.1 billion jobs liable to be radically transformed by technology in the next decade, a “reskilling revolution” is reaching a critical mass.
Nov 11, 2022   |  By Cloudera
Learn about the recent technical innovations from the Apache Ozone community. Bucket types and FSO improvements Speaker: Ethan Rose
Nov 7, 2022   |  By Cloudera
For years, companies have viewed data the wrong way. They see it as the byproduct of a business interaction, and this data often ends up collecting dust in centralized silos governed by data teams who lack the expertise to understand its true value. Cloudera is ushering in a new era of data architecture by allowing experts to organize and manage their own data at the source. Data mesh brings all your domains together so each team can benefit from each other’s data.
Oct 27, 2022   |  By Cloudera
In this meetup, we’ll look at the different options for enriching your data using Apache NiFi. When and why would we prefer using NiFi for enrichment over a potentially more holistic solution, like Flink or Spark? What are the limitations? And how can we get the best of both worlds, performing data enrichment with NiFi when it makes sense and using our CEP engine when that makes the most sense? Join John Kuchmek and Mark Payne to find out!
Oct 27, 2022   |  By Cloudera
Paul Codding introduces Cloudera's Applied ML Prototypes to accelerate machine learning applications in business.
Oct 21, 2022   |  By Cloudera
In this demo, we show how an analyst who knows only SQL can work independently to create sophisticated data transformation pipelines without the need for any engineering. Our CDP deployment simplifies all aspects of the software development lifecycle of dbt models.
Oct 13, 2022   |  By Cloudera
The speed at which you move data throughout your organization can be your next competitive advantage. Cloudera DataFlow greatly simplifies your data flow infrastructure, facilitating complex data collection and movement through a unified process that seamlessly transfers data throughout your organization, even as you scale. With Cloudera DataFlow for Public Cloud you can collect and move any data (structured, unstructured, and semi-structured) from any source to any destination with any frequency (real-time streaming, batch, and micro-batch).
Oct 11, 2022   |  By Cloudera
CDP Private Cloud Base 7.1.8 is here! This marks the next wave of Cloudera innovation on-premises for CDP. In this live stream, we’ll go through what’s in our latest release and highlight some of the exciting new features we’ve made available.
Sep 30, 2022   |  By Cloudera
Since its initial release in 2021, Cloudera DataFlow for Public Cloud (CDF-PC) has been helping customers solve data distribution use cases that need high throughput and low latency and therefore require always-running clusters. CDF-PC’s DataFlow Deployments provides a cloud-native runtime to run your Apache NiFi flows on auto-scaling Kubernetes clusters, along with centralized monitoring and alerting and an improved SDLC for developers.
Sep 26, 2022   |  By Cloudera
The Cloudera Applied Machine Learning Prototype (AMP) for continuous model monitoring acts as a customizable template for your data science team to quickly build an accurate way to track drift.
Sep 12, 2022   |  By Cloudera
The Applied Machine Learning Prototype (AMP) for anomaly detection reduces implementation time by providing a reference model that you can build from. Built by Fast Forward Labs, and tested on AMD EPYC™ CPUs with Dell Technologies, this AMP enables data scientists across industries to truly practice predictive maintenance.
Jun 28, 2018   |  By Cloudera
Enterprises require fast, cost-efficient solutions to the familiar challenges of engaging customers, reducing risk, and improving operational excellence to stay competitive. The cloud is playing a key role in accelerating time to benefit from new insights. Managed cloud services that automate provisioning, operation, and patching will be critical for enterprises to leverage the full promise of the cloud when it comes to time to value and agility.
Jun 26, 2018   |  By Cloudera
The adoption of cloud computing in the financial services sector has grown substantially in the past three years on a global basis. Diversification of risk is always a key concern for financial institutions and the seeming safety of having a single cloud provider is not being properly measured from a systemic risk and operational risk perspective.
Jun 12, 2018   |  By Cloudera
This white paper provides a reference architecture for running Enterprise Data Hub on Oracle Cloud Infrastructure. Topics include installation automation, automated configuration and tuning, and best practices for deployment and topology to support security and high availability.
May 17, 2018   |  By Cloudera
A cloud-based analytics platform needs to be easy, unified, and enterprise-grade to meet the demands of your business. This white paper covers how Cloudera's machine learning and analytics platform complements popular cloud services like Amazon Web Services (AWS) and Microsoft Azure, and enables customers to organize, process, analyze, and store data at large scale...anywhere.
May 15, 2018   |  By Cloudera
The Modern Platform for Machine Learning and Analytics Optimized for Cloud.
Mar 25, 2018   |  By Cloudera
In the wake of the global financial crisis, the world has become much more interconnected and immensely more complex. As a result, you can no longer simply look at the past as an indicator of future trends. The financial services industry needs real-time insights into numerous interacting variables to make informed decisions.

Cloudera delivers the modern platform for machine learning and analytics optimized for the cloud. Imagine having access to all your data in one platform. The opportunities are endless. We enable you to transform vast amounts of complex data into clear and actionable insights to enhance your business and exceed your expectations.

The right products for the job:

  • Enterprise Data Hub: Operate with confidence—thanks to comprehensive security and governance—while at the same time enabling unrivaled self-service performance at extreme scale. All in an enterprise-grade solution that lets you run anywhere, on-premises or in hybrid- and multi-cloud environments.
  • Data Science Workbench: Accelerate machine learning from research to production with a secure, self-service data science platform built for the enterprise.
  • Data Warehouse: A modern data warehouse that delivers an enterprise-grade, hybrid cloud solution designed for self-service analytics.
  • Data Science & Engineering: Cloudera Data Science provides better access to Apache Hadoop data with familiar and performant tools that address all aspects of modern predictive analytics.
  • Altus Cloud: The industry’s first machine learning and analytics cloud platform built with a shared data experience.

The world’s leading organizations choose Cloudera to grow their businesses, improve lives, and advance human achievement.