Systems | Development | Analytics | API | Testing

Realtime data replication into BigQuery with Datastream and Dataflow

How can you replicate data from a relational database in real time? In this video, we’ll show you you can combine Datastream with Dataflow templates to replicate data from a relational database. Watch to learn how you can use this streaming analytics service in unison with Datastream to easily replicate data from Oracle to BigQuery in real time!

Embedded analytics 2.0: Your secret weapon to empowering frontline workers and locking in customers loyalty

Last year, Harvard Business Review and ThoughtSpot published a groundbreaking survey on the business benefits of empowering frontline workers with data. Revenues are higher, operations more efficient, customer service better, and employees happier. And yet, few organizations deploy BI this way, historically held back by the technology, conflicting priorities, and mindset.

Modernizing Data Pipelines using Cloudera Data Platform - Part 1

Data pipelines are in high demand in today’s data-driven organizations. As critical elements in supplying trusted, curated, and usable data for end-to-end analytic and machine learning workflows, the role of data pipelines is becoming indispensable. To keep up, data pipelines are being vigorously reshaped with modern tools and techniques.

Apache Ozone Metadata Explained

Apache Ozone is a distributed object store built on top of Hadoop Distributed Data Store service. It can manage billions of small and large files that are difficult to handle by other distributed file systems. As an important part of achieving better scalability, Ozone separates the metadata management among different services: Ozone Manager (OM) service manages the metadata of the namespace such as volume, bucket and keys.

Jeeves Grows Up: How an AI Chatbot Became Part of Unravel Data

Jeeves is the stereotypical English butler – and an AI chatbot that answers pertinent and important questions about Spark jobs in production. Shivnath Babu, CTO and co-founder of Unravel Data, spoke yesterday at Data + AI Summit, formerly known as Spark Summit, about the evolution of Jeeves, and how the technology has become a key supporting pillar within Unravel Data’s software.

The Ethics of AI Comes Down to Conscious Decisions

This blog post was written by Pedro Pereira as a guest author for Cloudera. Right now, someone somewhere is writing the next fake news story or editing a deepfake video. An authoritarian regime is manipulating an artificial intelligence (AI) system to spy on technology users. No matter how good the intentions behind the development of a technology, someone is bound to corrupt and manipulate it. Big data and AI amplify the problem. “If you have good intentions, you can make it very good.