How leading organizations govern their data to find success

With the increased focus on delivering value to customers, it is imperative to build a next-generation customer hub that delivers high-quality, governed data. In this video, we will share best practices for implementing a comprehensive data governance approach. Learn how to leverage the capabilities of the Talend Data Fabric to deploy a forward-looking data management architecture that detects and retrieves metadata from across databases and applications, builds data lineage, and adds traceability.

Hive vs. SQL: Which One Performs Data Analysis Better?

Key differences between Hive and SQL: Big data requires powerful tools. Successful organizations query, manage and analyze thousands of data sets from hundreds of data sources. This is where tools like Hive and SQL come in. Although very different, both query and program big data. But which tool is right for your organization? In this review, we compare Hive vs. SQL on features, prices, support, user scores, and more.

How to configure clients to connect to Apache Kafka Clusters securely - Part 1: Kerberos

This is the first installment in a short series of blog posts about security in Apache Kafka. In this article we will explain how to configure clients to authenticate with clusters using different authentication mechanisms.

Solution Architect: Become the Ultimate Problem Solver

There's an old XKCD cartoon that describes a conversation between a manager and a software developer. This kind of conversation happens all the time. Business leaders know their strategic goals. IT people know what the tech can do. But aligning goals with technology is an ongoing challenge. This is where solution architects come in. They act as a bridge between the business and technical sides, and they figure out how to get things done.

Cloudera Operational Database Infrastructure Planning Considerations

In this blog post, let us look at the infrastructure planning you may have to do when deploying an operational database cluster on a CDP Private Cloud Base deployment. Note that you may have to make some planning assumptions when designing your initial infrastructure, and the design must be flexible enough to scale up or down based on your future needs.

Beware of Creating a New Legacy of Artificial Intelligence Silos

Although the issue of silos in IT and data management is well known, companies appear to be falling back into this trap by not distributing their artificial intelligence (AI) and machine learning (ML) capabilities across their business. New research from Qlik and IDC revealed that just 20 percent of businesses widely distribute these capabilities across the organization.