Systems | Development | Analytics | API | Testing

Upgrade Hortonworks Data Platform (HDP) to Cloudera Data Platform (CDP) Private Cloud Base

CDP Private Cloud Base is an on-premises version of Cloudera Data Platform (CDP). This new product combines the best of Cloudera Enterprise Data Hub and Hortonworks Data Platform Enterprise along with new features and enhancements across the stack. This unified distribution is a scalable and customizable platform where you can securely run many types of workloads. CDP is an easy, fast, and secure enterprise analytics and management platform with the following capabilities.

Loading Data to Redshift: Five Options and One Solution

Around 95 percent of organizations say their inability to manage and comprehend data holds them back. It's no wonder, then, that so many of these companies are loading their data into a single location like Amazon Redshift. Redshift uses SQL to analyze data sets so users can solve organizational problems and make more profitable business decisions.

Make Your AWS Data Lake Deliver with ChaosSearch (Webinar Highlights)

When CTO James Dixon coined the term “data lake” in 2011, he imagined a single storage repository where organizations could store both structured and unstructured data in their raw format until it was needed for analytics. But without the right storage technology, data governance, or analytical tools, the first data lakes quickly became “data swamps” - morasses of data with no organizational structure and no efficient way to access or extract meaningful insights.

Top 3 Reasons Spreadsheets Miss the Mark for Cap Table Management

Cap tables are a valuable tool for a close look at the equity capitalization within your organization. But relying on static spreadsheets makes it difficult to gain a comprehensive, real-time view of your capitalization structure. Sifting through spreadsheets manually and reconciling disconnected systems are both time-consuming and cumbersome.

West Midlands Police Force | Faster Data, Safer Streets

The West Midlands Police force is one of the largest in the UK — and they serve a population of 2.8 million people, which generates a lot of data on a daily basis. See how they used Cloudera to make that data accessible and unified, which allowed West Midlands Police Force to better serve their community.

How Monte Carlo Built A Data Reliability Platform On Snowflake

In today's episode, Daniel Myers from Snowflake interviews Lior Gavish, co-founder of Monte Carlo. Monte Carlo is a data reliability platform built on Snowflake that helps teams trust their data by eliminating data downtime. Powered by Snowflake is a series where we interview technology leaders who are building businesses and applications on top of Snowflake.

Of Muffins and Machine Learning Models

While it is a little dated, one amusing example that has been the source of countless internet memes is the famous, “is this a chihuahua or a muffin?” classification problem. Figure 01: Is this a chihuahua or a muffin? In this example, the Machine Learning (ML) model struggles to differentiate between a chihuahua and a muffin.

6 steps towards healthier data

The value of healthy data is obvious. But how do you build that practice in your own business? The difference between people who live a healthy lifestyle and those who don’t isn’t whether they know how to be healthier — it’s whether or not they prioritize diet, sleep, and exercise in their daily life. The same is true for your data: if you don’t have the infrastructure that supports your customer 360 initiatives , those initiatives become moot.