Systems | Development | Analytics | API | Testing

Data Vault Techniques on Snowflake: Streams and Tasks on Views

Snowflake removes the need to perform maintenance tasks on your data platform and provides you with the freedom to choose your data model methodology for the cloud. When attempting to keep the cost of data processing low, both data volume and velocity can make things challenging.

Building an Automated ML Pipeline with a Feature Store Using Iguazio & Snowflake

When operationalizing machine and deep learning, a production-first approach is essential for moving from research and development to scalable production pipelines in a much faster and more effective manner. Without the need to refactor code, add glue logic and spend significant efforts on data and ML engineering, more models will make it to production and with less issues like drift.

Why ETL is Critical for Ecommerce Data Success & How to Start

It’d be hard to find anyone who’d say that taking a data-driven approach to business decisions is not worthwhile. Yet, so many businesses aren’t doing it because, as simple as it may sound on paper, it takes a great deal of strategic planning to pull off. One of the most crucial tools when it comes to accomplishing a data-driven decision-making process is known as ETL.

DataOps Observability Designed for Data Teams

Today every company is a data company. And even with all the great new data systems and technologies, it’s people—data teams—who unlock the power of data to drive business value. But today’s data teams are getting bogged down. They’re struggling to keep pace with the increased volume, velocity, variety, complexity—and cost—of the modern data stack. That’s where Unravel DataOps observability comes in.

About the State of Value Stream Management in 2022

Value Stream Management (VSM) is about empowering delivery organizations to measure, mitigate, and monitor complexity. Simply put, it aims at improving the flow of value in your organisation. The VSM Consortium recently released their highly anticipated report on “The State of Value Stream Management 2022” . In this post we recap some of the findings and look at it specifically from a software engineering and platform engineering point of view.

SCIM (System for Cross-domain Identity Management)

The identity team at Cloudera has been working to add the System for Cross-domain Identity Management (SCIM) support to Cloudera Data Platform (CDP) and we’re happy to announce the general availability of SCIM on Azure Active Directory! In Part One we discussed: CDP SCIM Support for Active Directory, which discusses the core elements of CDP’s SCIM support for Azure AD.

How To Deploy a HuggingFace Model (Seamlessly)

What if I want to serve a Huggingface model on ClearML? Where do I start? In general, machine learning engineers know by now that a good model serving engine is invaluable when serving models in production. These days, NVIDIA’s Triton inference engine is a popular option to do so, but it is lacking in some respects.

The Biggest Mistake in E-Commerce: More Data Means More Business Value

This is a guest post for Integrate.io written by Bill Inmon, an American computer scientist recognized as the "father of the data warehouse." Inmon wrote the first book and first magazine column about data warehousing, held the first conference about this topic, and was the first person to teach data warehousing classes.