
Real-Time Streaming for Data Science

First, we collect data from an existing Kafka stream into an Iguazio time series table. Next, we visualize the stream with a Grafana dashboard. Finally, we access the data in a Jupyter notebook using Python code. To collect the data, we use a Nuclio serverless function that “listens” to the Kafka stream and ingests its events into the time series table. Iguazio gets you started with a template for Kafka to time series.
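
As a rough illustration of the ingestion step, a Nuclio Python function exposes a handler(context, event) entry point that is called once per Kafka message. The sketch below is not Iguazio's template. It assumes each message is a JSON object with timestamp, metric, and value fields (hypothetical names), and it replaces the actual write to the time series table with a placeholder list, PENDING_SAMPLES, since the real write call depends on the platform client you use.

```python
import json
from datetime import datetime, timezone

# Placeholder sink (assumption): stands in for the real write to the
# time series table, which would go through the platform's client library.
PENDING_SAMPLES = []


def parse_event(body):
    """Turn a raw Kafka message body into one time series sample.

    Assumes a JSON payload with hypothetical fields:
    "timestamp" (epoch seconds), "metric" (name), "value" (number).
    """
    if isinstance(body, (bytes, bytearray)):
        body = body.decode("utf-8")
    if isinstance(body, str):
        body = json.loads(body)
    return {
        "time": datetime.fromtimestamp(body["timestamp"], tz=timezone.utc),
        "metric": body["metric"],
        "value": float(body["value"]),
    }


def handler(context, event):
    """Nuclio entry point: invoked for each event read from the stream."""
    sample = parse_event(event.body)
    PENDING_SAMPLES.append(sample)  # replace with the real table write
    context.logger.info("ingested sample for metric " + sample["metric"])
    return "ok"
```

Keeping the parsing in its own function means the payload handling can be checked locally, without deploying the function or connecting to a Kafka broker.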

Customer Profitability Analysis in E-Commerce

Five things to know about customer profitability analysis: Digital retailers often talk a lot about 'profit' without ever determining the factors that drive profitability in their businesses. One of the biggest contributors to profit in e-commerce is existing and new customers who purchase products and services from online stores. However, the connection between customers and profitability can be unclear unless you carry out the right kind of analysis.

What's New in Amazon EMR Unveiled at DataOps Unleashed 2022

At the DataOps Unleashed 2022 virtual conference, AWS Principal Solutions Architect Angelo Carvalho presented “How AWS & Unravel help customers modernize their Big Data workloads with Amazon EMR.” The full session recording is available on demand, but here are some of the highlights.

MLOps in BigQuery ML with Vertex AI Model Registry

Without a central place to manage models, those responsible for operationalizing ML models have no way of knowing the overall status of trained models and data. This lack of manageability can impact the review and release process of models into production, which often requires offline reviews with many stakeholders.

How Mercado Libre Builds Upon a Continuous Intelligence Ecosystem with BigQuery and Looker

At Mercado Libre, we are obsessed with unlocking the power and potential of data. One of our key cultural principles is to have a Beta Mindset. This means that we operate in a “state of beta”, constantly asking new questions of our data, experimenting with technologies and iterating our business operations in service of creating the best experiences for our customers.

At Covanta, data health improves the business and the planet

At Talend, we tend to describe poorly organized, unhealthy data as “digital landfills.” But we don’t often talk about actual landfills. That’s right, the ones filled with trash. As anyone watching real estate prices will know, land is a finite resource. It’s crazy to think that we’re still dedicating land to storing our garbage, where it will sit releasing pollutants and greenhouse gases for decades to come.

Assessing the Validity and Relevance of Data To Discover True, Actionable Information and Insights

In a previous article, we talked about the lost art of questioning and its importance when working with data and information to find actionable insights. In this article, we expand on this topic and explain how questioning differs depending on which stage you have reached in the process of transforming data and information into insights.

Now in preview, BigQuery search features provide a simple way to pinpoint unique elements in data of any size

Today, we are excited to announce the public preview of search indexes and related SQL SEARCH functions in BigQuery. This is a new capability in BigQuery that allows you to use standard BigQuery SQL to easily find unique data elements buried in unstructured text and semi-structured JSON, without having to know the table schemas in advance. By making row lookups in BigQuery efficient, you now have a powerful columnar store and text search in a single data platform.

Talend acquires Gamma Soft

April 7, 2022: Talend, a global leader in data integration and management, announced today that it has acquired Gamma Soft, a market innovator in change data capture (CDC). The addition of Gamma Soft’s highly complementary, enterprise-class change data capture technologies will help customers streamline their data modernization initiatives, including cloud migrations, and support advanced, real-time analytics use cases across hybrid and multi-cloud environments.

Data Chief Live: External data: Your secret weapon in a cookie-less world

How do you get to know your customer in a cookie-less world? Join Rosemary Hua, Global Head of Retail & CPG GTM at Snowflake and Forbes 30 Under 30; Erik Mitchell, founder and principal at Seek Data; Nik Lampropoulos, Global Director of Data, Insights & Analytics at Hogarth Worldwide; and Cindi Howson, ThoughtSpot CDSO, as they discuss questions like.