Systems | Development | Analytics | API | Testing

January 2023

Automating reports: Picking the right tool saves time and reduces errors

As every data engineer and analyst can attest, generating reports is one of the most time-consuming and human error prone activities in the day-to-day life of data analysts. Luckily, with the development of technology, data reporting can now be done automatically, which saves you time and reduces mistakes. In this article, you will learn.

How New Lumada Industrial DataOps 5.1 Scales Industrial IoT Solutions

Data quality is fairly simple nomenclature to describe the state of the data being processed, analyzed, fed into AI, and more. But this modest little term belies an incredibly critical and complicated reality: that enterprises require the highest level of data quality possible in order to do everything from developing product and business strategies, and engaging with customers, to predicting the weather and finding the fastest delivery routes.

How-to configure SageMaker with GitHub Actions

GitHub Actions is a powerful continuous integration and continuous delivery (CI/CD) platform that allows developers to automate build, test, and deployment pipelines. Workflows automatically build and test code whenever an event occurs, such as a pull request or a deployment of merged pull requests to production. Best of all, you can use it without leaving the comfort of your own repository!

A Look At New Products Recently Introduced By Snowflake

In this episode of “Data Cloud Now,” Gautam Srinivasan, DCN's India Correspondent chats with Michael Kilberry, Senior Director and Industry Field CTO at Snowflake, about some of the newest product announcements from Snowflake, including Unistore, which brings analytical and transactional data together in a single platform. They also discuss some of the unique characteristics and data needs of businesses in India.

Overcoming 9 Data Governance Challenges

Data governance is the process of managing and protecting data throughout its lifecycle. It involves establishing policies, procedures, and standards for how data is collected, stored, used, and shared. This requires systems that are complex to be put in place by several stakeholders across the organization. Many organizations look at selecting the right software to implement a framework.

How Retailers Optimize Merchandising and Assortment Planning Strategies with the Snowflake Retail Data Cloud

The lingering effects of the global pandemic are merging with inflation to create a perfect storm for retailers looking to find the right inventory stature for the seasons ahead. Companies are getting squeezed between rising supply chain costs and falling consumer confidence. To succeed in this volatile market, McKinsey suggests that retailers “accelerate decision-making tenfold.”

Data Maturity Model: How to Move Up the Ladder

Many businesses find it hard to use data to make business decisions, even though data is becoming an increasingly valuable asset for driving business growth. The data maturity model can help you identify the gaps in your data strategy that are stopping you from reaching a high level of data maturity. In this article, you will learn.

NatWest Helps Customers Track Their Carbon Footprints with the Snowflake Data Cloud

With 19 million customers, NatWest is a household name in banking. We talked to David Charnley, the bank’s Head of Strategy and Transformation for Data and Analytics, to learn how Snowflake’s Data Cloud provides new insights into customer activities. NatWest collaborates across several areas of the business, including risk teams, retail teams, and ESG teams, to produce in-depth analysis of its customers’ carbon footprints. Using Snowflake, the bank combines risk data, customer identity data, transactions, and activity to paint a rich and detailed picture of how the actions customers take every day add to their emissions totals.

Streaming Data Analytics with SQL

Organizations that are trying to capture the value of their streaming data know that they need to transform that data into insights. Managing data in real time can be complex and expensive. SQL is an industry standard tool that gives business analysts the power to ask critical questions of your data without the need to hire additional specialized developers or translate business requirements into code.

Forget IT; Think Business Led Data Governance Initiative

A good data governance strategy should benefit all users of your organization’s data—not just those with technical responsibility for it. Recent years have seen the increasing importance of data as a strategic asset, as several companies have used it to unlock and create value. Increasingly, companies are turning to data governance programs as a foundational pillar of their data strategy (like data mesh) to improve their data sets’ quality, consistency, usability, and security.

Is Data Observability the new Anti-Virus?

We often find it hard to remember the world we left behind, but cast your mind back, say, 20 years, and we lived in a very different world. Personal Computers and the internet were on the rise, and businesses were all becoming connected. This provided companies with immense opportunities in terms of collaboration and digital adoption, and on the flip side, it eased the distribution of computer viruses. Today we barely even think about our antivirus software and policies.

Data Privacy is for Life, Not Just One Day a Year

This Saturday, January 28th sees Data Privacy Day come round again, an international effort to empower individuals and encourage businesses to respect privacy, safeguard personal data and enable trust. As always this should act as a reminder that every individual within an organization requires a basic understanding of their internal privacy rules and regulations.

Manufacturing Data Ingestion into Snowflake

Accessing data from the manufacturing shop floor is one of the key topics of interest with the majority of cloud platform vendors due to the pace of Industry 4.0 adoption. Industry 4.0, also known as the Fourth Industrial Revolution, refers to the emerging trend of technological transformation in manufacturing and related industries.

11 Predictions Data Experts Have for the Year Ahead

It’s 2023 and with the new year comes an opportunity to drive innovation, growth, and digital transformation with data in the face of ongoing economic turbulence. If Snowflake’s report, How to Win in Today’s Data Economy is any indication, data-driven organizations are poised to emerge as the winners of the year with 77% Data Economy Leaders, which is only 6% of those surveyed, experiencing annual revenue growth versus 36% of Data Economy Laggards, the lowest-performing survey group.

Snowflake Workloads Explained: Snowlake for Data Mesh

Snowflake’s cross-cloud platform enables domain teams to seamlessly collaborate and share data products across clouds and regions without copying or ETL. Domain teams can work with tools and languages of their choice, and scale resources independently with Snowflake’s elastic performance engine. With Snowflake, organizations can strike the right balance between domain ownership and governance standards.

Fast-Growing Startups Thrive On Snowflake

Startup companies who build their network infrastructure around Snowflake find they not only can be better at using data to deliver a superior customer experience, but they can easily scale their business when they start to achieve explosive growth. In this episode of “Data Cloud Now,” Gautam Srinivasan, Snowflake India Correspondent chats Ben Gotfredson, Head of Startup Programs at Snowflake, talks about all the ways Snowflake energizes startup companies, as well as the huge business opportunity for those who want to build an app directly on Snowflake.

Data Driven Marketing: Small Businesses' Ticket to the Top

According to the U.S. Small Business Administration’s Office of Advocacy, small businesses account for approximately 99.9% of all businesses. That’s a massive chunk of the U.S. economy. While they are high in number, the issues they face are relatively higher, too. Nearly 35% of small business owners report ‌that they aren’t generating any profits, with inflation being the biggest of their worries.

How to launch a modern analytics strategy

We’ve established that we’re living in the defining decade of data. Data underpins the seismic technology shifts of the past few years, transforming the way we buy, work, make business decisions, even value our companies. As ThoughtSpot’s co-founder Ajeet Singh said, “Once in a generation, the opportunities to create a legacy increase massively. It happens when truly tectonic shifts happen in the ecosystem. We’re living through one of those times.”

Continual + Hightouch: The AI for CX Upgrade

With inflation and other disruptive market dynamics massively impacting consumer behavior, is it any surprise that personalization tops the list of strategic actions for CMOs in 2023? Yep, people tend to stick around when digital products and experiences fulfill their personal needs quickly and accurately. And topping the list of powerful tools for personalization? Machine learning and AI, of course, from product recommendations to targeted offers based on digital customer and behavioral data.

Why Column-Aware Metadata Is Key to Automating Data Transformations

Data, data, data. It does seem we are not only surrounded by talk about data, but by the actual data itself. We are collecting data from every nook and cranny of the universe (literally!). IoT devices in every industry; geolocation information on our phones, watches, cars, and every other mobile device; every website or app we access—all are collecting data. In order to derive value from this avalanche of data, we have to get more agile when it comes to preparing the data for consumption.

Scania Uses Data Mesh and Snowflake's Data Cloud to Drive Transport Sustainability

Scania is at the forefront of a more autonomous, connected, electric future for the transportation industry. Find out why its Head of Data and Information Management uses data mesh—and Snowflake—to make it a reality. Scania is a global truck, bus, and industrial engine manufacturer and offers an extensive range of related services so its customers can focus on their core business.

Top 5 analytics and data engineer skills you should know in 2023

Analytics engineer is the latest role that combines the technical skills of a data engineer with the business knowledge of a data analyst. They are typically coding in SQL, building dbt data models, and automating data pipelines. You could say they own the steps between data ingestion and orchestration. Whether you are a seasoned analytics engineer or new to the field, it’s important to continually learn new things and improve the work you’ve already done.

Eckerson Report: Data Observability for Modern Digital Enterprises

This Eckerson Group report gives you a good understanding of how the Unravel platform addresses multiple categories of data observability—application/pipeline performance, cluster/platform performance, data quality, and, most significant, FinOps cost governance—with automation and AI-driven recommendations.

How to Get Data from Multiple Sources

Five things to know about how to get data from multiple sources: These days, organizations have more data at their fingertips than ever before and collect an incredible number of data sets from various sources. This creates a paradox for businesses such as e-commerce retailers struggling to deal with data complexity. With a deluge of information (and more arriving every day), how can you get data from multiple sources efficiently and unlock the hidden insights that it contains?

McKinsey Acquires Iguazio: Our Startup's Journey

8 years ago, when I founded Iguazio together with my co-founders Yaron Haviv, Yaron Segev & Orit Nissan-Messing, I never thought I would be making this announcement on our company blog: McKinsey acquired Iguazio! When we first embarked on this journey, we realized that while AI has the ability to transform any industry - from banking to retail to manufacturing - in reality most data science projects fail.

Top 7 Soft Skills Required in Data Teams for Project Success

Many organizations focus on the data engineering or development qualifications they require to connect specific data sources and manage data projects. But that is only half of what is needed. Soft skills are so important and sometimes overlooked. Soft skills support data management success because they help individuals effectively communicate and collaborate with others, understand and anticipate the needs of stakeholders, and make data-driven decisions.

Dynamic pricing strategy: 7 steps for successful implementation

From airline tickets going through the roof during holiday seasons to Uber and other ride-sharing services charging higher prices in rush hour, we have become accustomed to paying different prices for the same services. Traditionally, dynamic pricing was a tool reserved for industry giants like Amazon because of its implementation complexity and price tag.

Data Integration & Modeling: The Unsung Heroes of the Marketing Data Stack?

Marketing data integration is the process of combining marketing data from different sources to create a unified and consistent view. If you’re running marketing campaigns on multiple platforms—Facebook, Instagram, TikTok, email—you need marketing data integration. Why? Because being able to assimilate data from different channels and across multiple marketing touchpoints gives you visibility into the overall impact of a campaign, event, or another marketing effort.

The Top 7 ETL Events & Conferences 2022

There are few topics in the world right now that are hotter than data and its related fields. As technology, machine learning, and computer algorithms continue to expand, so does the way that companies can use this information to benefit their business. Extract, Transform, and Load (ETL) remains one of the most important processes in the area of big data. This area is absolutely booming, and so is the demand to learn more about its various processes and components.

Hollywood Creativity

I just got an email from a venture capitalist. For about the hundredth time, the venture capitalist told me they were anxious to invest money in us. The only qualification was that we needed to already have at least $10 million in sales. If we had $10 million in sales, we wouldn’t need to be talking with the venture capitalist. How stupid is that? I suggested to the venture capitalist that they go invest in IBM or ATT because they do have $10 million in sales.

Stitch vs. Datastream vs. Integrate.io: Pricing, Features and Reviews

Do you know where your data is? Most organizations store data in various destinations (in-house databases, SaaS locations, cloud-based apps, etc.), which makes running analytics far more complicated. Imagine pulling data from all these destinations into one data warehouse or data lake. Life would be so much easier... "But doesn't this require a lot of code?" you may ask. Not necessarily.

Snowflake Workloads Explained: Applications

Snowflake’s platform powers applications with virtually unlimited performance, concurrency, and scale. Delivered as a service, Snowflake handles the infrastructure complexity, so you can focus on innovating. With Snowflake for Applications, enjoy near-unlimited scalability and concurrency and accelerated time-to-market with no SRE/DevOps burden.

Driving Data, Delivering Value: Data Leaders to Watch in 2023

The Chief Data Officer is arguably one of the most important roles at a company, particularly those that aspire to be data-driven. CDO appointments and the elevation of data leaders have accelerated in recent years, and the role has morphed as perceptions of data have evolved. Responsibilities span strategy and execution, people and processes, and the technology needed to deliver on the promise of data.

Best data modeling methods for data and analytics engineers

Recently, I published a blog on whether self-service BI is attainable, and spoiler alert: it certainly is. Of course, anything of value usually does require a bit of planning, collaboration, and effort. After the article was published, I began having conversations with technical leaders, analysts, and analytics engineers, and the topic of data modeling for self-service analytics came up repeatedly.

Top 6 Python ETL Tools for 2023

Extract, transform, load (ETL) is a critical component of data warehousing, as it enables efficient data transfer between systems. In the current scenario, Python is considered the most popular language for ETL. There are numerous Python-based ETL tools available in the market, which can be used to define data warehouse workflows. However, choosing the right ETL tool or your needs can be a daunting task.

Fivetran vs. Matillion vs. Integrate.io: A Comprehensive Comparison

In today's increasingly digital world, businesses of all sizes rely on data to make informed decisions and drive growth. This is why more and more organizations have started using data warehouse platforms. These crucial tools help businesses store, manage, and analyze data in one central location. In addition, a data warehouse platform makes accessing and processing large amounts of data easier, enabling businesses to gain valuable insights and improve their operations.

How to Optimize Huggingface Models for Production

Deploying models is becoming easier every day, especially thanks to excellent tutorials like Transformers-Deploy. It talks about how to convert and optimize a Hugging face model and deploy it on the Nvidia Triton inference server. Nvidia Triton is an exceptionally fast and solid tool and should be very high on the list when searching for ways to deploy a model. If you haven’t read the blogpost yet, do it now first, I will be referencing it quite a bit in this blogpost.

The Growing Need for Advanced Analytics to Fuel 5G and Edge Solutions

Organizations have been focused on enhancing customer experiences to enable quicker responses to services and to provide localized behavior for many years now. However, with the Internet of Things (IoT), Smart Cities, Gaming technologies and Self-Driving Cars going more mainstream, there is an even greater need for organizations to react faster to customer behavior and bring solutions closer to the customers.

Questions around Transparency in AI models with Tom Davenport

Often the question around bias is raised whenever the conversation turns to AI. Tom Davenport, author of “Working with AI: Real Stories of Human-Machine Collaboration” points out that bias is not limited to AI, but also finds root in many human decision-makers as well. Actually, according to Tom, the bigger threat is ignoring that working with AI is going to increasingly be a part of our human work experience.

Data Mesh and other Alternatives for Data Chiefs in 2023

Title: Data Mesh and other Alternatives for Data Chiefs in 2023 Description: The data world exploded in 2022 with a heated debate around data mesh. We had to talk to Tony Baer of DBinsights to get a better understanding of his perspective and criticism of data mesh. Most importantly, we needed to know what it is he recommends we use instead!

Understanding The Risks and Rewards of Data Observability

Data observability is the ability to monitor and understand the data that flows through an organization's systems. Organizations can monitor their data in real-time, detect anomalies, and take corrective action based on alerts. Organizations use data observability to collect, analyze, and visualize data from various sources to manage their system's behaviour across the data ecosystem.

A Simplified Guide to Cloud Data Platform Architecture

Since the 2006 launch of Amazon Web Services (AWS), the world’s first hyper-scale public cloud provider, thousands of data-driven businesses have shifted on-premise data storage and analytics workloads into the cloud by architecting or adopting a cloud data platform. As the volume, variety, and velocity of enterprise data continues to grow in 2023, cloud data platforms with legacy tech and complex architectures are becoming increasingly time-consuming and costly to manage.

Top 3 data visualizations for finance professionals

Data plays a profound role in finance. In fact, some might argue that finance professionals are some of the most data-driven individuals in an organization. That’s because finance data, and the insights you draw from it, can literally make or break a company. This is especially true in times of economic uncertainty, when businesses are trying to make data-driven decisions about where to invest and cut resource allocation.

How to practice responsible AI - Scott Zoldi

This episode features an interview with Scott Zoldi. He is the Chief Analytics Officer at FICO where he is responsible for the analytic development of FICO’s product and technology solutions. Scott is involved in developing new analytic products and applications, and has authored more than 100 patents. His current focus is on self-learning analytics to detect cyber security attacks. On this episode, Scott talks about how to attract and retain world-class data scientists, the importance of following a model governance process, and responsible AI.

Changing Consumer Behavior Transforms The Retail & CPG Industries

It’s no secret that consumer behavior and expectations changed dramatically during the two years of the COVID pandemic, changes that have disrupted supply chains and raised the marketing and customer service bar for retailers. In this episode of “Data Cloud Now,” host Gautam Srinivasan chats with Freddy Guard, Snowflake’s Head of Retail & CPG for North America, about those changes and how it has accelerated the demand for cloud-based, data-driven solutions to meet the challenges those changes have engendered.

7 Easy Steps to Building an Actionable Data Strategy

With a clear framework, best practices, and case studies. Modern enterprises are struggling with an overabundance of raw data and underutilization of data assets for achieving business objectives. The right data strategy helps you unlock the hidden potential from the stored-but-seldom-used enterprise data. In this article, you will learn.

5 Data Management Trends For 2023

Every year analysts, vendors, thought leaders, and everyone in between like to surmise the upcoming trends for the year. I am going to do something a little different this year. I am discussing some trends, just like everyone else, but basing them on what we are seeing with customers and how they are succeeding with the Integrate.io platform. Not just succeeding, but levering complex and diverse data sets to enable better business decisions and support growth.

Data Warehouse Best Practices: 6 Factors to Consider in 2023

Data warehousing is the process of collating data from multiple sources in an organization and store it in one place for further analysis, reporting and business decision making. Typically, organizations will have a transactional database that contains information on all day to day activities. Organizations will also have other data sources – third party or internal operations related. Data from all these sources are collated and stored in a data warehouse through an ELT or ETL process.

7 Best Data Analysis Tools

Five things to know about this topic: Just about every process used within a business generates some form of data. While some may see this information as useless, data analysis tools can turn it into a resource that helps your brand make better decisions in every aspect of its operations. Not all analytical tools are equal. However, the ones on this list can help you generate incredible insights that result in better decision-making.

Panel recap: What Is DataOps observability?

Data teams and their business-side colleagues now expect—and need—more from their observability solutions than ever before. Modern data stacks create new challenges for performance, reliability, data quality, and, increasingly, cost. And the challenges faced by operations engineers are going to be different from those for data analysts, which are different from those people on the business side care about. That’s where DataOps observability comes in.

Analyzing Your Call Center Data with Drill-Down Processing

A recent study on call center statistics found that 91% of consumers reported poor customer service in 2021. Providing high-quality service is essential, especially today, to retain customers and drive more business. Quality service is only one important metric in running a profitable call center. No matter your goal, the first step is understanding what's going on in your call center.

How To Build A High-quality BI Dashboard With The Best Software Test Manager

Business intelligence (BI) involves converting data into valuable insights using software and services. It influences a company’s strategic and tactical business preferences. BI tools access and analyze data sets and show analytical results in the form of reports, graphs & charts, dashboards, summaries, and maps. This process provides in-depth insight into the situation of the business.

Discover the Advantages of Having Global Views from Angles for Oracle

The move to the cloud continues at a fast pace and if your organization embraces the future of operational reporting, then you need a plan to ensure consistent enterprise-wide reporting during your cloud journey. A top challenge of cloud migration is the need to produce consolidated reporting and analytics that cover all your Oracle ERP instances.

Cloud Object Storage-based Architectures are Natively Scalable and Available

There is a long history of clustering architectures with respect to building distributed databases for two primary reasons. The first is scalability. If a cluster of nodes has reached its capacity to perform work, adding additional nodes are introduced to handle the increased load. The second is availability. The ability to ensure that if a node fails, let’s say during ingestion and/or querying, remaining nodes would continue to execute due to state replication.

How CommonLit Saves 22 Days of Engineering Resources a Year with Integrate.io

CommonLit implements Integrate.io’s data replication solution, which replicates millions of rows a month with zero issues. Industry-leading tool replicates data quickly and consistently; predictable pricing makes it easy to manage team budgets, and white-glove support ensures zero outages or problems.

Being a Steward of Data and Insights - Robert Brown

This episode features an interview with Robert Brown, the Senior Director of Research for the Venture Forward Initiative at GoDaddy. This is his 13th year at GoDaddy, having started as Director of Database Marketing. Prior to GoDaddy, Robert served as Director of Pulte Homes for 9 years. On this episode, Robert talks about tiering data for smarter decisioning, developing intrinsic motivation in employees, and being a successful steward of data and insights.

Data Observability: 7 Trends to Watch in 2023

As organizations look to scale up and improve the business value of their growing data volumes, certain data trends have garnered the attention of data and business professionals alike. With this growth promising to continue in the upcoming year, data leaders are looking to implement tools to enrich their organization’s data like never before. Here are seven trends you can watch for in the new year.

Snowflake Announces Intent to Acquire Myst

Snowflake customers leverage the Data Cloud to bring all their data together and capitalize on the near-infinite resources of the cloud. But how can this data be used to look ahead? How can we use yesterday’s evidence to plan for tomorrow? The answer—time series forecasting. Time series forecasting is one of the most applied data science techniques in business. It is used extensively in supply chain management, inventory planning, and finance.

Top 3 Data and Analytics Trends to Prepare for in 2023

2022 was without a doubt a landmark year for business intelligence (BI) and analytics. With continued development of innovative and sophisticated technologies such as contextual analytics, analysis of our business data is more accessible than ever before. Many of these same trends will continue to grow into 2023, but the data analytics space is ever evolving.

The Best Data Modeling Tools: Advice & Comparison

Do you know how much data your company stores? Do you know the types of data being utilized for any given purpose? Can you picture how data flows from one system to another? The goal of data modeling is to help you understand aspects like these. By giving you a visual representation of data within your systems, data modeling tools help you better store, manage, and utilize your data by optimizing the underlying architecture.

How Fivetran Ensures That Data Moves Reliably Through Data Pipelines

Fivetran, the provider of connectors that feed data into data pipelines, has had a long-standing, symbiotic relationship with Snowflake. In this episode of “Data Cloud Now,” Gautam Srinivasan, Snowflake India Correspondent chats with TJ Chandler, Managing Director for the APAC Region at Fivetran, about that relationship and about Fivetran’s mission “to make access to data as simple and reliable as electricity.”

Fivetran Alternatives - Keboola is what you are looking for in 2023

When researching your next ETL and ELT tool, you should consider Keboola as one of the best Fivetran alternatives. In this blog article, we’re going to compare Keboola and Fivetran side-by-side and show you how Keboola can simplify your data operations. We’re going to evaluate both tools based on these critical product features: Here is a quick breakdown summary of the comparison between Keboola and Fivetran: ‍

Merging Data Literacy With Data Pipeline Success

In general, the concepts of data literacy and creating successful data pipelines seem totally disconnected. Data literacy involves insuring that data consumers have the knowledge and capabilities to understand and interact with data in a way that will provide them with the answers and value they need to do their jobs and benefit their organizations. While data pipelines require technical expertise to move, connect, and store data across the company's data ecosystem.

Happy New Year from Yellowfin: Our 2023 Commitments

Happy New Year from the Yellowfin team, and welcome to our 2023 wrap-up! Following a year full of product feature updates, company changes and new initiatives, this blog provides a helpful summary for all our customers and followers on our future 2023 product roadmap for the Yellowfin embedded analytics suite, and a look back at last year’s biggest news.