Systems | Development | Analytics | API | Testing

July 2021

CDC in Salesforce and How to Export Attachments

Salesforce is one of the most popular customer relationship management (CRM) solutions. A lot of companies use it for every aspect of their business. In fact, Salesforce bills itself as a Customer 360 platform. If you’re familiar with Salesforce, you know there are a lot of tools. The right choices allow you to get the most actionable use of the wealth of data from the platform. We’ve talked before about how to create a Salesforce ETL pipeline.

The Dark Truth Behind Session Recording

A company is entitled to use session recording or session replaying as long as their marketing and analytics needs require so. However, as enticing the recording of everything the user does at all times can be, even within the existing regulations, there is a high chance that doing so will quickly push the data towards a non-compliant realm. And even in cases where regulations may not be explicit on the matter, we see more and more how the industry is leaning towards discouraging these practices.

Free Cycle Time Readout for GitLab Users

To celebrate GitLab’s latest release and GitLab Commit 2021 we offer free cycle time readout of all of your personal and company projects hosted on GitLab.com for a limited time. Use this link and your GitLab credentials to sign into Logilica Insights and we let you know about your software teams cycle time, delivery velocity and much more.

BigQuery Admin reference guide: Query processing

BigQuery is capable of some truly impressive feats, be it scanning billions of rows based on a regular expression, joining large tables, or completing complex ETL tasks with just a SQL query. One advantage of BigQuery (and SQL in general), is it’s declarative nature. Your SQL indicates your requirements, but the system is responsible for figuring out how to satisfy that request. However, this approach also has its flaws - namely the problem of understanding intent.

Why you need metadata management and how to approach it

As your data operations evolve, they become messier. Diverse data sources and data models at their sources, multiple movements of data throughout your platform, and cobbled-up infrastructure, which has grown in complexity through every deployment have made it hard to identify, trace, classify, and understand your data assets. This can be as simple as an analyst spending hours trying to figure out where a data attribute in a table came from and whether it is trustworthy.

ThoughtSpot Success Series #9 - Group Design, Privileges & Sharing

Introducing the ThoughtSpot Success Series! Want to expand your knowledge of ThoughtSpot? Want to learn some great tips and tricks? Join ThoughtSpot's Customer Success team and other users like yourself as we discuss various topics in our new Success Series. During this event, we’ll review best practices to implement when creating groups and assigning users to make privilege management easier. In this 1-hr event, you'll learn:

Data Onboarding: What You Need to Know

Getting your customer's data on the platform quickly and effectively is crucial for any business. How well you onboard new data will affect your success and your customer's experience. Effective onboarding affects so many aspects of a company's success that it's necessary to take a detailed look into all aspects of the process. The following is everything you need to know about data onboarding.

Four Questions To Accelerate Edge-to-Cloud AI Strategy Development

“More than 15 billion IoT devices will connect to the enterprise infrastructure by 2029.” Finding data is not going to be a challenge, clearly, but taking advantage of it all to drive business outcomes will be. Combining AI and machine learning (ML) with data collection and processing capabilities of the edge and the cloud may hold the answer.

Massive Data Transformation With Pepsi | Rise of The Data Cloud Podcast

Interested in how to transfer e-commerce to the cloud, what data transformation looks like on a massive scale, and how to increase your ROI? In this episode, Vaibhav Kulkarni, Head of Data Products & Infrastructure Engineering at PepsiCo, talks about these topics and more. We hope you enjoy! Connect with Vaibhav Kulkarni

Design considerations for SAP data modeling in BigQuery

Over the past few years, many organizations have experienced the benefits of migrating their SAP solutions to Google Cloud. But this migration can do more than reduce IT maintenance costs and make data more secure. By leveraging BigQuery, SAP customers can complement their SAP investments and gain fresh insights by consolidating enterprise data and easily extending it with powerful datasets and machine learning from Google.

What is Change Data Capture in SQL Server?

For more than three decades, Microsoft SQL Server has helped countless organizations store and manage their enterprise data, and it’s still one of the most widely used software applications on the planet. According to the DB-Engines database ranking, SQL Server remains the third most popular database management system, just behind Oracle and MySQL. Change data capture (CDC) is essential functionality for many businesses, especially those with real-time ETL use cases.

Using AI/ML to Increase Gaming Monetization

Gamers are not shy about reaching into their wallets for premium content and features. They also won’t hesitate to tap the uninstall button at the first sign of trouble. It’s not uncommon for a gamer to boot up a hotly anticipated new game or revisit an old favorite only to put it down days or weeks later. The culprit is often gaming monetization issues that get in the way of what would otherwise be a long-term rewarding gaming experience.

Choosing an ERP: 5 Reasons Your Company Needs NetSuite

Enterprise Resource Management (ERP) software. Business professionals have heard of it, but they may not understand how it can help their business. In short, an ERP gathers data from multiple departments, including accounting, human resources, sales and marketing, and inventory, into a central database. This database then allows members of management and other key employees to analyze their current processes, spot workflow weaknesses, and improve them.

The 6 Soft Skills Data Engineers Need to Succeed

Soft skills can be almost as important as data engineering skills when you apply for a job. Soft skills can make the difference between stress and efficiency or being unsatisfied with your position and a raise. When data engineers and data scientists earn bachelor’s degrees, they usually take classes in topics like data warehousing, programming languages, machine learning, and data science.

Five Strategies to Accelerate Data Product Development

With this first article of the two-part series on data product strategies, I am presenting some of the emerging themes in data product development and how they inform the prerequisites and foundational capabilities of an Enterprise data platform that would serve as the backbone for developing successful data product strategies.

[MLOps] The Clear SHOW - S02E13 - mlops_this: Copilot Shenanigans

Ariel should have known better than to mess with shitposts on mlops.community ;) Here is a ClearML pipeline integrated with the notorious mlops_this generated by GitHub's Copilot. ClearML is the only open-source tool to manage all your MLOps in a unified and robust platform providing collaborative experiment management, powerful orchestration, easy-to-build data stores, and one-click model deployment.

How to ensure data integrity with analytics testing

Data collections and analysis is critical to ongoing business operations, but maintaining data integrity is an often overlooked problem. Ensuring data integrity is not only a consumer trust issue, but is often also mandated by legal regulations. Without accurate data, business leaders could make decisions that are slightly (or majorly) misguided.

Demystifying a Heroku Salesforce Connector

There's been some mystery associated with the Heroku Salesforce connect that links the Salesforce CRM with the Heroku Postgres database service. This connection is an add-on called Heroku Connect that enables multiple processes and transfers with no coding required and simple point and click executions. Heroku Connect offers bi-directional data synchronization while also creating secure data transfers. This connector also allows a user the freedom to customize and streamline their data transfer processes.

How Military Data Innovation Drives Higher and Lower Risk Tolerance

“Data acumen” is a powerful new term. I heard it used recently in relation to a historically hard problem for the Department of Defense (DoD). The problem is the speed of data change. David Spirk, DoD’s Chief Data Officer, gave an historic keynote at GovCon Wire’s Data Innovation Forum, which was held in June 2021. He was joined by three key leaders in DoD data: Thomas Sasala (Navy), Eileen Vidrine (Air Force) and David Markowitz (Army).

7 Steps to Operationalize Your Data Warehouse

Organizations may struggle with getting the full value of their data without knowing it. Their data science team uses data warehouses to power business intelligence solutions to create reports, dashboards, and other data visualizations. However, the time it takes for this information to reach teams makes it difficult to use for daily decision-making. Operationalizing data warehouses sends these insights directly into daily operations systems, thereby allowing for immediate access.

Beginner's Guide to Cloudera Operational Database

My name is Shanmukha Kota and I am a recent graduate from University at Buffalo. I interned with Cloudera last summer and joined Cloudera as a software engineer a couple of weeks ago and this is my first experience with CDP and CDP Operational Database. For a new hire college graduate in the industry with only academic experience with HBase, I can only say it is very simple and easy to set up and work with CDP Operational Database.

Extending the power of Chronicle with BigQuery and Looker

Chronicle, Google Cloud’s security analytics platform, is built on Google’s infrastructure to help security teams run security operations at unprecedented speed and scale. Today, we’re excited to announce that we’re bringing more industry-leading Google technology to security teams by integrating Chronicle with Looker and BigQuery.

5 Real-time Streaming Platforms for Big Data

Real-time analytics can keep you up-to-date on what’s happening right now, such as how many people are currently reading your new blog post and whether someone just liked your latest Facebook status. For most use cases, real-time is a nice-to-have feature that won’t provide any crucial insights. However, sometimes real-time is a must. Let’s say that you run a big ad agency.

Understanding Data-Driven CPQ

Most companies offering any kind of service or product answer this question from consumers or potential clients all the time: "How much does it cost?" Or, the much harder question: "How much will it cost if I choose these services with these extras for my particular company/house/yard/situation, etc.?" The tough part is that pricing services or software usually involves too many variables.

How to do data transformation in your ETL process?

Working with raw or unprocessed data often leads to poor decision-making. This explains why data scientists, engineers, and other analytic professionals spend over 80% of their time finding, cleaning, and organizing data. Accordingly, the ETL process - the foundation of all data pipelines - devotes an entire section to T, transformations: the act of cleaning, molding, and reshaping data into a valuable format.

Crux chose BigQuery for rock-solid, cost-effective data delivery

At Crux Informatics, our mission is to get data flowing by removing obstacles in the delivery and ingestion of data at scale. We want to remove any friction across the data supply chain that stops companies from getting the most value out of data, so they can make smarter business decisions. But as you may know, if you’re in the business of data, this industry never stands still. It’s constantly evolving and changing.

What is the Difference Between FTP and SFTP?

The ETL (extract, transform, load) process depends on quickly, efficiently, and securely transferring information between sources and targets. However, there are multiple options for data transfer protocols, including FTP and its close relative SFTP. So what’s the difference between FTP and SFTP, and how can you decide which one to use for your enterprise data? We have all the answers below.

[MLOps] The Clear SHOW - S02E12 - Goodbye Fig .1 [Sculley15]

Sometimes, even in a field as young and bustling, one has to say goodbye to an old friend. Today we bid adieu to Fig. 1 of D. Sculley et al., AKA "Hidden technical debt in Machine learning systems." Listen to Ariel Biller explaining what's going on and what are we going to use in lieu of Fig. 1

Transforming supply chain and logistics analytics at Avnet with ThoughtSpot and Azure Synapse

Supply chain and logistics operations can be a company's biggest source of financial risk or competitive advantage. The key is reconciling external supplier data like tariff and shipping information with internal data to deliver insights across teams and geographies.

20 Ways to Increase Your Revenue with Netsuite

NetSuite is a powerful Enterprise Resource Planning (ERP) platform that gives organizations a way to use one platform for countless essential business operations. However, it also offers a goldmine of revenue-increasing capabilities. These 20 methods for increasing revenue reveals the true potential of using NetSuite for critical business processes.

SFTP to Salesforce - Guide to a Secure Integration

Organizations have numerous departments and employees of all skill sets using the same CRM. When it comes to Salesforce, transferring data and files from an external location can be complicated, especially when dealing with confidential information. A great solution to securing the data transfer is to use an SFTP. Here is a step-by-step guide on how to integrate your systems from SFTP to Salesforce.

Accelerate Offloading to Cloudera Data Warehouse (CDW) with Procedural SQL Support

Did you know Cloudera customers, such as SMG and Geisinger, offloaded their legacy DW environment to Cloudera Data Warehouse (CDW) to take advantage of CDW’s modern architecture and best-in-class performance? In addition to substantial cost savings upon moving to CDW, Geisinger is also able to search through hundreds of million patient note records in seconds providing better treatment to their patients.

Widen Your Focus to Drive Better Business Decisions

Data and technology are often hailed as a magic ingredient that can help solve so many problems. But, as I discussed with Joe DosSantos on Data Brilliant – it’s just one piece of the puzzle. In order to truly navigate uncertainty, the key is widening your focus and opening up your information sources. It comes down to balancing breadth of perspective with depth of expertise.

A Reference Architecture for the Cloudera Private Cloud Base Data Platform

The release of Cloudera Data Platform (CDP) Private Cloud Base edition provides customers with a next generation hybrid cloud architecture. This blog post provides an overview of best practice for the design and deployment of clusters incorporating hardware and operating system configuration, along with guidance for networking and security as well as integration with existing enterprise infrastructure.

Optimizing Risk and Exposure Management - Roundtable Highlights

We recently hosted a roundtable focused on optimizing risk and exposure management with data insights. For financial institutions and insurers, risk and exposure management has always been a fundamental tenet of the business. Now, risk management has become exponentially complicated in multiple dimensions. In this session we explored what firms are doing to approach the uncertainty with more predictability.

CNC: The journey from Excel spreadsheets to automated data pipelines and fast, reliable insights

Founded in 1991, CNC (Czech News Center) is one of the largest media companies in the Czech Republic. They offer dozens of print and online publications to the Czech market, including Blesk, Aha!, and E15. A commitment to journalistic integrity has enabled their growth, now reaching millions of readers. They are currently undergoing a vast digitalization process with the aim to become the fastest-growing and largest media house in the Czech Republic.

Future of Data Meetup: Hello, Kafka! (An Introduction to Apache Kafka)

Our “Hello, “ series of introductory “Big Data” topic-focused meetups returns to Boston in July as we deliver our fifth event. This meetup will introduce you to Apache Kafka without assuming you’ve heard anything about the Apache development project, the problems that Kafka was designed to solve or the role it currently plays in modern enterprise data architectures.

Understanding jobs & the reservation model in BigQuery

What are jobs in BigQuery and how does the reservation model work? In this episode of BigQuery Spotlight, we’ll review jobs, reservations, and best practices for managing workload in BigQuery. We’ll also walk you through the difference between BI Engine reservations and standard reservations, so you can decide what will work best for you.

ThoughtSpot Success Series #8 - Row Level Security Design Patterns

Join ThoughtSpot's Customer Success team & other users as we discuss various topics in our new Success Series. During this event, we discussed a high-level overview of Row Level Security Design Patterns and some common scenarios it solves. In this 30-minute event, you'll learn:

Why Does My Business Need to Transform Data?

Fivetran pipelines reliably load your data to your chosen destination, but then what? Without joining, filtering, and aggregating your data, your business can’t produce data models to answer critical business decisions. This is why data transformations are essential to every business looking to maximize value from the data they collect from disparate sources.

The Future of the Modern Data Stack

The Modern Data Stack is quickly picking up steam in tech circles as the go-to cloud data architecture, and although its popularity has been quickly rising, it can be ambiguously defined at times. In this blog post we’ll discuss what it is, how it came to be, and where we see it going in the future. Regardless of whether you’re new to the modern data stack or have been an early adopter, there should be something of interest for everyone.

When to Use Change Data Capture

Automated ETL (extract, transform, load) and data integration workflows are essential for the modern data-driven organization, and they can swiftly and efficiently migrate data from sources to a target data warehouse or data lake. But ETL must run at regular intervals — or even in real-time — so how can you know which information is fresh and which information you’ve already ingested? Solving this problem is the goal of change data capture (CDC) techniques.

Why CFOs Should Champion the Consumption Business Model

Traditionally, CFOs focus on cost and cost control when managing the financial actions of a company. “Value” is assessed by putting a quantifiable number behind everything. It’s time to shift that mindset and take into account a different kind of value—the kind you can derive from an investment. Rather than worrying solely about the bottom line, ask the question: If you spend more today, what will you end up creating tomorrow?

Demo Jam Live: Perform Flink stream processing and analytics using SQL

Is your business looking for a simpler way to access digital information faster? Do you know your developer and analytics teams, who have SQL skills, can now easily create streaming analytics for your business needs? This new demo jam webinar will showcase Cloudera Streaming Analytics with SQL Stream Builder and demonstrate how easy it is to create streaming queries using Apache Flink. Just like the previous session, this will be a no-slide, highly interactive demo-only session where you get to choose what you want to see based on live polling. This session is led by Kenny Gorman, Product Owner of Streaming Processing and Erik Beebe, Principal Stream Processing Engineer.

The Wonderful World of Data Governance with Disney Streaming's Anita Lynch | Rise of The Data Cloud

In this episode, Anita Lynch, Vice President of Data Governance at Disney Streaming, talks about the importance of first-party data, the nuances of data governance and privacy, how to prioritize data, and much more. Connect with Anita Lynch

EDI Integration & Why It's Important to Your Business

In the age of digital transformation, EDI (Electronic Data Exchange) technology is used to create new business information flows that never existed on paper. As the technology grows, companies need to incorporate more than traditional EDI standards. There are country-specific laws (HIPAA, GDPR, etc.), data collection and protection guidelines, privacy laws, business intelligence guides, and more.

5 Reasons Why Row-Level Security is Wrong for Your Data Warehouse

Extracting your data from multiple sources and collecting it in a centralized data warehouse is hard enough—but how do you keep it safe once it arrives? Organizations use a variety of data security methods to protect sensitive and confidential information, including share-level security (all users sharing a password) and user-level security (with separate accounts and passwords for each user).

Integrating Data to Build Emotional Health: How SU Queensland Uses Talend to Enrich Service Delivery

The mission statement is so direct and uncomplicated. SU Queensland, a non-profit organization based in Australia, is all about “bringing hope to a young generation.” The realities of delivering on this charter, of course, are multi-dimensional and complex. SU Queensland provides a wide range of services to schools, churches, and community groups across Australia, including youth camps, school chaplains, community engagement programs, and training and support for youth workers.

Operationalizing Machine Learning for the Automotive Future

It’s no secret that global mobility ecosystems are changing rapidly. Like so many other industries, automakers are experiencing massive technology-driven shifts. The automobile itself drove radical societal changes in the 20th century, and current technological shifts are again quickly restructuring the way we think about transportation. The rapid progress in AI/ML has propelled the emergence of new mobility application scenarios that were unthinkable just a few years ago.

Delivering Modern Enterprise Data Engineering with Cloudera Data Engineering on Azure

After the launch of CDP Data Engineering (CDE) on AWS a few months ago, we are thrilled to announce that CDE, the only cloud-native service purpose built for enterprise data engineers, is now available on Microsoft Azure. CDP Data Engineering offers an all-inclusive toolset that enables data pipeline orchestration, automation, advanced monitoring, visual profiling, and a comprehensive management toolset for streamlining ETL processes and making complex data actionable across your analytic teams.

A CDO's Field Guide to Finding Value in Data

A proverb from the Democratic Republic of the Congo says, “A good chief is like a forest: Everyone can go there and get something.” And, a Chief Data Officer is no exception. According to Forrester Research published in January 2021, data leaders today face a broad mandate as the role has expanded over the years. In the early years, CDOs mostly reported to technology leaders.

[MLOps] The Clear SHOW - S02E11 - DIY Strikes Back! Building the Model Store!

Ariel extends ClearML's "experiment first" approach towards a "model first" approach - by building a model store. See how easy it is to add metadata to the model artifacts. + Colab notebook (uses the demo server, just run it and see what happens) ClearML is the only open-source tool to manage all your MLOps in a unified and robust platform providing collaborative experiment management, powerful orchestration, easy-to-build data stores, and one-click model deployment.

Why rebuilding data trust is key to driving business value with analytics

As a modern data leader, you know that real-time access to data-driven insights is key to driving higher levels of business growth and innovation, and better customer experiences. You also know that when frontline employees have easier access to data they’re able to make better decisions that ultimately boost your bottom line. But what happens when employees don’t trust the data in front of them?

5 data-driven solutions for global supply chains disruptions

In 2020, the pandemic tested supply chains in a manner few have seen in our lifetimes, with businesses like Apple struggling to predict demand and keep factory lines moving. The weaknesses exposed by this crisis are not brand new, but they should be a wake-up call that current strategies are not sustainable. The limitations of modern supply chains were becoming apparent last year when companies struggled to react to new tariffs and restrictions caused by Brexit and the U.S.-China trade war.

Choose the Right ERP Software for Business

Enterprise resource planning, or ERP, the software helps companies manage their day-to-day activities in most departments. It offers organization and compliance features that streamline the process and make it user-friendly. ERP software shoppers probably know they need top-quality ERP software, but how do they know how to choose the right one for their needs?

[MLOPS] From #GTC21: Best Practices in Handling Machine Learning Pipelines on DGX Clusters

Learn how to set up and orchestrate end-to-end ML pipelines, leveraging large DGX clusters. We'll demonstrate how to orchestrate your training and inference workloads on DGX clusters, with optional setup of remote development environments leveraging the multi-instance GPUs on the NVIDIA A100. We'll also show how pipelines can be built to serve both research and deployment workloads, all while leveraging the compute inherent in the DGX cluster.

[MLOPS] From #GTC21: How to Supercharge Your Team's Productivity with MLOps

Learn how to structure a data scientist-first orchestration setup that allows your DS team to self-manage their allocated NVIDIA GPU clusters, without needing continuous hand-holding from DevOps/IT. We'll demonstrate this setup while using NVIDIA Clara Train SDK to walk through best practices in orchestration, experiment management, and data operations and pipelining. While examples will be health-care-focused, the concepts demonstrated are agnostic to any ML/DL use case in any industry.

[MLOPS] From #GTC21: Workshop - Demonstrating an End-to-End Pipeline for ML/DL Leveraging GPUs

Learn how to take models from research into deployment in an efficient and scalable manner. We'll demonstrate workflows and methodologies so that your data science team can make the most of their NVIDIA hardware systems and software tools (including TRITON!).

5 Reasons You Should Mask PII

If personally identifiable information (PII) falls into the wrong hands, it could have devastating consequences for both you and the affected individuals. But what if you could transform that information so that it would be useless to any attacker? That’s exactly what PII masking seeks to do. So what is PII data masking exactly, and how does PII masking help safeguard your sensitive and confidential information from PII data breaches? Keep reading for all the answers.

What You Need to Know About NetSuite

ERP (enterprise resource planning) software helps organizations streamline and optimize their processes and workflows by maximizing efficiencies and enabling better reporting, intelligence, and analytics. From human resources and logistics to finance and sales, nearly every department can benefit from the judicious use of the right ERP software.

Cloudera Operational Database Replication in a Nutshell

In this previous blog post we provided a high-level overview of Cloudera Replication Plugin, explaining how it brings cross-platform replication with little configuration. In this post, we will cover how this plugin can be applied in CDP clusters and explain how the plugin enables strong authentication between systems which do not share mutual authentication trust.

4 Considerations When Building Your Government Data Strategy

If you’ve followed Cloudera for a while, you know we’ve long been singing the praises—or harping on the importance, depending on perspective—of a solid, standalone enterprise data strategy. While certainly not a new concept, Government missions are wholly dependent on real time access/analysis of data (wherever it may be (legacy data centers or public cloud) to render insight to support operational decisions.

How to Move Kubernetes Logs to S3 with Logstash

Sometimes, the data you want to analyze lives in AWS S3 buckets by default. If that’s the case for the data you need to work with, good on you: You can easily ingest it into an analytics tool that integrates with S3. But what if you have a data source — such as logs generated by applications running in a Kubernetes cluster — that isn’t stored natively in S3? Can you manage and analyze that data in a cost-efficient, scalable way? The answer is yes, you can.

How Enterprise Data Lakes Help Expose Data's True Value

For all of the buzz surrounding both artificial intelligence and data-driven management, many companies have seen mixed results in their quest to harness the value of enterprise data. To avoid those pitfalls, we mixed best-of-breed and proprietary solutions to develop our enterprise data platform (EDP), focusing much of our attention on a combination of smart changes in technology, culture and process for data lakes.

Why Yellowfin built our own CRM analytics solution

Like most organizations, Yellowfin has a CRM tool. The data in your CRM should be able to help you understand how you’re selling and how you win. But everyone I speak to is frustrated by the analytics they get from their CRM. We realized very quickly that the reporting in our CRM tool wasn't meeting our needs, so we built our own solution.

The Data Chief Live: How to Organize Data & Analytics Teams

Join The Data Chief Live, July 8, on How to Organize Data and Analytics Teams! Hear best practices to bigger business value on whether a centralized, decentralized, or hybrid organizational model is best. Join Jennifer Redmon, Cisco, John Thompson, CSL Behring, Cindi Howson, ThoughtSpot and other thought leaders Live.

How to Prepare Data for Microsoft Power BI

For data analysts, business intelligence professionals, and CTOs to optimize and scale business operations, they must first understand the business data that is available to them. One of the best platforms to turn complicated data points from multiple platforms into a singular, coherent data set is Microsoft Power BI. However, you must first prepare the data sets to eliminate fragmentations and create structural consistency. This article explores how to prepare data for Microsoft Power BI.

Two Ways to Migrate Hortonworks DataFlow to Cloudera Flow Management

Hortonworks DataFlow (HDF) 3.5.2 was released at the end of 2020. The new releases will not continue under HDF as Cloudera brings the best and latest of Apache NiFi in the new Cloudera Flow Management (CFM) product. Getting the latest improvements and new features of NiFi is one of many reasons for you to move your legacy deployments of NiFi on this new platform. To that end, we released a few blog posts to help you migrate from HDF to CFM.

Get the most out of Shopify Analytics

Running an eCommerce store is very much like flying a plane - you can reach unprecedented heights, but you won't be able to do it blindfolded. You have to see where you are going to touch the skies. E-commerce analytics gives you the guidance to make the right choice and scale your online store to new heights. In this article, we will take a deep dive into Shopify Analytics Shopify offers analytics as an out-of-the-box default service to all Shopify store owners and admins.

Four ways static dashboards are costing your business

Ask any analyst how they spend the majority of their work day and they’ll tell you: Performing remedial tasks that provide no analytics value. 92% of data workers report that their time is being siphoned away performing operational tasks outside of their roles. Data teams waste an inordinate amount of time maintaining the delicate data-to-dashboards pipelines they’ve created, leaving only 50% of their time to actually analyze data.

Achieving Data Agility Fuels Growth for Financial Services

Data paves the way for every strategic move made by banks and insurance companies. Whether looking to create a new service, complying with regulations, or overhauling and re-engineering legacy operations, a massive data project is always central to the effort. For financial services businesses, the pace at which they can reshape and repurpose data has become a key determinant of their ability to predict market trends and meet client expectations.

BigQuery admin reference guide: Tables & routines

Last week in our BigQuery Reference Guide series, we spoke about the BigQuery resource hierarchy - specifically digging into project and dataset structures. This week, we’re going one level deeper and talking through some of the resources within datasets. In this post, we’ll talk through the different types of tables available inside of BigQuery, and how to leverage routines for data transformation.

PII Pseudonymization: Explained in Plain English

Data processors handle an abundance of data — including personal information about individuals. As the collection and use of data become more widespread, governments continue to enact laws that protect personally identifiable information (PII). Failing to comply with such laws means risking serious fines and penalties, and damaging public trust. Masking PII through pseudonymization is one way to protect it.

What's new with BigQuery ML: Unsupervised anomaly detection for time series and non-time series data

When it comes to anomaly detection, one of the key challenges that many organizations face is that it can be difficult to know how to define what an anomaly is. How do you define and anticipate unusual network intrusions, manufacturing defects, or insurance fraud? If you have labeled data with known anomalies, then you can choose from a variety of supervised machine learning model types that are already supported in BigQuery ML.

Privacy & Security Rules for Healthcare Marketers

Marketing is more complex if you're engaged in the healthcare field. Whether you work with patients or market to consumers interested in healthcare products, it's important to understand HIPAA guidelines. This article explains the basics of HIPAA Privacy and Security Rules, and how this legislation affects your marketing strategy.

2021: The Year Banks Rethink Their API Strategy

Co-authored by Samta Bansal More than two million British people use Open Banking-based products today, a number that, despite the disruption of COVID-19, is double that seen at the start of 2020. No wonder International Banker dubbed 2021 “The Year of Open Banking.” As open banking spreads worldwide, its growth comes with the pressing need to drive standardization that bridges geographic boundaries and regulatory frameworks.

Yellowfin 9.6 release highlights

9.6 is focused on Yellowfin features that enhance the way our customers build, design and embed stunning analytical content, which include data storytelling, augmented analytics, actionable dashboards — and provide a high ease-of-use experience. As always, you can read the full list of updates in our release notes page, and view our release highlights video below to see the new enhancements demonstrated.

What the Death of the Cookie Means for Marketing Analytics

Google Chrome is moving ahead with its plans to deprecate third-party cookies in 2022. The death of the cookie follows more downstream changes to Internet privacy. Marketers need to tackle how these changes will impact their advertising reach but also their ability to collect, measure and analyze their ad data.