January 2020

Fresh Features: automated data discovery goes prime time

Here’s part 2 of ‘Fresh Features’ of Yellowfin 9. And this week, we’re looking at Signals. Signals is Yellowfin’s unique and powerful automated data discovery product. It will automatically scan your dimensional data and find any significant changes in your data. Then, it automatically sends you an alert, complete with analysis and correlations to help you take swift action to nip issues in the bud or build on successes.

Cloudera Data Warehouse - What You Should Know

Cloudera Data Warehouse is just one of the many experiences you can use on the Cloudera Data Platform (CDP). Cloudera Data Warehouse packages up projects you may already know and use, such as Impala and Hive, into a service. This service runs on Kubernetes, which gives it the ability to pause, resume, and scale up or down quickly and automatically.

From GDPR to CCPA, the right to data access is the Achilles' Heel of data privacy compliance and customer trust - Part 3

In the first and second blog posts we explained the importance of DSAR as well as how the customer experience can be impacted if the process is not well managed. In this last part, we will go through a few tips that could help you to be DSAR champions!

How Cloudera Enables R Users to Optimize Their Data Science and Machine Learning Workflows

This week, R users from around the world convene in San Francisco for rstudio::conf 2020. With a packed agenda of new package announcements and case studies highlighting successful applications of R across different industries, it’s evident that R and the ecosystem of tools around it make up a vital part of the data science and machine learning landscape.

Bringing action into analytics

One of the really interesting topics in business intelligence is the desire for organizations to take action from data. Traditionally this has been a huge hurdle for organizations to overcome with the reporting tools that they have. While most use dashboards as a source of information, dashboards certainly don't prompt or drive them to act. This means they can’t close the loop between the insights and the decisions that those insights drive.

Deep Learning for Anomaly Detection

We are excited to release Deep Learning for Anomaly Detection, the latest applied machine learning research report from Cloudera Fast Forward Labs. Anomalies, often referred to as outliers, are data points or patterns in data that do not conform to a notion of normal behavior. Anomaly detection, then, is the task of finding those patterns in data that do not adhere to expected norms. The capability to recognize or detect anomalous behavior can provide highly useful insights across industries.

Is Predictive Analytics the Answer to Happier Customers in Telecom?

Technology is rapidly transforming how customers deal with businesses—and how businesses cater to customers. Advancements in technology have enabled customers to share their experience, both good and bad, of using a product or a service they received. Though all this information is used by businesses to measure customer satisfaction, how can they be certain that they're personalizing offers that are relevant to customers and tapping into the real opportunities?

Understanding Healthcare's New Industry Imperative: Data Chain of Custody

One of the first recorded medical devices was the stethoscope in 1816. Fast forward two centuries to 2019, when the world witnessed the creation of an award-winning multi-sensor, implantable cardiac device able to predict potential heart failure weeks in advance. The data and analytics streamed and analyzed from new connected devices are transforming healthcare as we know it. However, a real challenge in this environment is the sheer volume and scope of data that must be managed and protected.

Fresh Features: Yellowfin 9 Charts

With so much packed into the latest Yellowfin 9 release, we figured it would be great to let you know about some of the coolest features (which will really transform how you do analytics!) in this series of blog posts - Fresh Features. At Yellowfin, we’re super excited that you will be benefiting from the huge amount of work our development team have been doing behind the scenes to completely revamp Yellowfin’s look, feel, and functionality.

Insurance in 2020 & Beyond - Learning from the past decade to plan for the next

Like many other people, I used time over the recent holidays to clean out and organize my digital files. In that process, I finally trashed the speaking notes for a panel I participated in at SMA’s (Strategy Meets Action) first summit in 2012 when I worked at a large global insurer. During that session, a gentleman in the audience asked me what I thought about “big data” and its implications for Insurance.

How to create a data culture through data storytelling

One of the core reasons that organizations invest in analytic solutions is that they want to get everyone in their organization on the same page. They want everyone to understand what's happening and why it's happening so that individuals know what they need to do to be successful and drive outcomes for the organization.

Real-time log aggregation with Flink Part 1

Many of us have experienced the feeling of hopelessly digging through log files on multiple servers to fix a critical production issue. We can probably all agree that this is far from ideal. Locating and searching log files is even more challenging when dealing with real-time processing applications where the debugging process itself can be extremely time-sensitive.

How the Acquisition of Waterline Data Will Help Hitachi Vantara Scale Your Digital Advantage

The opportunity to create new economic, social and environmental value by unlocking the “good” in data is immense. While the problems we face as a society may be getting harder to solve, the advances we can make when we break down the silos between the physical and digital worlds are profound.

Spark APIs: RDD, DataFrame, DataSet in Scala, Java, Python

Once upon a time there was only one way to use Apache Spark, but support for additional programming languages and APIs has been introduced in recent times. A novice can be confused by the different options that have become available since Spark 1.6 and intimidated by the idea of setting up a project to explore these APIs.

These Two Trends Will Put an End to Business as Usual in 2020

Where did the last decade go? It seems like it was just 2010 and I was writing about the future of business in 2020. Well, it is now here! I’ve spent much of my career in finance/accounting and management consulting and the last decade+ helping companies link their business and technology strategies with a focus on data and analytics. Where will we head in 2020 and this next decade?

Unravel Earns Prestigious SOC 2 Security Certification

Security is top of mind for every enterprise these days. There are so many threats they can hardly be counted, but one commonality exists: data is always the target. Unravel’s mission is to help organizations better understand and improve the performance of their data-based applications. We’re a data business, so we appreciate the scope and implications of these threats.

Qlik Welcomes RoxAI - Advanced Alerting and Intelligent Automation Comes to Qlik Sense

I’m excited to share today that we announced the acquisition of RoxAI and its Ping intelligent alerting software at our 2020 sales kickoff. This is becoming an exciting annual tradition – last year we similarly announced from the SKO stage our intent to acquire Crunch Data, which has been rebranded as Insight Bot. Like that announcement, RoxAI and Ping will help increase the value of data through insights delivered to users where they work.

3 components you need to create the ultimate dashboard design

In the ideal world, when a developer builds a dashboard, they want that dashboard to be used to its fullest extent. Dashboards would engage users with their design, provide users with both information and the means to take immediate action (without leaving the dashboard), and become part of users’ everyday workflows.

What I would do if I was starting a software company today

Starting a software company today is very different than it was 15 years ago. The fundamental reason for this is that mega-vendors now exist across all product ranges. For any product you can think of there is already a mega-vendor in the space delivering it in the cloud. This means if you want to start a software business now you have to do it differently. If I was starting out today, there are three things I would do.

From GDPR to CCPA, the right to data access is the Achilles' heel of data privacy compliance and customer trust - Part 2

In the first part of this series, I explained what a DSAR is and why organizations should care about it. Now, let’s take a look at how customers can perceive the process. Our recent GDPR benchmark research shows that the road can be tortuous.

Announcing support for Apache Flink with the GA of Cloudera Streaming Analytics

We cannot contain our excitement any longer! For the last few months, our Data-in-Motion engineering teams have been working hard to deliver a compelling and critical part of our Cloudera DataFlow (CDF) story. To enhance our Stream Processing and Analytics narrative within the overall Data-in-Motion platform, we give you support for Apache Flink with the general availability of Cloudera Streaming Analytics (CSA).

What I would do if I was starting a software company today

Starting a software company today is very different than it was 15 years ago. The fundamental reason for this is that mega-vendors now exist across all product ranges. This means if you want to start a software business now you have to do it differently. If Glen Rabie, Yellowfin's CEO, was starting out today, here are three things he would do.

How Businesses are Using Machine Learning Anomaly Detection to Scale Partner and Affiliate Tracking

Today’s business needs make it virtually impossible to function without relying on an extensive network of partners and third-party providers. An IBM study found that 70 percent of businesses were looking to increase their external partnerships.

Updated Cloudera Manager Tour

Cloudera Manager's look has been updated with the arrival of the Cloudera Data Platform. Although CDP is largely configured and controlled through the Control Plane, there are still some options available to you in Cloudera Manager when working with an Environment or in a Data Hub cluster. This quick tour of the different views and menus will hopefully help you align yourself to the new layout.

Like the Infinity Stones, keep your Talend services as far apart as possible

Ok, so we all probably know why keeping all Infinity Stones in one place is a bad idea, right? You must now be wondering what the relationship between Infinity Stones and Talend could be. Worry not, Thanos isn’t coming and there is a reasonable explanation behind the MCU fandom references, I promise.

Placing the Emphasis on Data in the Federal Data Strategy

In mid-June of 2019, the White House Office of Management and Budget (OMB) released the Draft 2019-2020 Federal Data Strategy Action Plan. The plan outlines a series of steps and principles targeting effective governance, responsibilities and best practices for federal agencies’ use of citizen data. When put into place, these action items will allow government agencies to maximize data, improve security and better serve constituents.

Panguin Tool - Ace at Search Engine Optimization

Panguin SEO Tool is one of the best free search engine optimization tools available. It shows you the effects of every Google algorithm update on your website. To gain more views through search engine optimization, it is important to learn what has affected your website in the past and cost you views. Panguin shows what could have caused the drop so that it can be resolved easily.

Supreme Software and Expert's Choice - the FinancesOnline Awards for Yellowfin analytics

Awards are great. They’re a little piece of recognition for all the work you’ve put into something. Whether it’s a big name attached to the award, like Gartner or BARC, or a lesser-known name, every award says something about the business given it. And Yellowfin was just awarded three by FinancesOnline - the Supreme Software award, Expert’s Choice award, and the Great User Experience certificate. So what do they say about this organization?

Talend's Next Chapter

Today we open a new chapter at Talend, in which we begin our journey from a $250M company to a $1 billion cloud market leader. Over the last six years, I've been honored to help build and lead the team that brought Talend from a $50M startup, through its IPO in 2016 to become a quarter-billion-dollar company. Together, we built one of the fastest-growing cloud businesses in the world.

How Scania is Driving Logistical Efficiency and Sustainability with Big Data

Organizations in the transportation and manufacturing industries are applying Industrial IoT concepts and technology to transform product development, supply chains, and manufacturing operations. Scania is driving logistical efficiency and sustainability with big data. Scania is a world-leading provider of transport solutions and is leading the shift towards sustainable transport systems. In 2018 it delivered 88,000 trucks, 8,500 buses as well as 12,800 industrial and marine engines to customers.

The wild days of software have come to an end

It was easy to start a software company 15 years ago. There was a huge transformation from desktop to the cloud and that created an opportunity for any vendor to establish their place at the table offering cloud-based software. But it’s now more difficult to create new products that are significant and compete because we have mega-vendors in the cloud. The days of being able to bring a really significant product to market are over.

Introducing Apache Spark on Docker on top of Apache YARN with CDP DataCenter release

Bringing your own libraries to run a Spark job on a shared YARN cluster can be a huge pain. In the past, you had to install the dependencies independently on each host or use different Python package management tools. Nowadays Docker provides a much simpler way of packaging and managing dependencies, so users can easily share a cluster without running into each other or waiting for central IT to install packages on every node.

From GDPR to CCPA, the right to data access is the Achilles' heel of data privacy compliance and customer trust - Part 1

In December 2019 we released the second edition of our data privacy benchmark, and this year again, results are shocking: 18 months after GDPR came into force, 58% of surveyed companies are not performing with data privacy. The issue relates to the right of access, which gives individuals the right to obtain a copy of their personal data.

Three Trends in Cloud Computing to Expect in 2020

A new year is upon us and that means it’s time to look ahead to what’s coming next. In cloud computing, organizations are going to be making adjustments in 2020 – to accommodate overstrained budgets, new regulations, and shifting technologies. It will be a year of identifying what’s not working and moving toward the right solutions. Let’s take a look at three trends that will impact cloud computing across all industries in the coming year.

The Key Principles of a Successful Time Series Forecasting System for Business

An emerging field of data science uses time series metrics to develop an educated estimate of future developments in business such as revenue, sales, and demand for resources and product deliverables. A forecast is based on historical data of a given metric plus other relevant factors. Accurate forecasts are an important aspect of corporate planning.

Data privacy hidden gems in Talend Component Palette: Part 2

Data privacy is becoming more of a buzzword in technical circles day by day. Some time back, we thought that illegally gathering personally identifiable information from data servers could happen only in James Bond and Mission Impossible movies. But technology is changing quite rapidly, and in this era of global virtual connectivity, customers' private information is becoming more and more insecure. The news of customer data getting misused by data analytics companies, data theft from major banks, etc.

The Top 10 Anomalies of the Last Decade

As a company known for our anomaly detection, we know a thing or two about spotting irregularities. So as we reached the end of 2019, we couldn’t help but think back on the 2010s and the anomalies that shook the world. Once we got to listing them, it really became tough to pick just 10. Ultimately, after much debate, we ranked them based on their impact, newsworthiness and how utterly unexpected they were.

Why SaaS vendors should invest in automated analytics

One of the challenges with being a fast-growing SaaS vendor is that you're so consumed with running your business that you're unlikely to stop and take a breath. If you’re growing quickly, there can also be a perception that you don’t really need to look at what’s happening in your business. Instead, you may divert resources to building a new product or marketing rather than actually thinking about your business.