Analytics

Understanding Snowflake's Resource Optimization Capabilities

The only certainty in today’s world is change. And nowhere is that more apparent than in the way organizations consume data. A typical company might have thousands of analysts and business users accessing dashboards daily, hundreds of data scientists building and training models, and a large team of data engineers designing and running data pipelines. Each of these workloads has distinct compute and storage needs, and those needs can change significantly from hour to hour and day to day.

Why Hiring a Data Analyst Won't Solve Your Business Problems

As businesses increasingly leverage data-driven decision making, the ability to use and understand data at the company-wide level becomes mission critical. While tech behemoths like Netflix, Airbnb, and Spotify have strong data cultures built over the last decade, most companies often face challenges getting up and running with data.

Reasons why your Big Data Cloud Migration Fails and Ways to Overcome

The Cloud brings many opportunities to help implement big data across your enterprise and organizations are taking advantage of migrating big data workloads to the cloud by utilizing best of breed technologies like Databricks, Cloudera, Amazon EMR and Azure HDI to name a few. However, as powerful as these technologies are, most organizations that attempt to use them fail. Join Chris Santiago, Director of Solution Engineering as he shares the top reasons why your big data cloud migration fails and ways to overcome it.

New Multithreading Model for Apache Impala

Today we are introducing a new series of blog posts that will take a look at recent enhancements to Apache Impala. Many of these are performance improvements, such as the feature described below which will give anywhere from a 2x to 7x performance improvement by taking better advantage of all the CPU cores. In addition, a lot of work has also been put into ensuring that Impala runs optimally in decoupled compute scenarios, where the data lives in object storage or remote HDFS.

Do You Trust the Health of Your Data?

Today, companies can measure every aspect of business health, except the health of their data which drives business decisions. Data is vital to inform critical decisions such as identifying new routes to market, systems to support business agility, and more resilient supply chains. As Harvard Business Review puts it, “Your organization’s data is the source of both the opportunity and the challenges to your innovation.

All you need to know about data architecture

Data architecture is a hot topic right now. And rightfully so. Technological advances bring out a myriad of new solutions that go beyond the traditional relational databases and data warehouses. They enable companies to accelerate their entire data pipeline (or at least remove painful bottlenecks) and shorten the analytic cycles. The portfolio of data assets managed by companies is also growing.

Is Elasticsearch the Ultimate Scalable Search Engine?

For enterprise applications and startups to scale, they need to manage large volumes of data in real-time. Customers must have the ability to search for any product or service from your database within seconds. When you manage a relational database, data is spread across multiple tables. So, customers may experience lag during search and data retrieval. However, this is different with Elasticsearch and other NoSQL databases.