August 2020

MLOps for Python: Real-Time Feature Analysis

Data scientists today have to choose from a massive toolbox in which every item has its pros and cons. We love the simplicity of Python tools like pandas and Scikit-learn, the operation-readiness of Kubernetes, and the scalability of Spark and Hadoop, so we just use all of them. What happens? Data scientists explore data using pandas, then data engineers use Spark to recode the same logic so that it scales or works with live streams and operational databases.

Migration Supporting Real-Time Analytics for Customer Experience Management

Service Management Group (SMG) offers an easy-to-use experience management (XM) platform that combines end-to-end customer and employee experience management software with hands-on professional services to deliver actionable insights and help brands get smarter about their customers. The XM platform, smg360, helps customers across verticals, including restaurants, retail, and healthcare, drive changes that boost loyalty and improve business outcomes.

Streaming Analytics in the Real World

From leading banks and insurance organizations to some of the largest telcos, manufacturers, retailers, and healthcare and pharma companies, organizations across diverse verticals lead the way with real-time data and streaming analytics. These businesses use data-fueled insights to enhance the customer experience, reduce costs, and increase revenues. And Cloudera is at the heart of enabling these real-time, data-driven transformations.

Sitechecker - Covers All Stages of SEO Campaigns of Any Scale

SEO is a very important aspect of the digital marketing strategy for a business website or a blog. That said, you need to take care when implementing this strategy, and you should have a reliable tool to gauge the performance of your SEO efforts. With plenty of tools and services on the market, it is often overwhelming for SEO professionals and digital marketers to choose the best. Sitechecker is a known name, and many SEO professionals recommend it.

Faster Application Development with Cloudera Operational Database (COD) Demo Highlight

IT is no longer relegated to the IT group. Lines of business are building new business applications that can drive their business’s top and/or bottom lines. These applications are increasingly stateless -- meaning that they rely on their underlying operational database to manage their state and work with IT to build, deploy and manage the database infrastructure. The application development lifecycle is accelerating with the broad adoption of cloud and the rise of dbPaaS where the database is fully managed and self-optimizes for the applications. In this session, we will show you how the Cloudera Operational Database offers an accelerated on-ramp to app development by offering a modern multi-model database that eliminates infrastructure management.

The Future Of The Telco Industry And Impact Of 5G & IoT - Part II

In part 2 of the series focusing on the impact of evolving technology on the telecom industry, we sat down with Vijay Raja, Director of Industry & Solutions Marketing at Cloudera to get his views on how the sector is changing and where it goes next. Hi Vijay, thank you so much for joining us again. To continue where we left off, as industry players continue to shift toward a more 5G centric network, how is 5G impacting the industry from a data perspective?

Mini Charts - Another Way To Visualize Your Data

Have you ever felt that, by looking at just numbers, you are probably missing out on the bigger picture? That’s, of course, why we have visualizations – to form patterns out of those numbers that we can then interpret, as well as glean insights from and deepen our understanding of what the data reveals. Now, what if we bring numbers and visualizations together in a table, as one way to explore those numbers and find hidden gems of meaning? Let’s have a look at mini charts!

Webinar - 4 Reasons to Join the Snowflake Community

From sharing Snowflake best practices with other users to networking and talking tech with global data experts, the benefits of joining the Snowflake Community are endless. Join us virtually on August 27 to learn about four reasons to become a member of the Snowflake Community. During this 60-minute webinar, the Snowflake Community team will share an overview of the programs and resources available to community members. Sign up to learn how joining the Snowflake Community will help you continually expand your data expertise, support your professional growth, scale your industry network, and influence Snowflake’s product roadmap. 

Connect the Data Lifecycle: The power of data

There’s no doubt that cloud has become ubiquitous, and thank goodness for that in 2020. We wouldn’t have survived the challenges of this year without cloud. It’s supported everything, from the sudden changes in the way we work to the way we access healthcare and even shop for vital goods. While cloud is the vehicle, it’s what sits on it that makes it so valuable — data.

Certified technical partner solutions help customers succeed with Cloudera Data Platform

On August 18, we completed our Enterprise Data Cloud vision of bringing a truly hybrid cloud experience with the general availability of Cloudera Data Platform Private Cloud (CDP Private Cloud). CDP Private Cloud, which is based on Kubernetes (Red Hat OpenShift), extends cloud-native speed, simplicity and economics for the connected data lifecycle to the on-prem world, enabling IT to respond to business needs faster and deliver rock-solid service levels so people can be more productive with data.

The Advantages Of Live Data-Streaming In The Competitive Financial Services Sector (Part II)

Live data-streaming offers businesses exciting new opportunities to transform the way they operate, leveraging real-time insights to drive better decision making and enhance operational efficiency. To find out more about how live-streaming data might impact the financial services sector, I sat down for a chat with Dinesh Chandrasekhar, Head of Product Marketing in Cloudera’s data-in-motion Business Unit. If you missed Part 1 of our Q&A, you can catch up on it here.

Snowflake on Snowflake: How We Strengthened Data Governance Using Dynamic Data Masking

Managing access to sensitive data is the name of the game when it comes to security and data governance. It’s required to protect sensitive data from unauthorized changes or exposure, and it’s now a mandate as part of privacy regulations such as GDPR and the California Consumer Privacy Act (CCPA). Companies all over the world are now focused on protecting sensitive PII associated with their customers and employees.

The Role Of Technology In A Changing Financial Services Sector Part II

Evaluating anomalies and unpredicted events like pandemics and ESG concerns. In part II of the series, we sat down for an interview with Dr. Richard Harmon, Managing Director of Financial Services at Cloudera, to find out more about how the industry is adopting new technology. You can catch up and read part 1 of the series here. Thank you for joining us for part two of our discussion around data, analytics and machine learning within the Financial Services sector, Dr. Harmon.

Production ML Capabilities Now Available In CDSW 1.8

With only about 35% of machine learning models making it into production in the enterprise (IDC), it’s no wonder that production machine learning has become one of the most important focus areas for data scientists and ML engineers alike. As you may remember, we recently announced a full set of MLOps capabilities in Cloudera Machine Learning, our cloud-native machine learning tool.

The Gap in Data and Analytics Supply Chains Requires a New Way of Thinking

Today, we announced the acquisition of the assets and IP of Knarr Analytics, an innovative start-up that provides real-time collaboration, sophisticated data exploration and insight capture capabilities, to complement Qlik’s cloud data and analytics platform.

Archive data from Kafka to S3 with the new Kafka Connect connector

The new open-source #ApacheKafka Connect sink connector for #S3 gives you full control over how to sink data to S3 and save money on long-term storage costs in #Kafka. The connector can flush data out in a number of different formats, including #AVRO, #JSON, #Parquet and #Binary, and can create S3 buckets based on partitions, metadata fields and value fields.

What is Automated Business Monitoring?

Modern businesses face a continuously growing mountain of data, made up of many different operational processes and performance metrics, which your users need to decipher for insights and proactively monitor in order to ensure business goals are met. Embedded dashboards are one way for your users to keep track of key information while using your application.

Stop Using Kubernetes for ML-Ops; Instead use Kubernetes

If your company has already started getting into machine learning / deep learning, you will quickly relate to the following story. If your company is taking its first steps into data-science, here is what is about to be dropped on you. If none of the above strikes a chord, well it’s probably good to know what’s out there because data-science is all the rage now, and it won’t be long until it gets you too 🙂

Dancing with Elephants in 5 Easy Steps

The Corner Office is pressing their direct reports across the company to “Move To The Cloud” to increase agility and reduce costs. And next to those legacy ERP, HCM, SCM and CRM systems, that mysterious elephant in the room – that “Big Data” platform running in the data center that is driving much of the company’s analytics and BI – looks like a great potential candidate.

The Advantages Of Live Data-Streaming In The Competitive Financial Services Sector (Part I)

Live data-streaming offers businesses exciting new opportunities to transform the way they operate, leveraging real-time insights to drive better decision making and enhance operational efficiency. To find out more about how live-streaming data might impact the sector I sat down for a chat with Dinesh Chandrasekhar, Head of Product Marketing in Cloudera’s data-in-motion Business Unit.

How To Modernize Your Mainframe Data For A Modern World

Back in 1977, an English mod/rock band called The Jam released a track called “The Modern World.” The same year also saw the dawn of the home computer, with the Apple II and Commodore PET 2001 going on sale at a time when the mainframe was going strong. Fast forward to 2020, and, despite many false claims of its death, the mainframe remains strong; however, to get the most value from the data inside, mainframe modernization is key.

Talend Data Fabric August '20 release: Expanding cloud capabilities to meet the needs of today's data citizens

Talend is excited to announce the latest improvements to Talend Data Fabric, including expanded cloud capabilities, in our August ’20 release. Talend now offers Talend Cloud Data Catalog hosted on either AWS or Azure platforms. It provides the same great features and functionality as our on-premises solution but with no on-premises installation for a complete SaaS solution for data governance.

Data for Enterprise AI: at the very forefront of innovation

2020 may well go down as the year in which what seemed impossible one day became possible the next. It’s been a year filled with disruption and uncertainty. One day we were all going to the office, and the next we were working from home. Businesses had to literally switch operations, and enable better collaboration and access to data in an instant, while streamlining processes to accommodate a whole new way of doing things.

Using Cloudera Data Engineering to Analyze the Paycheck Protection Program Data

The Paycheck Protection Program (PPP) is implemented by the US federal government to provide a direct incentive for businesses to keep their employees on the payroll, particularly during the Covid-19 pandemic. PPP helps qualified businesses retain their workforce and pay for related business expenses. Data from the US Treasury website show which companies received PPP loans and how many jobs were retained. The US Treasury approved approximately one million PPP loans across the US.

Yellowfin Embedded Analytics Walkthrough for Product Teams

As a recognized leader in embedded analytics, Yellowfin has been designed and built to enable you to embed amazing analytical experiences into your software. From a highly integrated dashboard module and full self service reporting, to enabling best practice integration that blurs the lines between analytics and your application and workflows. Modernize your reporting environment with Yellowfin to ensure your customers engage with your data, discover insights faster through automation and innovate with contextualized analytics.

Automated Deployment of Apache Spark Jobs in Cloudera Data Engineering

In this video we're going to go over some more advanced features of the Cloudera Data Engineering Experience. Using some publicly accessible Paycheck Protection Program data, you'll see how to automatically set up Spark jobs for deployment by using the CDE CLI, making development and deployment much quicker and less painful. We'll also take the development cycle through to the end and get some visualization of the finished reports using the aforementioned PPP data.

Operational Database Performance Improvements in CDP Private Cloud 7 vs CDH5

Cloudera Data Platform (CDP) Private Cloud is the most comprehensive on-premises platform for integrated analytics and data management. It combines the best of Cloudera Enterprise Data Hub and Hortonworks Data Platform Enterprise Plus, and brings the latest and greatest open source technologies for data management and analytics to the data center. With the latest version (7) of CDP Private Cloud, we’ve introduced a number of new features and enhancements.

The New Partnership of Security and Data Analytics at Prologis

Data analytics is going through tremendous growth while helping businesses succeed in a challenging economy. But cybersecurity presents increasing challenges for businesses. Prologis, however, has combined data analytics and cybersecurity to improve visibility, cut costs, and reduce risk. Prologis plays an essential role in the global supply chain.

What makes a good KPI

Everyone talks about KPIs, but do you know what makes a good or bad KPI? Tracking and forecasting KPIs is empowering and valuable for any organization, but only if done right. A key performance indicator is a metric used to measure and evaluate how well a company is achieving its set goals. Peter Drucker once said, "what gets measured gets done." KPIs help organizations work towards their critical objectives by creating a basis for the value of work done.

Remote Engine Clusters for Data Services and Routes

Talend strives to keep you ahead of the curve by announcing the availability of remote engine clusters for APIs and real-time integrations as part of Talend Cloud API services. Now you can easily achieve high-availability and scale for your Data Services and Routes, thereby improving quality of service. The feature works for both Talend Runtime and microservice deployments.

Forrester TEI Study Shows Snowflake Can Deliver a Customer ROI of 612% Over 3 Years

We are pleased to share the findings of a Forrester Consulting Total Economic Impact™ (TEI) study that evaluates the cost savings and business benefits enabled by Snowflake’s cloud data platform. Snowflake commissioned the study to gain insight into the return on investment we’re delivering for customers and learn how we can serve them better.

Before Making A Big Splash - 5 "Gotchas" To Avoid When Building a Data Lake

There is no debate. Making sense of your data is just good business. Studies from Forrester Research, McKinsey and more show that companies that leverage their data better tend to outperform their less-informed counterparts. Our own recent research, Data as the New Water: The Importance of Investing in Data and Analytics Pipelines, done in partnership with IDC, shows that companies that optimize their data pipelines see enhanced operational efficiency (88 percent vs.

Big Data Analytics for Healthcare: Improving the patient experience

At their core, healthcare organizations revolve around the patient. Like any other business, the healthcare industry is consumer oriented. There are customers to take care of, literally, which is why it’s important to create a great patient experience. However, with COVID-19 putting a lot of pressure on the industry, maintaining a patient-centered approach has become a challenge.

The Future Of The Telco Industry And Impact Of 5G & IoT - Part 1

Communication Service Providers (CSPs) are in the middle of a data-driven transformation. The current scale and pace of change in the Telecommunications sector is being driven by the rapid evolution of new technologies like the Internet of Things (IoT), 5G, advanced data analytics and edge computing. This is opening up new revenue opportunities, use cases and even the possibility for different types of business models within the sector, changing the way that CSPs operate.

Enabling Automated Issue Resolution through the use of conversational ML

The Cloudera Support Organization has always strived to not only provide solutions to our customers but to also deliver helpful knowledge. One of the primary sources of that knowledge comes from our Knowledge Articles. This content is created and curated by our knowledgeable Support Staff based on real-world experience coming from support cases. These Knowledge Articles have proven to be invaluable to our Support Staff over the years.

Gartner A&BI Showfloor Showdown: Analyzing key drivers behind life expectancy

It almost seems like eons ago now, but at this year’s Gartner Data and Analytics Sydney Summit, Yellowfin were excited to participate in the first-ever Analytics & BI Showfloor Showdown, alongside IBM and Oracle. As a way of facilitating a side-by-side comparison - and in the spirit of an insightful and entertaining session - we were all invited to look at the WHO health indicators data set, understand the variables that drive life expectancy, and present them to the crowd.

Website Traffic Monitoring - Beyond Google Analytics

In today’s digital age there are numerous websites that compete to drive traffic to their portals. They invest heavily in strategies like SEO and SMM to get visitors hooked on their content. Getting traffic on your website is important but it’s also critical to understand how visitors interact with your content. This is where website analytics tools and solutions come in.

Apache Ozone Fault Injection Framework

One of the key challenges of building an enterprise-class robust scalable storage system is to validate the system under duress and failing system components. This includes, but is not limited to: failed networks, failed or failing disks, arbitrary delays in the network or IO path, network partitions, and unresponsive systems.
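The idea of validating a system under injected failures can be illustrated with a small, generic sketch. This is not the Apache Ozone framework or its API; `FaultInjector`, `InjectedFault`, `read_block`, and `read_with_retries` are hypothetical names chosen for illustration, and the in-memory `dict` stands in for a real storage backend. The sketch only shows the general pattern of wrapping an I/O call so that delays and failures can be introduced on demand.

```python
import random
import time


class InjectedFault(IOError):
    """Raised in place of a real I/O error when a fault is injected."""


class FaultInjector:
    """Wraps a callable and injects delays or failures (hypothetical helper)."""

    def __init__(self, failure_rate=0.0, max_delay_s=0.0, seed=None):
        self.failure_rate = failure_rate
        self.max_delay_s = max_delay_s
        self._rng = random.Random(seed)

    def call(self, func, *args, **kwargs):
        # Simulate an arbitrary delay in the network or IO path.
        if self.max_delay_s > 0:
            time.sleep(self._rng.uniform(0, self.max_delay_s))
        # Simulate a failed disk or an unresponsive system.
        if self._rng.random() < self.failure_rate:
            raise InjectedFault("injected failure")
        return func(*args, **kwargs)


def read_block(store, key):
    """Stand-in for a real storage read."""
    return store[key]


def read_with_retries(injector, store, key, attempts=5):
    """Retry reads through the injector; returns (value, attempts_used)."""
    for attempt in range(1, attempts + 1):
        try:
            return injector.call(read_block, store, key), attempt
        except InjectedFault:
            continue
    raise InjectedFault(f"gave up after {attempts} attempts")
```

With `failure_rate=0.0` the read succeeds on the first attempt, and with `failure_rate=1.0` every attempt fails, which makes it easy to check that the retry logic under test behaves as intended at both extremes.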

Demo - Qlik and Microsoft Unleash SAP Data For Analytics

This integrated solution accelerates analysis of SAP data by combining the automated data delivery capabilities of Qlik Data Integration with the agility of the Azure platform. Qlik and Microsoft offer a free Proof-of-Value (PoV) that includes Software and Expertise for real-time SAP Analytics in Azure. See what is possible.

BigQuery now offers industry-leading uptime SLA of 99.99%

More than ever, businesses are making real-time, data-driven decisions based on information stored in their data warehouses. Today’s data warehouse requires continuous uptime as analytics demands grow and organizations require rapid access to mission-critical insights. Business disruptions from unplanned downtime can severely impact company sales, reputation, and customer relations.
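To put a 99.99% uptime figure in perspective, the arithmetic can be sketched as follows. The 30-day period is an assumption made for illustration; the actual SLA terms define how the measurement window and downtime are counted, and `allowed_downtime_minutes` is a hypothetical helper name.

```python
def allowed_downtime_minutes(sla_percent, period_days=30):
    """Maximum downtime, in minutes, that still meets the given uptime percentage."""
    total_minutes = period_days * 24 * 60
    return total_minutes * (100 - sla_percent) / 100


# Compare a few common availability targets over an assumed 30-day period.
for sla in (99.9, 99.99):
    print(f"{sla}% uptime allows about {allowed_downtime_minutes(sla):.2f} minutes of downtime")
```

Under that assumption, 99.99% leaves room for only a little over four minutes of downtime in a 30-day period, roughly a tenth of what a 99.9% target allows.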

Accelerating Mayo Clinic's data platform with BigQuery and Variant Transforms

Genomic data is some of the most complex and vital data that our customers and strategic partners like Mayo Clinic work with. Many of them want to work with genomic variant data, which is the set of differences between a given sample and a reference genome, in order to diagnose patients and discover new treatments. Each sample’s variants are usually stored as a Variant Call Format file, or VCF, but files aren’t a great way to do analytics and machine learning on these data.
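To illustrate why row-oriented records are easier to analyze than raw files, here is a minimal sketch that parses one VCF data line into a structured record. It is not the Variant Transforms tool and handles only the eight fixed VCF columns (CHROM, POS, ID, REF, ALT, QUAL, FILTER, INFO); `Variant` and `parse_vcf_line` are hypothetical names, and per-sample genotype columns are ignored.

```python
from typing import NamedTuple


class Variant(NamedTuple):
    chrom: str
    pos: int
    ref: str
    alts: list
    info: dict


def parse_vcf_line(line):
    """Parse one VCF data line; returns None for header lines starting with '#'."""
    if line.startswith("#"):
        return None
    fields = line.rstrip("\n").split("\t")
    # The first eight tab-separated columns are fixed by the VCF format.
    chrom, pos, _id, ref, alt, _qual, _filter, info = fields[:8]
    info_dict = {}
    if info != ".":
        for item in info.split(";"):
            # INFO entries are either key=value pairs or bare flags.
            key, _, value = item.partition("=")
            info_dict[key] = value if value else True
    # ALT may list several alternate alleles separated by commas.
    return Variant(chrom, int(pos), ref, alt.split(","), info_dict)


record = parse_vcf_line("20\t14370\trs6054257\tG\tA\t29\tPASS\tNS=3;DP=14;DB")
print(record.chrom, record.pos, record.ref, record.alts)
```

Once each variant is a record with typed fields, it can be loaded as a table row and queried or joined like any other dataset, which is the kind of structure analytics and machine learning tools expect.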

Data Security and Governance - not the lockdown businesses expect it to be

The current economic climate has meant a sudden and seismic step-change in how many of our customers operate. The recent shift to remote working has seen an increase in conversations around how data is managed, with many businesses needing to achieve democratic data access in order to derive value and improve efficiencies as we navigate the ‘new norm’. Toolsets and strategies have had to shift to ensure controlled access to data.

Top Takeaways From CDO Sessions: Customers and Thought Leaders

We’ve been busy speaking to our customers and thought leaders in the industry and have rounded up the key takeaways from our latest CDO sessions. Here are some of the top takeaways and advice gained from these sessions with big data leaders, Kumar Menon from Equifax, Anheuser-Busch’s Harinder Singh, Sandeep Uttamchandani from Unravel, and DBS Bank’s Matteo Pelati.

Factory Edge to Cloud Analytics- Three Fundamental Steps to Success

We recently Googled the manufacturing use case “predictive maintenance” and were astonished by the results: 82 MILLION results were returned. Next, we Googled “process optimization” and it yielded even more results – 302 MILLION. Clearly, these use cases are top of mind in today’s manufacturing landscape, considering digital transformation will deliver $11 Trillion USD in economic value by 2025.

Better BigQuery pricing flexibility with 100 slots

BigQuery is used by organizations of all sizes, and to meet the diverse needs of our users, BigQuery offers highly flexible pricing options. For enterprise customers, BigQuery’s flat-rate billing model is predictable and gives businesses direct control over cost and performance. We’re now making the flat-rate billing model even more accessible by lowering the minimum size to 100 slots, so you can get started faster.

Cloudera Data Warehouse on Azure Provides Fast, Cost-Effective and Highly Scalable Analytics

The Cloudera Data Warehouse (CDW) service is a managed data warehouse that runs Cloudera’s powerful engines on a containerized architecture. It is part of the new Cloudera Data Platform, or CDP, which went live on Microsoft Azure earlier this year. The CDW service lets you meet SLAs, onboard new use cases with zero friction, and minimize cost. Today, we are pleased to announce the general availability of CDW on Microsoft Azure.

5 Obstacles to Successful Data Governance

Organizational leaders worldwide agree that data governance is important. However, data governance programs in most companies are still being planned or in progress. In a 2020 Dataversity report¹, only 12 percent of companies had fully implemented programs, while 38 percent of programs were a work in progress, and 31 percent were just getting started. That’s because companies often run into roadblocks while executing data governance. Below are five common obstacles organizations face.

Fivetran's Approach to Automated Data Integration, with Co-Founder and COO Taylor Brown

Fivetran’s mission is to make data as accessible and reliable as electricity. We're focused on providing automated access to data so data analysts and engineers can be empowered to actually analyze their data. For small companies and large enterprises, Fivetran replicates data from 150+ sources to enable business intelligence and data-driven decisions alongside our partner companies. Learn more at Fivetran.com

Qlik App Analyzer - Demo

The App Analyzer provides a comprehensive dashboard to analyze application metadata across a Qlik Sense tenant, providing developers and administrators with a holistic view into the makeup and integrity of all applications. This includes granular level detail about application data models and memory allocation including base RAM, peak reload RAM, field/table RAM, row counts, cardinality, data islands, synthetic keys, and more.

Qlik Data Transfer - Quick Demo

Qlik Data Transfer – automate data transfer of your on-premise data sources to Qlik Sense SaaS deployments. Connect to files, bundled ODBC and REST data sources, select tables and fields and push on demand or on a schedule – while optionally triggering your SaaS Qlik Sense app to refresh. Even use Qlik Sense Apps as combined and transformed data, and watch on-prem folders for new or changed file-based data. This free utility is definitely a welcome addition to Qlik’s SaaS analytics platform.

The Future of Business Monitoring is Here & it's Autonomous

As the business world continues to integrate AI and machine learning to better manage big data processes, one area that arguably has benefitted the most is business monitoring. From IT management to business intelligence, the last few years have seen a drastic shift in how companies are monitoring their data.

What's All the Hype About? Iguazio Listed in Five 2020 Gartner Hype Cycles

We are delighted to announce that Iguazio has been named a sample vendor in the 2020 Gartner Hype Cycle for Data Science and Machine Learning, as well as four additional Gartner Hype Cycles for Infrastructure Strategies, Compute Infrastructure, Hybrid Infrastructure Services, and Analytics and Business Intelligence, among industry leaders such as DataRobot, Amazon Web Services, Google Cloud Platform, IBM and Microsoft Azure (some of whom are also close partners of ours).

Qlik and Fortune Launch "History of the Fortune Global 500"

Earlier this year, we launched a unique partnership with Fortune Magazine, with the first-ever data analytics site supporting the publication of the annual Fortune 500 list. Today, we extended that partnership with the debut of the “History of the Fortune Global 500,” our interactive data analytics site timed with the publication of the 30th anniversary of the Fortune Global 500 list.

How to achieve product-market fit

Imagine going to work only to find that your inbox is flooded with customers telling you how happy they are with your software. People are in such a hurry to download your app, you need to scale your servers to meet the demand before the infrastructure crashes. Your phone rings: it’s a tech journalist trying to book an interview with you about your company's growth. This is the dream for every business owner and entrepreneur. But the reality is often in stark contrast to the scenario above.

What's Your Streaming-Data Strategy?

Are you ready to harvest the massive real-time data that your organization generates? You need to master streaming data to gain an edge in business, in every industry. But most businesses still rely on batch and incremental processing. If that’s you, don’t despair. Join this session to understand key concepts, common technologies and best practices you need to succeed with streaming data. You will also learn about the Hitachi Vantara streaming data stack and how we can help you meet your goals.

Deliver Analytics-Ready Data to the Cloud With Snowflake and Hitachi Vantara

One of the toughest challenges for data professionals today is migrating data from on-premises environments to the cloud. Many companies still lack the tools and infrastructure to ingest and process complex datasets to achieve critical business outcomes. Tune in for a joint-session with Snowflake and Hitachi Vantara as we discuss best practices to address common edge-to-multicloud issues and how our joint offering can dramatically simplify data preparation, migration and analytics tasks to help deliver analytics-ready data in the cloud.

Make Your Keyboard Great Again! - User Story

We are all familiar with this scenario: you work on your training code, fix “all” of the bugs (the ones you know about), wait for a few iterations, see that the batch size wasn’t wrong and nothing blows up, and then you happily go home. However, when you come back into the office the next day and look at your loss and test accuracy, you’re horrified to find that the experiment crashed on the first test cycle because you pointed your test set to the wrong folder 🙁

Real-Time Cost Alerts and Forecasts for AWS

For many companies, cloud costs are among the top investments these days. With a growing number of services, instances and regions, cloud cost optimization is becoming increasingly painful. Companies use cloud management platforms to optimize costs and increase cloud visibility and security. But staying on top of AWS budgets requires proficiency, agility and time—especially when any glitch can result in massive cost bleeds.

Can reviews help your retail firm? Find out with Advanced Analytics.

Feedback is crucial to continuous improvement. When an employee wants to be more effective at their job, they would benefit from knowing what they’re good at and where they need to improve. The same goes for your products. To consistently offer quality products, knowing what your customers think is key. Customer reviews are a goldmine of valuable big data and insights that all retail organizations should tap into.

Our reflections on the 2020 Gartner Magic Quadrant for Data Quality Solutions

“Every organization — no matter how big or how small — needs data quality,” says Gartner in its newly published Magic Quadrant for Data Quality Solutions. However, with more and more data coming from more and more sources, it’s increasingly harder for data professionals to transform the growing data chaos into trusted and valuable data assets.

Harnessing Data in Motion for Government Agencies

Today’s public sector organizations and government agencies demand a new standard for communicating and sharing information. That includes data-rich content that moves through environments, networks and locales. From being stored, analyzed and shared, to quickly and effectively moving between environments, to spinning up in clusters and informing endless applications—data is more critical than ever.

Cloud, big data analytics & AI are driving change in Finance

How Cloud Computing is evolving alongside Big Data, Analytics, and AI in Financial Services. New technologies like Artificial Intelligence (AI), Cloud Computing, big data, and prescriptive analytics are changing the way the Financial Services sector does business. With evolving tech come new opportunities as well as different risks, and companies within the space must innovate and embrace new ideas as shifting business conditions and changing consumer preferences dictate new norms.

Getting Started with Cloudera Data Engineering on CDP

In this video we go over the Cloudera Data Engineering Experience, a new way for data engineers to easily manage Spark jobs in a production environment. We'll go over a few of the key features as well as a quick demo on how to launch your first simple Python ETL Spark job. You'll see how to schedule a job as well as analyze it once the run is complete.

Accelerating my COVID-19 DL project - User Story

The recent global pandemic caused by the COVID-19 virus has threatened the sanctity of our humanity and the well-being of our societies at large. Similar to times of war, the pandemic has also given us the opportunity to appreciate the things we take for granted, such as health workers, food suppliers, drivers, grocery store clerks and many others who are on the front lines keeping us safe at this difficult time. Salute!