Analytics

Are Your Machine Learning Models Wrong?

In addition to the very real negative impact on every person around the world, the COVID-19 pandemic is driving business disruptions and closures at an unprecedented scale. Enormous government stimulus programs are resulting in explosions in fiscal deficits, regulators are relaxing capital constraints on banks and central banks are supporting economic stability with a range of interest rate cuts and other stimulus measures.

5 Challenges of Building Data Applications

Fast-growing software companies are building data applications for a variety of uses, from marketing apps that provide customer insights, to IoT apps that handle device feedback, and data analytics apps that process both historical and near real-time data. But developers often face obstacles when building, designing, and supporting applications that need to parse large volumes of information.

Databases Demystified Lesson 1 Introduction to Databases and SQL

In the first episode of Databases Demystified with Michael Kaminsky, we give a high-level overview of the most important concepts in databases. We start with a brief history of databases going from the invention of relational databases through present day and we talk about the differences between analytical and transactional databases, distributed and single-node databases, and in-memory vs on-disk databases We finish up talking briefly about SQL and what makes it special.

Databases Demystified Lesson 4: Transactions Part 1

In this episode of Michael Kaminksy's Databases Demystified, we learn all about what a transaction is, and what ACID means. Learn why database constraints are important, and what the commands "begin" "commit" and "rollback" mean. We talk about atomicity, consistency, isolation, and durability and why transactions are so important.

Databases Demystified Lesson 6: Distributed Databases Part 1

Welcome to episode 6 of Michael Kaminsky's Databases Demystified. In this lesson, we introduce a fascinating and incredibly important topic: distributed databases. We discuss "nodes" and "clusters" and we cover the two major paradigms in distributed databases: big-compute databases and high-availability databases.

Databases Demystified: Lesson 7: Distributed Databases Part 2

Episode 7 of Michael Kaminsky's Databases Demystified. Learn about new issues we face in distributed databases and all about the CAP theorem. We'll talk about leader and follower nodes, what happens when distributed databases lose connection with a node, and what CAP stands for: consistency, availability, and partition tolerance.

Databases Demystified Lesson 3: Row vs Column Store

In Michael Kaminsky's third episode, we learn about the differences in row store vs column store database. This is a very important concept for understanding the difference between analytical and transactional databases, and we talk about the tradeoffs between using row and column stores for saving the data. Michael gets into the weeds and talks about disk blocks and the different types of queries that work well for row and column stores.