Systems | Development | Analytics | API | Testing

Databases

Databases Demystified Lesson 6: Distributed Databases Part 1

Welcome to episode 6 of Michael Kaminsky's Databases Demystified. In this lesson, we introduce a fascinating and incredibly important topic: distributed databases. We discuss "nodes" and "clusters" and we cover the two major paradigms in distributed databases: big-compute databases and high-availability databases.

Databases Demystified: Lesson 7: Distributed Databases Part 2

Episode 7 of Michael Kaminsky's Databases Demystified. Learn about new issues we face in distributed databases and all about the CAP theorem. We'll talk about leader and follower nodes, what happens when distributed databases lose connection with a node, and what CAP stands for: consistency, availability, and partition tolerance.

Databases Demystified Lesson 3: Row vs Column Store

In Michael Kaminsky's third episode, we learn about the differences in row store vs column store database. This is a very important concept for understanding the difference between analytical and transactional databases, and we talk about the tradeoffs between using row and column stores for saving the data. Michael gets into the weeds and talks about disk blocks and the different types of queries that work well for row and column stores.

Operational Database NoSQL and Related Capabilities

This blog post is part of a series on Cloudera’s Operational Database (OpDB) in CDP. Each post goes into more details about new features and capabilities. Start from the beginning of the series with, Operational Database in CDP. This blog post gives you an overview of the NoSQL, component integration, and object store support capabilities of OpDB.

Using Mnesia in an Elixir Application

In today’s post, we’ll learn about Mnesia, see when you would use such a tool, and take a look at some of the pros and cons of using it. After covering the fundamentals of Mnesia, we’ll dive right into a sample application where we’ll build an Elixir application that uses Mnesia as its database. Let’s jump right in!

PostgreSQL vs MySQL: Which one should you choose?

Bonus Material: PostgreSQL vs MySQL complete comparison table ‍ PostgreSQL (or Postgres) and MySQL are both relational database management systems (RDBMS for short). They are complex technological inventions designed to simplify your data operations across a wide variety of business use cases. The “relational” part of the name refers to the way in which they structure data as relations between rows and columns.

Happy Birthday Apache HBase! 10 years of resilience, stability, and performance

Apache HBase became a top-level project with Apache 10 years ago and Cloudera began contributing to it at the same time (2010). Over this time, it has become one of the largest and most popular open-source tools in big data and one of the most popular NoSQL databases.