Systems | Development | Analytics | API | Testing



Distributed tracing with Envoy, Kuma, Grafana Agent, and Jaeger

As a cloud service provider, observability is a critical subject as it's strongly related to the availability of the services running on the platform. We need to understand everything that is happening on our platform to troubleshoot errors as fast as possible and improve performance issues. A year ago, while the platform was still in private beta, we faced a tough reliability issue: users were facing random 500 errors when accessing their applications.


Stop Using TCP Health Checks for Kubernetes Applications

As developers, one of the most important things we can consider when designing and building applications is the ability to know if our application is running in an ideal operating condition, or said another way: the ability to know whether or not your application is healthy. This is particularly important when deploying your application to Kubernetes. Kubernetes has the concept of container probes that, when used, can help ensure the health and availability of your application.

Sponsored Post

Datadog & Speedscale: Improve Kubernetes App Performance

By combining traffic replay capabilities from Speedscale with observability from Datadog, SRE Teams can deploy with confidence. It makes sense to centralize your monitoring data into as few silos as possible. With this integration, Speedscale will push the results of various traffic replay conditions into Datadog so it can be combined with the other observability data. Being able to preview application performance by simulating production conditions allows better release decisions. Moreover, a baseline to compare production metrics can provide even earlier signals on degradation and scale problems. Speedscale joined the Datadog Marketplace so customers can shift-left the discovery of performance issues.


Establish a Code Ownership Loop with Collaborator and Bugsnag

This blog is derived from the webinar, “Accelerate Releases Through Code Ownership with Collaborator and Bugsnag”, focused on establishing a culture of code ownership and its benefits through the lens of the SmartBear tools Collaborator and Bugsnag. Taking a line from the SmartBear 2021 Annual State of Software Quality Report: "Quality is top of the mind for every individual and every team.


Process Monitoring in N|Solid [2/10] The best APM for Node, layer by layer.

When we are executing an application with a significant number of processes, we cannot afford to stop the operation to review what is happening outside of production, for this reason, a tool that allows us to have greater observability and a level of detail is key in the management of our project.


Project & Applications Monitoring in N|Solid [1/10] The best APM for Node, layer by layer.

Imagine you are responsible for the health of your node applications and you have N|Solid in place, you would regularly check the N|Solid Console to review your Projects and Applications to quickly view how well things are running.

App Performance Monitoring with Rasmus Larsson | Mobile DevOps is a Thing! Podcast

Rasmus is the Product Strategy Director at Pulselive, a Sony company that has been crafting digital experiences to connect teams, leagues, and sports federations to their fans for over a decade. The mobile team’s main motivation is to continuously build the best possible sports apps with the best UI, the best UX, built on the best code. They are doing all these things for an impressive list of clients, including AFL, Premier League, or Liverpool FC.