Systems | Development | Analytics | API | Testing

Technology

Production ML Capabilities Now Available In CDSW 1.8

With only about 35% of machine learning models making into production in the enterprise (IDC), it’s no wonder that production machine learning has become one of the most important focus areas for data scientists and ML engineers alike. As you may remember, we recently announced a full set of MLOps capabilities in Cloudera Machine Learning, our cloud native machine learning tool for the cloud.

Archive data from to S3 with the new Kafka Connect connector

The new open-source #ApacheKafka Connect sink connector for #S3 gives you full control on how to sink data to S3 and save money on long term storage costs in #Kafka. The connector has the ability to flush data out in a number of different formats including #AVRO, #JSON, #Parquet and #Binary as well as ability to create S3 buckets based on partitions, metadata fields and value fields.

Stop Using Kubernetes for ML-Ops; Instead use Kubernetes

If your company has already started getting into machine learning / deep learning, you will quickly relate to the following story. If your company is taking its first steps into data-science, here is what is about to be dropped on you. If none of the above strikes a chord, well it’s probably good to know what’s out there because data-science is all the rage now, and it won’t be long until it gets you too 🙂

Introducing The Open Source Program Office

Recently, we created the Sauce Labs Open Program Office to focus our attention internally on how we support and contribute to the open source community. Last week, we proudly launched a new web site with comprehensive information about the office, including best practices, contribution guidelines for the Sauce team, and a new blog where Diego Molina and Christian Bromann will write regularly about all things open source. This article is cross-posted from the new blog.

The best of Kafka Summit 2020

After a self-isolated and event-free spring, some of us around the world welcomed a more promising summer. You might be taking some time away on a socially distanced holiday. You might be taking some time away from the day-to-day at home. But if a cold beer in the sun isn't enough to make up for these difficult months, the premier event for the Streaming Data Community is back! Kafka Summit has gone virtual this year and that means you can attend the event from anywhere.