Classifying DNA Sequences into Gene Families on SageMaker

The cost of DNA sequencing continues to decline exponentially. With the average cost of sequencing mammalian DNA hovering around $1,000 at the beginning of 2023, companies like Ultima Genomics and Illumina are working to decrease the cost to between $100 and $200. That’s about the same as a new pair of Brooks running shoes! As the sequencing cost drops, the quantity of genetic data to study and analyze explodes, making it even more important to leverage machine learning techniques.

To Data Fabric or not to Data Fabric, is it really a question?

Data fabric is a term used to describe a set of technologies and practices that enable organizations to manage and access data across multiple platforms and environments. This includes supporting an organization’s need to break down data silos, gain more insight into metadata, and optimize data sharing across apps and data platforms. Organizations are starting to explore more flexible ways of managing their data ecosystems and ensuring they can leverage data more effectively.

ThoughtSpot and Databricks make governed, self-service analytics a reality with new Unity Catalog integration

Two years ago, we announced our Databricks partnership—including the launch of ThoughtSpot for Databricks, which gives joint customers the ability to run ThoughtSpot search queries directly on the Databricks Lakehouse without the need to move any data. Since then, we’ve empowered teams at companies like Johnson & Johnson, NASDAQ, and Flyr to safely self-serve business-critical insights on governed and reliable data.

Enabling Strong Engineering Practices at Maersk

As DataOps moves along the maturity curve, many organizations are deciphering how to best balance the success of running critical jobs with optimized time and cost governance. Watch the fireside chat from Data Teams Summit where Mark Sear, Head of Data Platform Optimization for Maersk, shares how his team is driving towards enabling strong engineering practices, design tenets, and culture at one of the largest shipping and logistics companies in the world.

Maximize Business Results with FinOps

As organizations run more data applications and pipelines in the cloud, they look for ways to avoid the hidden costs of cloud adoption and migration. Teams seek to maximize business results through cost visibility, forecast accuracy, and financial predictability. Watch the breakout session video from Data Teams Summit and see how organizations apply agile and lean principles using the FinOps framework to boost efficiency, productivity, and innovation. Transcript available below.

An Overview of Streaming Analytics in AWS for Logging Applications

Streaming analytics in AWS gives enterprises the ability to process and analyze log data in real time, enabling use cases that range from delivering personalized customer experiences to anomaly and fraud detection, application troubleshooting, and user behavior analysis. In the past, real-time log analytics solutions could process just a few thousand records per second, and it would still take minutes or hours to process the data and get answers.

Getting Up to Speed on Snowpark for Python with Educational Services

In today's livestream, Evan Troyka and Melanie Klein will introduce the 1-day Snowpark DataFrame Programming course on Snowflake. The course covers concepts, features, and programming constructs intended for practitioners building DataFrame data solutions in Snowflake.