Systems | Development | Analytics | API | Testing

Episode 11: The future of data lakes: Open table formats, metadata and AI | AWS

Paul Meighan, Director of Product Management at AWS, shares how enterprises are increasingly looking for ways to integrate more data sources in their environment — especially with data lakes. From turning S3 buckets into databases to establishing better metadata layers, Meighan explores the rapid evolution of data lakes alongside data warehouses. He also explains the pivotal role AI, ML and GenAI workloads and applications will play in large metadata environments, driving innovative analytics and business insights.

RAG Application with Kong AI Gateway, AWS Bedrock, Redis and LangChain

For the last couple of years, Retrieval-Augmented Generation (RAG) architectures have become a rising trend for AI-based applications. Generally speaking, RAG offers a solution to some of the limitations in traditional generative AI models, such as accuracy and hallucinations, allowing companies to create more contextually relevant AI applications.

Using Moesif with Middy and Serverless for AWS Apps

See the GitHub repository for the source code of this article’s example project. Serverless is a popular framework to build serverless apps using AWS Lambda on the Node.js runtime. Serverless automatically orchestrates necessary resources on AWS and can scaffold a basic project for you that you can build up on. You can solely focus on your application’s core logic, development, and your Lambda functions.

Optimize Your AWS Data Lake with Streamsets Data Pipelines and ChaosSearch

Many enterprises face significant challenges when it comes to building data pipelines in AWS, particularly around data ingestion. As data from diverse sources continues to grow exponentially, managing and processing it efficiently in AWS is critical. Without these capabilities, it’s harder to analyze and get any meaning from your data.

Unleashing the Power of Amazon Redshift Analytics

Table of Contents Amazon Redshift has become one of the most popular data warehousing solutions due to its scalability, speed, and cost-effectiveness. As the data landscape continues to evolve, businesses are generating and data processing increasingly large datasets. Efficient analysis of these datasets is essential to making informed, data-driven decisions. Amazon Redshift allows companies to extract meaningful insights from vast amounts of structured and semi-structured data.

The Cloud Exit: Cost, Security, and Performance Driving the Move Back to On-Premises

The last decade has seen a giant shift by organizations into the cloud for software, storage, and compute, resulting in business benefits ranging from flexibility and lower up-front costs to easier maintenance. But lately we have seen more and more companies re-evaluating their cloud strategies and opting to move their data back to on-premises infrastructure due to several key factors.

Katalon Delivers End-to-End Test Automation Solution on AWS Marketplace

“We build our solution on top of an AWS stack. It frees us up to think about creating innovative solutions and not worry about infrastructure”- Vu Lam, CEO of Katalon In order to help enterprise customers release faster and deliver on time, Katalon has built an integration where automation scripts can be stored in AWS CodeCommit and an AWS CodePipeline project can execute them using the integration.

How to source data from AWS DynamoDB to Confluent using the Open-Source Connector

This is a one-minute video showing an animated architectural diagram of an integration between Amazon DynamoDB and Confluent Cloud using an open-source Kafka connector. The integration allows you to avoid maintaining custom code, and gives you the ability to automatically discover and adapt to changes in DynamoDB tables. All details are provided.

Volumes: High IOPS and Low Latency NVMe SSDs Public Preview

It’s the final day of Koyeb's Launch Week Round 2, and today we’re announcing the public preview of Volumes! Volumes on Koyeb are blazing-fast NVMe SSD you can use to persist data across deployments. After announcing Volumes in technical preview a few months ago, we are now opening the preview to all users on the Starter plan! Offering high throughput and low latency, Volumes open the door to a wide range of new workloads and use cases to handle the state of your applications.