Systems | Development | Analytics | API | Testing

Best Serverless GPU Platforms for AI Apps and Inference in 2025

The performance of your AI applications depends on your underlying infrastructure. Whether leveraging high-performance GPUs, accelerators, or CPUs, AI workloads require high-performance hardware. With a range of different GPUs and accelerators available, choosing the best one for your specific workload is critical. On top of selecting the best GPU for your workload's needs, efficiently running AI workloads in production and at scale is a challenge.

Orbit Codes: Achieving 10x Faster Deployments and Global Scale with Koyeb

Orbit Codes is solving how blockchain data is indexed and analyzed within the COSMOS ecosystem. It extracts raw blockchain data, transforms it into a structured PostgreSQL database, and provides easy access through a GraphQL interface. For deeper insights, data is replicated to ClickHouse, enabling advanced analytics on transactions, fees, and trends. Designed for blockchains, not just users, Orbit Codes offers white-label solutions and custom branding to integrate seamlessly into existing ecosystems.

Best Open Source Multimodal Vision Models in 2025

AI models are not just about LLMs and generating text. Multimodal vision models—which understand and generate images, videos, and even audio alongside text—are enabling new AI applications. At their core, multimodal vision models combine: There are several different types of multimodal vision models: vision-language models (VLMs) that generate text based on images, vision-reasoning models that answer complex questions based on images, and more.

Partnering with Vultr for Serverless & Global AI Deployments

Today, we are thrilled to announce our partnership with the Vultr Cloud Alliance. As you may already know, we provide a serverless cloud for developers and businesses to seamlessly deploy AI apps, inference endpoints, and APIs globally across GPUs, CPUs, and accelerators. Vultr provides high-performance cloud infrastructure and offers composable infrastructure that lets you deploy any stack, anywhere, in seconds.

eToro Accelerates Deployments for Real-Time Apps with Koyeb

eToro is a trailblazing social investing platform that has reshaped the way individuals engage with the stock market. In 2022, eToro acquired Bullsheet, a startup specializing in portfolio management tools designed exclusively for eToro that enable users to analyze the diversification of their portfolio. Bullsheet recently migrated services from AWS to Koyeb for its seamless deployment experience on high-performance infrastructure.

Tenstorrent Cloud Instances: Unveiling Next-Gen AI Accelerators

Today, we’re thrilled to announce the world premiere availability of Tenstorrent Instances via the Koyeb Serverless Platform. You can now access the Wormhole multi-chip solution in minutes to bring up and test frontiers of model inference performance. You've probably heard us say this: we're committed to bringing alternative accelerators to market to foster innovation in the AI infrastructure space.

Globula: Autoscaling seamlessly to 10,000 players and beyond

Globula is a geolocation-based augmented reality game that merges real-world exploration with multiplayer role-playing and storytelling. A science-fiction adventure, Globula combines real world player experience, multi-player mobile role playing strategy, and elaborate storytelling to immerse players in a thrilling game experience.

Best Open Source LLMs in 2025

Open source LLMs continue to compete with proprietary models on performance benchmarks for natural language tasks like text generation, code completion, and reasoning. Despite having fewer resources than closed models, these open LLMs offer cutting-edge AI without the high costs and restrictions of proprietary models. However, running these open-source models in production and at scale remains a challenge.

Deploy AI Infrastructure in 2025: Serverless GPUs, Autoscaling, Scale to Zero, and More!

We’re on a mission to simplify application deployment for developers and businesses worldwide, whether they're AI-driven models, full stack applications, APIs, or databases. Our next-generation serverless platform significantly accelerates your deployments and improves efficiency, enabling you to build more with less spend. 2024 was a major year for us, packed with crucial serverless milestones.

Autoscaling, Serverless GPUs, Croissants, and More! The 2024 Recap

We’re on a mission to simplify application deployment for developers and businesses worldwide. Our next-generation serverless platform enables you to deploy and scale AI workloads, full-stack applications, APIs, and more in seconds — without any complexity. 2024 was filled with major milestones in this journey: Autoscaling, scale to zero, new regions, faster deployments, Volumes and Snapshots, and so much more.