Koyeb

Paris, France
2019
  |  By Yann Léger
Today, we’re excited to share that Serverless GPUs are available for all your AI inference needs directly through the Koyeb platform! We're starting with GPU Instances designed to support AI inference workloads including both heavy generative AI models and lighter computer vision models. These GPUs provide up to 48GB of vRAM, 733 TFLOPS and 900GB/s of memory bandwidth to support large models including LLMs and text-to-image models.
  |  By Julien Castets
Hey there! We're back for our second edition of Tips and Tricks. As we said in our first post on Drizzle ORM, our new Tips and Tricks mini blog series is going to share some helpful insights and cool tech that we've stumbled upon while working on technical stuff. Today, we're going to talk about the template databases of PostgreSQL. Remember, these posts will be super short reads. If you don’t like the topic of one of the posts, no problem! Just skip it and check out the next one.
  |  By Alisdair Broshar
To keep up with everything happening in the world of artificial intelligence, it helps to understand the key terms and concepts behind the technology. In this introduction, we are going to dive into what generative AI is, looking at the technology and the models it is built on. We'll discuss how these models are built, trained, and deployed into the world.
  |  By Julien Castets
Hey there! At Koyeb, we really like diving into technical stuff. But here’s the thing: not every cool thing we stumble upon or think about needs a massive blog post. And honestly, not everything we’re into is directly related to what Koyeb does or about infrastructure in general. So, we’ve got an idea: what if we start sharing these bits and pieces with you in a series of really short blog posts?
  |  By Alisdair Broshar
toddl.co is an all-in-one booking platform for kids' activities in Spain. Offering more than 2,000 classes, camps, and events to over 17,000 monthly visitors, toddl.co is on a mission to help parents navigate the complex world of extra-curricular activities and to help activity organizers manage and grow their businesses. For businesses, toddl.co streamlines bookings, payments, and client management. This year, toddl.co plans to serve over 15,000 businesses and 50,000 families.
  |  By Alisdair Broshar
Retrieval-augmented generation (RAG) is an AI framework and powerful approach in NLP (Natural Language Processing) where generative AI models are enhanced with external knowledge sources and retrieval-based mechanisms. These appended pieces of outside knowledge provide the model with accurate, up-to-date information that supplements the LLM’s existing internal representation of information. As the name suggests, RAG models have a retrieval component and a generation component.
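The two components named above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the sample documents, the function names (`retrieve`, `build_prompt`), and the bag-of-words cosine similarity, which stands in for the embedding-based search that real RAG systems typically use. The generation component is represented only by the prompt that would be handed to an LLM; no model is called.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase the text and strip basic punctuation from each word."""
    words = (w.strip(".,!?").lower() for w in text.split())
    return [w for w in words if w]


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, documents, k=1):
    """Retrieval component: return the k documents most similar to the query."""
    q = Counter(tokenize(query))
    ranked = sorted(
        documents,
        key=lambda d: cosine(q, Counter(tokenize(d))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query, documents, k=1):
    """Prepare the input for the generation component by prepending context."""
    context = "\n".join(retrieve(query, documents, k))
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"


# Hypothetical knowledge base, used only for this example.
DOCS = [
    "Singapore is an Asia-Pacific region.",
    "Autoscaling adjusts infrastructure to demand.",
    "Template databases are used to create new PostgreSQL databases.",
]

if __name__ == "__main__":
    print(build_prompt("What does autoscaling adjust?", DOCS))
```

In a full pipeline, the string returned by `build_prompt` would be sent to a language model, which generates the answer using the retrieved context alongside what it learned in training.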
  |  By Alisdair Broshar
Everything you need to deploy high-performance serverless apps is available in Singapore: Singapore is our first Asia-Pacific region in GA, joining Washington, D.C. and Frankfurt, Germany, EU in the GA club with all our services available in these 3 regions. The new Eco instances are now available in Singapore and allow you to start for only $1.61/month, billed per second.
  |  By Édouard Bonlieu
Today marks a monumental milestone: Autoscaling is now in public preview and available to all our users. Don't like to wake up in the middle of the night to scale up? Do you still have nightmares of the time you forgot to scale down your cloud infrastructure? Autoscaling is the answer: we adjust infrastructure to demand dynamically. Along with new regions, it was the most requested feature on our feedback platform.
  |  By Alisdair Broshar
Today, e-commerce has become much of the world's preferred way to shop, thanks to its convenience and accessibility. With more shoppers looking at online stores, small and large businesses alike need to establish their presence in the online marketplace. A key player in this transition is Shopify, a game-changing commerce platform that simplifies building, customizing, growing, and managing an online store.
  |  By Alisdair Broshar
In the face of Canada’s housing crisis, the University of British Columbia’s Housing Assessment Resource Tools (HART) set out to make government census data more accessible. By developing more intuitive tools and offering valuable resources, HART plays a key role in providing essential information for governments, housing developers, and the public to make informed, data-driven decisions.

Koyeb provides the fastest way to run web applications, APIs, and event-driven workloads across clouds with high performance and a developer-oriented experience. Koyeb dramatically reduces deployment time and operational complexity by removing server and infrastructure management for businesses and developers.

At Koyeb, we provide a unified experience to deploy, run, and scale your applications globally, with seamless support for Docker containers, native code, and functions. The platform provides:

  • An easy-to-use web interface to manage all your app deployments
  • Support for all kinds of services, including full web applications, APIs, event-driven serverless functions, background workers, and cron jobs
  • Full support of Docker containers
  • Git-driven deployment to build and deploy native code in Ruby, Node.js, Java, Python, Clojure, Scala, Go, Rust, PHP, or with a Dockerfile present in the repository.
  • A High-Performance Edge Network with a global CDN and powerful load-balancing across zones with automatic traffic geo-steering
  • A full service mesh with service discovery to deploy secure microservices and functions in seconds
  • Transparent deployment in fast, secure MicroVMs
  • The Koyeb CLI (Command Line Interface) to manage resources and automate directly from your terminal
  • An easy-to-use REST API to use Koyeb programmatically

Koyeb provides the fastest way to deploy apps globally with a developer-friendly serverless platform. No ops, servers, or infrastructure management.