Systems | Development | Analytics | API | Testing

Deploying Gen AI in Production with NVIDIA NIM & MLRun

In less than three years, gen AI has become a staple technology in the business world. In November of 2022, OpenAI launched ChatGPT, with explosive growth of over 1 million users in just five days, galvanizing the widespread use of gen AI. Over the course of 2023 enterprises entered the experimentation stage and kicked off POCs with API services and open models including Llama 2, Mistral, NVIDIA and others.

MLRun v1.8 Now Available: Smarter Model Monitoring, Alerts and Tracking

We’re proud to announce that the next version of MLRun has been released to community users. On the heels of MLRun v1.7’s focus on monitoring, MLRun v1.8 adds features to make LLM and ML evaluation and monitoring more accessible, practical and resource-efficient. New Highlights: MLRun is an open-source AI orchestration tool that provides AI practitioners with capabilities to accelerate and streamline the development, deployment and management of gen AI and ML applications.

The Future of AI Monitoring: How to Address a Non-Negotiable, Yet Still Developing, Requirement

Generative AI models are not just tools for producing text, audio or video—they're systems that learn patterns, improvise, and generate unexpected outcomes. When we look at LLMs, we're struck by their capacity to generate surprisingly creative and context-aware results. They can weave coherent narratives, propose novel solutions, mimic human conversation, and even offer nuanced insights across a wide range of topics. While this creativity is their strength, it also introduces variability and risk.

Building Agent Co-pilots for Proactive Call Centers

Gen AI call center co-pilots can provide enterprises with operational visibility and insights while automating repetitive tasks, to improve the customer experience. In this session, we’ll show how a large health insurance provider implemented an agentic co-pilot designed scale across multiple call centers and environments. To dive deep into the architecture and see a demo of the co-pilot, you can watch the webinar this blog is based on.

What's New in ClearML v3.25: Vector Database support, Smarter Orchestration, and UI Enhancements

ClearML v3.25 introduces native support for vector databases within the Hyper-Datasets feature. This release enables users to store and search embeddings directly inside ClearML, opening the door to powerful custom RAG pipelines. In addition, v3.25 includes expanded orchestration metrics, new Application Gateway UI, and a range of UI upgrades to streamline day-to-day operations.

How to Achieve Secure, Scalable Multi-tenancy for GPU Infrastructure

By Erez Schnaider, Technical Product Marketing Manager, ClearML In a previous blog post, we laid the foundations for understanding multi-tenancy in GPU-access infrastructure and highlighted its critical importance. In this post, we’ll dive into ClearML’s approach to achieving secure and efficient multi-tenancy. At a high level, multi-tenancy refers to the ability to share a single resource pool, typically GPU or CPU clusters, across multiple, logically isolated entities known as tenants.

Best 10 Free Datasets for Manufacturing [UPDATED]

The manufacturing industry can benefit from AI, data and machine learning to advance manufacturing quality and productivity, minimize waste and reduce costs. With ML, manufacturers can modernize their businesses through use cases like forecasting demand, optimizing scheduling, preventing malfunctioning and managing quality. These all significantly contribute to bottom line improvement.