LLM Evaluation and Testing for Reliable AI Apps
As LLMs become central to AI-driven products such as copilots and customer support chatbots, data science teams need to verify that the LLM actually performs well for their specific use case. LLM evaluation is the process of testing and measuring model behavior so that teams can maintain reliability, safety, and performance in production AI systems. In this guide, we explore how to approach evaluation across the development and production lifecycles, which frameworks to use, and how the integration between open-source MLRun and Evidently AI enables more scalable, structured testing.
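To make "LLM evaluation" concrete before diving in, here is a minimal sketch of one structured evaluation check using Evidently's Report API. It assumes the evidently ~0.4 interface; the dataset, column names, and descriptor choices are illustrative assumptions for this sketch, not examples taken from MLRun or from the rest of this guide.

```python
# A minimal sketch of a structured LLM evaluation check with Evidently
# (assuming the evidently ~0.4 API; data and column names are illustrative).
import pandas as pd
from evidently import ColumnMapping
from evidently.report import Report
from evidently.metric_preset import TextEvals
from evidently.descriptors import TextLength, Sentiment

# Hypothetical eval set: user questions and the chatbot's generated responses.
eval_df = pd.DataFrame({
    "question": ["How do I reset my password?", "What are your support hours?"],
    "response": [
        "Go to Settings > Security and click 'Reset password'.",
        "Our support team is available 24/7 via chat and email.",
    ],
})

# Score each response row-by-row with descriptors. TextLength has no extra
# dependencies; Sentiment requires NLTK's vader_lexicon to be downloaded.
report = Report(metrics=[
    TextEvals(column_name="response", descriptors=[TextLength(), Sentiment()]),
])
report.run(
    reference_data=None,
    current_data=eval_df,
    column_mapping=ColumnMapping(text_features=["question", "response"]),
)
report.save_html("llm_eval_report.html")  # inspect per-response scores in a browser
```

The same pattern scales from ad-hoc notebook checks like this one to scheduled evaluation jobs, which is where the MLRun integration discussed later in this guide comes in.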