Systems | Development | Analytics | API | Testing

Orchestrating Multi-Agent Workflows with MCP & A2A

Multi-agent workflows are the latest technological gen AI advancements. In this blog, we explore how to develop such systems, overcome operational challenges, improve system observability, and enable seamless collaboration between agents in complex AI pipelines. We’ll cover architecture, A2A and MCP protocols and introduce Google Cloud’s agentic marketplace.

LLM Evaluation and Testing for Reliable AI Apps

As LLMs become central to AI-driven products like copilots and customer support chatbots, data science teams need to ensure the LLM performs well for the use case. The process of LLM evaluation ensures reliability, safety and performance in production AI systems. In this guide, we explore how to approach evaluations across development and production lifecycles, what frameworks to use, and how the integration between open-source MLRun and Evidently AI enables more scalable, structured testing.