We collect the latest Development, Anaytics, API & Testing news from around the globe and deliver it direct to your inbox. One email per week, no spam.
By Erez Schnaider, Technical Product Marketing Manager, ClearML In a previous blog post, we demonstrated how easy it is to leverage Arm Neoverse-based Graviton instances on AWS to run training workloads. In this post, we’ll explore how ClearML simplifies the management and deployment of LLM inference using llama.cpp on Arm-based instances and helps deliver up to 4x performance compared to x86 alternatives on AWS. (Want to run llama.cpp directly?
Generative AI has rapidly transformed industries by enabling advanced automation, personalized experiences and groundbreaking innovations. However, implementing these powerful tools requires a production-first approach. This will maximize business value while mitigating risks. This guide outlines six best practices to ensure your generative AI initiatives are effective: valuable, scalable, compliant and future-proof.
For years, Cloudera’s platform has helped the world’s most innovative organizations turn data into action. As the AI landscape evolves from experiments into strategic, enterprise-wide initiatives, it’s clear that our naming should reflect that shift. That’s why we’re moving from Cloudera Machine Learning to Cloudera AI.
In a significant stride toward transforming AI infrastructure, ClearML has recently announced a collaboration with AMD. By integrating with AMD’s powerful hardware and open-source ROCm software with ClearML’s silicon-agnostic, end-to-end platform, we’re empowering IT teams and AI builders to innovate with ease across diverse infrastructures and integrate GPUs from multiple vendors.
In 2024, organizations realized the revolutionizing business potential of gen AI. They accelerated their gen AI operationalization processes: explored new use cases to implement, researched LLMs and AI pipelines and contemplated underlying ethical issues. And with the seeds of the AI revolution now planted, the market is maturing accordingly.
In this demo, we showcase how to leverage MLRun, Iguazio, and NVIDIA NIMs to deploy and monitor a generative AI model at scale, focusing on reducing risks and ensuring seamless performance. Using NVIDIA's NIMs, the demo demonstrates advanced methods in model monitoring, logging, and continuous fine-tuning.
LLMs are the foundation of gen AI applications. To effectively operationalize and de-risk LLMs and ensure they bring business value, organizations need to consider not just the model itself, but the supporting infrastructure, including GPUs and operational frameworks. By optimizing them to your use case, you can ensure you are using an LLM that is the right fit to your needs.
We’re thrilled to announce the release of a new Cloudera Accelerator for Machine Learning (ML) Projects (AMP): “Summarization with Gemini from Vertex AI”. An AMP is a pre-built, high-quality minimal viable product (MVP) for Artificial Intelligence (AI) use cases that can be deployed in a single-click from Cloudera AI (CAI). AMPs are all about helping you quickly build performant AI applications. More on AMPs can be found here.
Sorry for the click-bait title, but everyone is talking about AI Agents, and for a good reason. With the proliferation of LLMs, everyone – from software engineers using LLMs as a coding copilot to people using AI to plan vacations – is looking for new ways to use the technology that isn’t just answering questions or searching knowledgebases.