Machine Learning

Unlocking the value of unstructured data at scale using BigQuery ML and object tables

Oct 20, 2022 By Candice Chen In Google BigQuery

Most commonly, data teams have worked with structured data. Unstructured data, which includes images, documents, and videos, will account for up to 80 percent of data by 2025. However, organizations currently use only a small percentage of this data to derive useful insights. One of the main ways to extract value from unstructured data is by applying ML to it.

Read Post

How to Get Started with ClearML's Hyper-Datasets

Oct 13, 2022 By Victor Sonck In ClearML

In this blog post, we’ll be taking a closer look at Hyper-Datasets, which are essentially a supercharged version of ClearML Data.

Read Post

How to Run Workloads on Spark Operator with Dynamic Allocation Using MLRun

Oct 11, 2022 By Xingsheng Qian In Iguazio

Since the Apache Spark 3.1 release in early 2021, the Spark on Kubernetes project has been production-ready, and Spark on Kubernetes has become the new standard for deploying Spark. In the Iguazio MLOps platform, we built the Spark Operator into the platform to make deploying it much simpler.

Read Post

MLRun Tutorial: How to Train, Compare, and Register Models

Oct 11, 2022 By Iguazio In Iguazio

View Video

MLRun Tutorial: Serving Pre-Trained ML and DL Models

Oct 11, 2022 By Iguazio In Iguazio

View Video

How to Accelerate HuggingFace Throughput by 193%

Oct 8, 2022 By ClearML In ClearML

Deploying models is becoming easier every day, especially thanks to excellent tutorials like Transformers-Deploy, which explains how to convert and optimize a Hugging Face model and deploy it on the Nvidia Triton inference engine. Nvidia Triton is an exceptionally fast and solid tool and should be very high on the list when searching for ways to deploy a model. Our developers know this, of course, so ClearML Serving uses Nvidia Triton on the backend if a model needs GPU acceleration.

Read Post

Is AI/ML Transforming the Banking Industry?

Oct 7, 2022 By Cigniti Technologies In Cigniti

Artificial Intelligence (AI) is powerful, constantly evolving, and currently knows no bounds. It keeps pushing past its limits using the power of Machine Learning (ML). AI empowers computers to do things that human beings cannot do efficiently and effectively, and machine learning helps computers do so by breaking the rules of traditional programming.

Read Post

AI Infrastructure Alliance: Transforming Snowflake into an MLOps 'Feature Factory' using Iguazio

Oct 6, 2022 By Iguazio In Iguazio

A demo showing how to use our feature store in conjunction with Snowflake. Focusing on.

View Video

How to Do Data Labeling, Versioning, and Management for ML

Oct 3, 2022 By ClearML In ClearML

Months ago, Toloka and ClearML came together to create this joint project. Our goal was to show other ML practitioners how to first gather data and then version and manage it before it is fed to an ML model. We believe that following these best practices will help others build better and more robust AI solutions. If you are curious, have a look at the project we created together.

Read Post

How to Distribute Machine Learning Workloads with Dask

Oct 3, 2022 By Jacob Bengtson In Cloudera

Tell us if this sounds familiar. You’ve found an awesome data set that you think will allow you to train a machine learning (ML) model that will accomplish the project goals; the only problem is that the data is too big to fit in the compute environment you’re using. In this day and age of “big data,” most might think this issue is trivial, but like anything in the world of data science, things are hardly ever as straightforward as they seem.

Read Post