Retrieving Twitter data in real-time

Using a Talend route and a Talend job, Richard demonstrates how you can retrieve Twitter data in real-time. This week, Richard is using Talend to collect near real-time Twitter data around the Australian Grand Prix. Build and use a Talend route and a Talend job to do the same. This job would work for just about anything that is happening now: a concert, an event, a game, etc. Forget hashtags – collect near real-time Twitter data around any current event.

Advantages of Data Warehouse Integration | Integrate.io

Our main points: For any business, there are multiple advantages of data warehouse integration. For instance, a data warehouse is the primary source of Business Intelligence (BI). Some of the most common reasons why an organization chooses to warehouse data include: Integrate.io offers the best ETL, ELT, and blazing fast change data capture (CDC) so your Ecommerce API integrations work for you. Email us at hello@integrate.io to learn more.

Announcing new BigQuery capabilities to help secure sensitive data

In order to better serve their customers and users, digital applications and platforms continue to store and use sensitive data such as Personally Identifiable Information (PII), genetic and biometric information, and credit card information. Many organizations that provide data for analytics use cases face evolving regulatory and privacy mandates, ongoing risks from data breaches and data leakage, and a growing need to control data access.

Introducing Firehose: An open source tool from Gojek for seamless data ingestion to BigQuery and Cloud Storage

Indonesia’s largest hyperlocal company, Gojek, has evolved from a motorcycle ride-hailing service into an on-demand mobile platform, providing a range of services that include transportation, logistics, food delivery, and payments. A total of 2 million driver-partners collectively cover an average distance of 16.5 million kilometers each day, making Gojek Indonesia’s de facto transportation partner.

Machine Learning Experiment Tracking from Zero to Hero in 2 Lines of Code

In your machine learning projects, have you ever wondered “why is model Y performing better than model Z, which dataset was model Y trained on, what are the training parameters I used for model Y, and what are the model performance metrics I used to select model Y?” Does this sound familiar to you? Have you wondered if there is a simple way to answer the questions above? Data science experiments can get complex, which is why you need a system to simplify tracking.

5 Highlights from Snowflake Summit 2022

It’s that time of year again. Conference season is upon us! And, for the first time in what feels like a lifetime, the data ecosystem is getting back together in person. It couldn’t come at a more important time. The decade of data is upon us, as we unveiled at our own customer conference Beyond 2022. The opportunity is greater than ever before. So, too, is the need to change.

Making the World's AWS Bills Less Daunting

Armed with a Ph.D. from UC San Diego, our guest started off with internships at Google and Microsoft before gaining valuable experience as a VP and a highly sought-after consultant for startups and SMBs. Now he’s one of the world’s foremost experts on wrangling vast data sets and maximizing efficiency.