Systems | Development | Analytics | API | Testing

Latest Posts

ELK with Talend cloud

ELK is the acronym for three open source projects where E stands for Elasticsearch, L stands for Logstash and K stands for Kibana. ELK is a robust solution for log management and data analysis. In this blog, I am going to show you how to configure ELK while working with Talend Cloud. The blog will focus on Loading Streaming Data into Amazon ES from Amazon S3.

5 best practices to deliver trust in your data project: Tip #4 Empower organizations with modern tools

Traditional tools for managing data integrity, such as data quality, governance and stewardship tools, were targeted at the most skilled data experts. With the advent of social networks, machine learning and smart pattern recognition technologies, these tools are getting simpler at every release. They now allow anyone with market or customer knowledge to contribute and collaborate in a data governance effort.

5 best practices to deliver trust in your data project: Tip #3 Take ownership around a single source of trusted data

Would you imagine e-commerce without an electronic catalog or the web without search engines? Digital transformation requires single points of access to enable a wider range of people to access a wider range of information.

Data Privacy through shuffling and masking - Part 2

In the first part of this blog two-part series, we took a deep dive on Data Shuffling techniques aiming to mix up data and allowing to optionally retain logical relationships between columns. In this second part, we will now focus on Data Masking techniques as one of the main approach to guarantee Data Privacy.

5 best practices to deliver trust in your data project : Tip #2 control your data wherever

Whenever an IT system, application or personal productivity tool is used inside an organization without explicit organizational approval, we talk about shadow IT. Shadow IT is not only a security and compliance nightmare, it creates a data sprawl where each group can create its data silos.

Generating a Heat Map with Twitter data using Pipeline Designer - Part 3

If you have got through part 1 and part 2 of this series of blogs, there are only a few more steps to carry out before you can see the end to end flow of data and create your Heatmap. If you have not read the first two blogs, the links to the blogs are above. Although these blogs have been quite lengthy, I hope you understand that I have tried to make sure that any level of experience can achieve this. Since Pipeline Designer is a new product, I felt that it made sense to be as explicit as possible.

The Privacy Hazard in High Tech Heritage

DNA kits like 23andMe, Helix and AncestryDNA topped holiday gift guides again this past year. Kits range in the market from $60 to $200, and they’re meant to help consumers understand family history, genealogy and can even connect unknown family members. Collecting genetic data can also have broader impacts in healthcare and justice for law enforcement.