Systems | Development | Analytics | API | Testing

Operating Apache Kafka with Cruise Control

There are two big gaps in the Apache Kafka project when we think of operating a cluster. The first is monitoring the cluster efficiently and the second is managing failures and changes in the cluster. There are no solutions for these inside the Kafka project but there are many good 3rd party tools for both problems. Cruise Control is one of the earliest open source tools to provide a solution for the failure management problem but lately for the monitoring problem as well.

Cloudera and NVIDIA Help IRS Fight Fraud, Safeguard Taxpayers

Across the federal government, agencies are struggling to identify, organize, analyze, and act on troves of data. It’s a problem that leaders are working actively to tackle, but they’re in a race against immeasurable volumes of data that is continuously being generated in perpetuity in stores known and unknown. At the Internal Revenue Service, decades’ worth of data exceeds even the most cutting-edge processing capabilities.

Enabling Multi-User Fine-Grained Access Control for Cloud Storage in CDP

Shared Data Experience (SDX) on Cloudera Data Platform (CDP) enables centralized data access control and audit for workloads in the Enterprise Data Cloud. The public cloud (CDP-PC) editions default to using cloud storage (S3 for AWS, ADLS-gen2 for Azure). This introduces new challenges around managing data access across teams and individual users. To solve these challenges for S3 and ADLS-gen2, Cloudera has introduced a new service — the Ranger Authorization Service (RAZ).

Use Cases for Reverse ETL

According to Gartner, leading organizations in every industry are wielding data and analytics as competitive weapons. Companies that leverage data as a competitive differentiator will stand the best chance of acting faster on opportunities and responding to threats in a competitive marketplace. The problem is that most companies aren’t aware of the value of their data. As a result, they aren’t leveraging the full potential of their data to make informed decisions.

The 7 Critical Differences Between DynamoDB vs MongoDB:

MongoDB vs DynamoDB: How do you choose between them? Whether you are a two-man team bootstrapping a proof of concept or an established one battling with high throughput and heavy load; this post can serve as a guidepost in your decision process. Before going into the details, a brief history lesson on how these technologies emerged is pertinent; you must understand the optimal conditions for running these systems and how they operate in the wild before making an informed choice.

The Pros and Cons of Application Software Integration

Gartner predicts that by 2023, organizations that promote data sharing will outperform their peers on most business value metrics. According to Debra Logan, Gartner’s Research Vice President, “Data sharing is the way to optimize higher quality data and more robust data and analytics to solve business challenges and goals.” Given these numbers, it is clear that businesses will need to embrace application software integration as a core business strategy.

Client Reporting 101: Tips and Best Practices for Agencies and Freelancers

Communication‌ ‌is‌ ‌the‌ ‌key‌ ‌factor‌ ‌for‌ ‌a‌ ‌good‌ ‌relationship‌ ‌with‌ ‌your‌ ‌clients.‌ ‌ Here‌ ‌are‌ ‌the‌ ‌best‌ ‌client‌ ‌reporting‌ ‌practices‌ ‌to‌ ‌help‌ ‌you‌ ‌showcase‌ ‌the‌ results‌ ‌you‌ &#