Remove what-is-an-apache-kafka-cluster
article thumbnail

Streaming Ingestion for Apache Iceberg With Cloudera Stream Processing

Cloudera

Recently, we announced enhanced multi-function analytics support in Cloudera Data Platform (CDP) with Apache Iceberg. The CSP engine is powered by Apache Flink, which is the best-in-class processing engine for stateful streaming pipelines. Iceberg is a high-performance open table format for huge analytic data sets.

Analytics 113
article thumbnail

Streams Replication Manager Prefixless Replication

Cloudera

Streams Replication Manager (SRM) is an enterprise-grade replication solution that enables fault tolerant, scalable, and robust cross-cluster Kafka topic replication. SRM replicates data at high performance and keeps topic properties in sync across clusters. ACL and configuration changes are not synced across mirrored clusters.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

A Closer Look at The Next Phase of Cloudera’s Hybrid Data Lakehouse

Cloudera

Cloudera is now the only provider to offer an open data lakehouse with Apache Iceberg for cloud and on-premises. Apache Ozone As AI and other advanced analytics continue to grow in scale, performance and scalable data storage will need to expand right along with them. But even with its rise, AI is still a struggle for some enterprises.

Data 87
article thumbnail

Fraud Detection With Cloudera Stream Processing Part 2: Real-Time Streaming Analytics

Cloudera

In part 1 of this blog we discussed how Cloudera DataFlow for the Public Cloud (CDF-PC), the universal data distribution service powered by Apache NiFi, can make it easy to acquire data from wherever it originates and move it efficiently to make it available to other applications in a streaming fashion.

article thumbnail

Let’s Flink on EKS: Data Lake Primer

OpenCredo

Here at OpenCredo we love projects that are based around Kafka and/or Data/Platform Engineering; in one of our recent projects, we created an open data lake using Kafka, Flink, Nessie and Iceberg. The first part of this blog is related to the Flink and S3 infra design. to now include a Kubernetes Operator.

Data 59
article thumbnail

Scaling Kafka Brokers in Cloudera Data Hub

Cloudera

This blog post will provide guidance to administrators currently using or interested in using Kafka nodes to maintain cluster changes as they scale up or down to balance performance and cloud costs in production deployments. Kafka brokers contained within host groups enable the administrators to more easily add and remove nodes.

Data 78
article thumbnail

Making API Requests With the Kafka REST Proxy

Instaclustr

Why Use the Kafka REST Proxy. Apache Kafka is known best as a powerful, open source message streaming and queueing solution. Used across more than 80% of the Fortune 500, Kafka is ubiquitous in supporting event-driven architectures. Read Now: Apache Kafka Architecture – The complete guide.