Fraud Detection with Cloudera Stream Processing Part 1

Cloudera

In a previous blog in this series, Turning Streams Into Data Products, we talked about the increased need for reducing the latency between data generation/ingestion and producing analytical results and insights from this data. This blog will be published in two parts.

Cloudera DataFlow Designer: The Key to Agile Data Pipeline Development

Cloudera

We just announced the general availability of Cloudera DataFlow Designer, bringing self-service data flow development to all CDP Public Cloud customers. In our previous DataFlow Designer blog post, we introduced you to the new user interface and highlighted its key capabilities.

Agile 81

Trending Sources

Generating and Viewing Lineage through Apache Ozone

Cloudera

Follow your data in object storage on-premises. With Apache Ozone on the Cloudera Data Platform (CDP), they can implement a scale-out model and build out their next generation storage architecture without sacrificing security, governance and lineage. Ozone stores data as objects which live inside these buckets.

Storage 103

Scaling Kafka Brokers in Cloudera Data Hub

Cloudera

This blog post will provide guidance to administrators currently using or interested in using Kafka nodes to maintain cluster changes as they scale up or down to balance performance and cloud costs in production deployments. Kafka brokers contained within host groups enable the administrators to more easily add and remove nodes.

Data 77

Machine Learning with Python, Jupyter, KSQL and TensorFlow

Confluent

Building a scalable, reliable and performant machine learning (ML) infrastructure is not easy. It allows real-time data ingestion, processing, model deployment and monitoring in a reliable and scalable way.

How to Connect KSQL to Confluent Cloud using Kubernetes with Helm

Confluent

Confluent Cloud, a fully managed, cloud-native event streaming service that extends the value of Apache Kafka®, is simple, resilient, secure, and performant, allowing you to focus on what is important: building contextual event-driven applications, not infrastructure. KSQL and Kafka Connect example. and Helm/Tiller 2.8.2+

Cloud 93

How 3 Companies Implemented Distributed Tracing for Better Insight into Their Systems

Honeycomb

By instrumenting their code, users can parse large amounts of high-cardinality data that can then be further parsed and analyzed at extremely granular levels. Its rate for flaky tests, which return both a passing and failing result despite no changes in the code, climbed as high as 50%. Enthusiasm increased.

System 57