Applying Fine Grained Security to Apache Spark

Cloudera

Fine-grained access control (FGAC) with Spark. The challenges of arbitrary code execution notwithstanding, there have been attempts to provide a stronger security model, but with mixed results. One approach is to use third-party tools (such as Privacera) that integrate with Spark.

How to Use Apache Iceberg in CDP’s Open Lakehouse

Cloudera

In June 2022, Cloudera announced the general availability of Apache Iceberg in the Cloudera Data Platform (CDP). Iceberg is a 100% open-table format, developed through the Apache Software Foundation, which helps users avoid vendor lock-in and implement an open lakehouse.

How To 88

Fine-Grained Authorization with Apache Kudu and Apache Ranger

Cloudera

When Kudu was first introduced as a part of CDH in 2017, it didn’t support any kind of authorization, so only air-gapped and non-secure use cases were satisfied. Coarse-grained authorization was added along with authentication in CDH 5.11 (Kudu 1.3.0). You’ll need to name the policy and set the resource it will apply to.

Upgrade Journey: The Path from CDH to CDP Private Cloud

Cloudera

Cloudera delivers an enterprise data cloud that enables companies to build end-to-end data pipelines for hybrid cloud, spanning edge devices to public or private cloud, with integrated security and governance underpinning it to protect customers' data. Attribute-based access control and SparkSQL fine-grained access control.

Cloud 130

Value Proposition of the Cloudera Operational Database over Legacy Apache HBase Deployments

Cloudera

The CDP Operational Database (COD) builds on the foundation of existing operational database capabilities that were available with Apache HBase and/or Apache Phoenix in legacy CDH and HDP deployments. Quantifiable performance improvements of Apache HBase 2.2.x.

Storage 93

Introducing Apache Iceberg in Cloudera Data Platform

Cloudera

Over the past decade, the successful deployment of large-scale data platforms at our customers has acted as a big data flywheel, driving demand to bring in even more data, apply more sophisticated analytics, and onboard many new data practitioners, from business analysts to data scientists. Key Design Goals.

Data 105

Five Strategies to Accelerate Data Product Development

Cloudera

Deep Java Learning, Apache Spark 3.x, Lambda or Kappa architectures) and implementing reliable streaming capabilities at scale by leveraging technologies such as Apache NiFi and Apache Kafka, has made possible the ability to harness and commercialize an ever-increasing volume of real-time data such as time-series or clickstream data.

Strategy 114