I’m curious about why integration between OPA and Spark is not supported. #54

wooarchi · 2024-08-26T05:59:20Z

wooarchi
Aug 26, 2024

I want to use OPA to control access to specific paths in the data lake that Spark accesses. Is this approach not suitable?

razvan · 2024-08-26T08:04:12Z

razvan
Aug 26, 2024
Maintainer

Spark, being mostly a compute engine, is only one of the many ways to access the data in your data lake. Therefore, a better approach is to implement your data access policies in the storage engine.

For example, you can use Spark to access data in a kerberized Hadoop cluster and implement OPA policies for HDFS paths.
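As a rough illustration of what a storage-side check could look like, here is a minimal Python sketch that asks OPA for a decision on a path through its REST Data API (`POST /v1/data/...` with an `{"input": ...}` body). The OPA address, the policy path `hdfs/allow`, and the input fields `user`, `path`, and `operation` are assumptions made up for this example; they are not the input format used by any particular HDFS authorizer.

```python
import json
import urllib.request

# Hypothetical values: the OPA address and the policy path "hdfs/allow" are
# assumptions for illustration only.
OPA_URL = "http://localhost:8181/v1/data/hdfs/allow"


def build_input(user: str, path: str, operation: str) -> dict:
    """Wrap the request attributes in the {"input": ...} envelope of OPA's Data API."""
    return {"input": {"user": user, "path": path, "operation": operation}}


def is_allowed(response: dict) -> bool:
    """Deny by default: OPA omits "result" when the queried rule is undefined."""
    return response.get("result") is True


def check_access(user: str, path: str, operation: str, url: str = OPA_URL) -> bool:
    """POST the input document to OPA and interpret the decision."""
    body = json.dumps(build_input(user, path, operation)).encode("utf-8")
    request = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=5) as reply:
        return is_allowed(json.load(reply))
```

Because the decision is made where the data is stored, the same check applies whether the request comes from Spark or from any other client of the data lake.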

1 reply

wooarchi Aug 26, 2024
Author

Thank you for your kind response. Thanks to you, I fully understand.
