Spark, being mostly a compute engine, is only one of the many ways to access the data in your data lake. A better approach is therefore to implement your data access policies in the storage engine, so that they apply no matter which client reads the data. For example, you can use Spark to access data in a kerberized Hadoop cluster and implement OPA policies for HDFS paths.
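As a rough illustration of what a path-level check against OPA could look like, here is a minimal Python sketch. It uses OPA's Data API (`POST /v1/data/<policy path>` with an `input` document). The policy path `hdfs/authz/allow`, the input fields `user`, `path` and `operation`, and the helper names are assumptions made for this example, not something HDFS or Spark ships with. How the storage layer actually calls OPA depends on your integration.

```python
import json
import urllib.request

# Assumption: OPA listens on its default port and a policy exposes the
# rule `allow` in a hypothetical package `hdfs.authz`.
OPA_URL = "http://localhost:8181/v1/data/hdfs/authz/allow"


def build_input(user, path, operation):
    """Wrap the request attributes in the `input` document OPA expects."""
    return {"input": {"user": user, "path": path, "operation": operation}}


def parse_decision(body):
    """Return True only for an explicit allow.

    OPA omits `result` when the rule is undefined, so a missing key
    is treated as a deny.
    """
    return body.get("result") is True


def check_access(user, path, operation, url=OPA_URL):
    """Ask OPA whether `user` may perform `operation` on `path`."""
    payload = json.dumps(build_input(user, path, operation)).encode("utf-8")
    request = urllib.request.Request(
        url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=5) as response:
        return parse_decision(json.load(response))
```

With an OPA server running and a matching policy loaded, `check_access("alice", "/data/sales", "read")` would return the policy's decision. The point of the sketch is that the decision is made per path and per user, independently of whether the caller is Spark or any other client.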
I want to use OPA to control access to specific paths in the data lake that Spark accesses. Is this approach not suitable?