Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Unity Catalog] - Change the import library process #980

Open
2 tasks
thesqlpro opened this issue Dec 19, 2024 · 5 comments
Open
2 tasks

[Unity Catalog] - Change the import library process #980

thesqlpro opened this issue Dec 19, 2024 · 5 comments
Assignees
Labels

Comments

@thesqlpro
Copy link
Contributor

thesqlpro commented Dec 19, 2024

Libraries are being imported into DBFS - as of now with Unity Catalog enabled these files go into the legacy Hive metastore. We would like to upload them into Unity Catalog. Additionally, need to back port the work to ADF as it will fail with the new way of storing the libraries. #896 depends on this task as the upgrade to the cluster will cause problems for ADF. Task #896 is ready, but this needs to be completed first.

DoD

  • Library files are stored in the Unity Catalog metastore and no longer in the legacy Hive Metastore
  • Research -Back port of ADF pipelines to use new file paths from metastore

Task for #765

@ydaponte
Copy link
Collaborator

@thesqlpro, please do add a DoD and fill in the metadata.

@ydaponte ydaponte added the P0 Uber URGENT priority label Dec 20, 2024
@thesqlpro
Copy link
Contributor Author

DoD and meta updated

@thesqlpro
Copy link
Contributor Author

Adding some notes here so we can discuss at the next design meeting. Cluster configured to 15.4LTS with Unity Catalog. Libraries are still being mounted properly in DBFS notebooks function properly as well.

Image

Image

@thesqlpro
Copy link
Contributor Author

ADF executes properly until SQLDW load (investigating).

Image

@thesqlpro
Copy link
Contributor Author

Some more findings. The ADF Linked Service for Databricks uses an older version of spark that's not in our current configuration.

{04FFBB33-5DB8-406C-A2C2-4D9FB5602930}

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants