aws_glue : CfnCrawler missing data source HudiTarget both in CFN and CDK. HudiTarget only available in GUI #31133
Labels
@aws-cdk/aws-glue
Related to AWS Glue
effort/small
Small work item – less than a day of effort
feature-request
A feature should be added or improved.
needs-cfn
This issue is waiting on changes to CloudFormation before it can be addressed.
p2
Describe the feature
Hello
If I set up my crawler from GUI console I can choose Hudi S3 Table as a data source to be crawled. This is not yet possible neither on CFN or CDK. CDK already has iceberg and delta lake as sources for the crawler but no Hudi : https://docs.aws.amazon.com/cdk/api/v2/docs/aws-cdk-lib.aws_glue.CfnCrawler.html
Use Case
I would like to use the serde Hudi and not parquet when crawling my hudi S3 folders, although available via GUI it is still not present in CDK or CFN
Proposed Solution
P1. Add at least to CloudFormation HudiTarget to complete the data lake frameworks formats (given that Iceberg and Delta Target are present in CFN)
P2. Add to CDK the target type
Other Information
No response
Acknowledgements
CDK version used
"aws-cdk-lib": "^2.115.0",
Environment details (OS name and version, etc.)
MACBookPro M1
The text was updated successfully, but these errors were encountered: