Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The provided customer-managed cluster 'Rubrik-Exocompute-Customer-Managed' is not in the expected 'ACTIVE' state #125

Open
DamaniN opened this issue Feb 26, 2024 · 1 comment
Labels
api-issue There is an issue with the backend API aws Amazon AWS bug Something isn't working

Comments

@DamaniN
Copy link

DamaniN commented Feb 26, 2024

Expected Behavior

When attaching a customer managed Exocompute cluster to RSC, the command should complete successfully.

Current Behavior

When running this code:

data "aws_region" "current" {}

# Temporary fix until this error is resolved:
# │ Error: failed to lookup exocompute config: failed to get vpcs: failed to 
# | request allVpcsByRegionFromAws: graphql response body is an error (status code 200): Objects 
# | are not authorized (code: 403, traceId: x9TBQt14uQpe5tSLU2BDEQ==) | error is

resource "time_sleep" "wait_for_polaris_sync" {
  create_duration = "60s"
}

# Create an Exocompute configuration using the specified VPC and subnets.
resource "polaris_aws_exocompute" "customer_managed" {
  account_id              = var.rsc_aws_cnp_account_id
  region                  = data.aws_region.current.name
  
  depends_on = [time_sleep.wait_for_polaris_sync]
}

resource "polaris_aws_exocompute_cluster_attachment" "cluster" {
  exocompute_id = polaris_aws_exocompute.customer_managed.id
  cluster_name  = var.aws_eks_cluster_name
}

The following error occurs:

│ Error: failed to connect exocompute cluster: failed to request connectAwsExocomputeCluster: graphql response body is an error (status code 200): INVALID_ARGUMENT: Error RBK30200017: The customer-managed cluster 'Rubrik-Exocompute-Customer-Managed' is not ready for Exocompute. Possible cause: The provided customer-managed cluster 'Rubrik-Exocompute-Customer-Managed' is not in the expected 'ACTIVE' state. Possible remedy: Ensure the provided cluster is in the 'ACTIVE' state for Exocompute. (code: 400, traceId: 8xmlXkATpZcjVEfo3JXd4A==)
│
│   with module.polaris-aws-cloud-native-customer-managed-exocompute-us-east-1.polaris_aws_exocompute_cluster_attachment.cluster,
│   on ../../terraform-aws-polaris-cloud-native-customer-managed-exocompute/main.tf line 20, in resource "polaris_aws_exocompute_cluster_attachment" "cluster":
│   20: resource "polaris_aws_exocompute_cluster_attachment" "cluster" {
│

Rerunning the plan some time later resolves the issue.

I suspect that after running polaris_aws_exocompute that some time is needed for the cluster to become active. TF should not return until the cluster is active. Alternatively, polaris_aws_exocompute_cluster_attachment should wait longer for the cluster to become active.


@johan3141592 johan3141592 added bug Something isn't working aws Amazon AWS api-issue There is an issue with the backend API labels Mar 26, 2024
@johan3141592
Copy link
Member

@DamaniN not sure about what could be causing this issue. Looking at the attached TF configuration code, the snippet does not create a delay between polaris_aws_exocompute and polaris_aws_exocompute_cluster_attachment, it just creates a delay before polaris_aws_exocompute is created. Is the AWS EKS cluster also created by the same TF configuration? Could it be that the EKS cluster is not yet fully created in AWS?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
api-issue There is an issue with the backend API aws Amazon AWS bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants