Add a resilience test suite to cover the expected behavior of the data plane for both existing and new proxy instances during specific edge conditions
#4861
Description:
Consider adding a resilience test suite that covers the expected behavior of the data plane for both existing and new proxy instances during specific edge conditions, such as API server unavailability, Envoy Gateway (EG) downtime, or EG leader failures.
Ensuring Continuous Operation:
By testing scenarios where the control plane components are unavailable, we can verify that existing and new proxy instances maintain their functionality.
Validating Last Known Good (LKG) State Persistence:
Assess how long the LKG state persists in active EG/Envoy instances during disruptions.
Preventing Premature Readiness of Unconfigured Instances:
Ensuring that unconfigured Envoys/EGs do not reach a ready state prematurely prevents potential misrouting or service failures, thereby maintaining the integrity of the data plane.
Reflecting Real-World Scenarios:
Running EG in a production-like setting with leader election enabled and multiple instances ensures that the tests accurately represent real-world operations, providing confidence in the system’s resilience when things go wrong.
alexwo changed the title from "Add ש resilience test suite to cover the expected behavior of the data plane for both existing and new proxy instances during specific edge conditions" to "Add a resilience test suite to cover the expected behavior of the data plane for both existing and new proxy instances during specific edge conditions" on Dec 6, 2024