[Recoverable] Allow certain tasks to gracefully reconnect without crashing the entire cluster during transient errors (i.e. preemption). #19238
+822
−92
Google CLA / cla/google
succeeded
Nov 15, 2024 in 2s
✅ All contributors are covered under a CLA with Google
See https://cla.developers.google.com/ for more info about Google's Contributor License Agreement (CLA).
ℹ️ Googlers: Go here to view more details and manage scans for this pull request.
Details
The following contributors were found for this pull request:
✅ 54fad16 PR Opener: @copybara-service[bot]
✅ 54fad16 Author: @Google-ML-Automation <googl**********ation@google.com>
(Only the first commit for a unique contributor is listed.)
Loading