Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

suppress systemd-run error(Connection reset by peer) while validating ext cgroups #3278

Merged
merged 2 commits into from
Dec 19, 2024

Conversation

nagworld9
Copy link
Contributor

Description

Cgroups may be disabled if there is a systemd-error while invoking any extension. As a result, extensions won't start in cgroups, and validation fail. This is expected behavior, so we are suppressing systemd error (Connection reset by peer)

Today, we did suppress for unknown process in agent cgroup but recently we improved the agent logic to determine unexpected processes. So, we shouldn't expect to see this unknown process error now and removing that in this pr.

Issue #


PR information

  • Ensure development PR is based on the develop branch.
  • The title of the PR is clear and informative.
  • There are a small number of commits, each of which has an informative message. This means that previously merged commits do not appear in the history of the PR. For information on cleaning up the commits in your pull request, see this page.
  • If applicable, the PR references the bug/issue that it fixes in the description.
  • New Unit tests were added for the changes made

Quality of Code and Contribution Guidelines

# It is possible that agent cgroup can be disabled due to UNKNOWN process or throttled before we run this check, in that case, we should ignore the validation
if check_cgroup_disabled_with_unknown_process() and retry_if_false(check_agent_quota_disabled):
log.info("Cgroup is disabled due to UNKNOWN process, ignoring ext cgroups validations")
# It is possible that agent cgroup can be disabled and reset the quotas if extension failed start using systemd-run. In that case, we should ignore the validation
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
# It is possible that agent cgroup can be disabled and reset the quotas if extension failed start using systemd-run. In that case, we should ignore the validation
# It is possible that agent cgroup can be disabled and reset the quotas if the extension failed to start using systemd-run. In that case, we should ignore the validation

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

updated

@nagworld9 nagworld9 merged commit d536497 into Azure:develop Dec 19, 2024
9 of 11 checks passed
@nagworld9 nagworld9 deleted the systemderror branch December 19, 2024 22:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants