Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for fabric link monitoring feature on SUP #3493

Conversation

vmittal-msft
Copy link
Contributor

@vmittal-msft vmittal-msft commented Aug 21, 2024

What I did

Added support for Fabric link monitoring on SUP to log critical syslog if a link is seeing high crc error. This is for 202205 only as master will have complete support of fabric link monitoring

How I did it

A python script will be launched by monit to periodically monitor link state and if a link sees high rate of crc error then a syslog will be generated to record this. We can plan to add link shutdown support later.

How to verify it

Verified this on lab as well as production pilot devices.

Previous command output (if the output of a command-line utility has changed)

New command output (if the output of a command-line utility has changed)

@vmittal-msft
Copy link
Contributor Author

@saksarav-nokia @kenneth-arista for review.

@kenneth-arista
Copy link
Contributor

@vmittal-msft it looks like your PR is based on master. Is this intended?

Tagging @jfeng-arista for awareness

Copy link
Contributor

@arlakshm arlakshm left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you add some UT

scripts/fabric_links_monitoring.py Show resolved Hide resolved
scripts/fabric_links_monitoring.py Show resolved Hide resolved
scripts/fabric_links_monitoring.py Show resolved Hide resolved
scripts/fabric_links_monitoring.py Show resolved Hide resolved
scripts/fabric_links_monitoring.py Show resolved Hide resolved
scripts/fabric_links_monitoring.py Show resolved Hide resolved
scripts/fabric_links_monitoring.py Show resolved Hide resolved
self.filter_for_up_ports = filter_for_up_ports
self.port_map = self.get_port_map()
print('Fabric device id :', namespace)
subprocess.run(["sonic-clear", "fabriccountersport"])
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why clear the counter via CLI, it does not clear the DB entries

scripts/fabric_links_monitoring.py Show resolved Hide resolved
@vmittal-msft
Copy link
Contributor Author

@vmittal-msft it looks like your PR is based on master. Is this intended?

Tagging @jfeng-arista for awareness

No. Moving to sonic-utilities.msft.

@vmittal-msft
Copy link
Contributor Author

Closing this PR on master and opening for 202205/msft repo.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

4 participants