Add all possible endpoint to diagnose command #24377

louis-cqrl · 2024-04-04T11:19:42Z

What does this PR do?

Add endpoints to diagnose command

Motivation

https://datadoghq.atlassian.net/browse/ASCII-707

Additional Notes

Possible Drawbacks / Trade-offs

Describe how to test/QA your changes

All old endpoints in datadog-agent diagnose should behave exactly the same as before
All new endpoints in datadog-agent diagnose should succeed

…agnose

pr-commenter · 2024-04-04T12:43:35Z

Test changes on VM

Use this command from test-infra-definitions to manually test this PR changes on a VM:

inv create-vm --pipeline-id=34558172 --os-family=ubuntu

pr-commenter · 2024-04-04T13:23:32Z

Regression Detector

Regression Detector Results

Run ID: f9909f57-a59e-497a-8eaa-ac8286c2d476
Baseline: 797b120
Comparison: e542628

Performance changes are noted in the perf column of each table:

✅ = significantly better comparison variant performance
❌ = significantly worse comparison variant performance
➖ = no significant change in performance

No significant changes in experiment optimization goals

Confidence level: 90.00%
Effect size tolerance: |Δ mean %| ≥ 5.00%

There were no significant changes in experiment optimization goals at this confidence level and effect size tolerance.

Fine details of change detection per experiment

perf	experiment	goal	Δ mean %	Δ mean % CI
➖	uds_dogstatsd_to_api_cpu	% cpu utilization	+1.00	[-1.76, +3.75]
➖	file_tree	memory utilization	+0.88	[+0.77, +1.00]
➖	basic_py_check	% cpu utilization	+0.51	[-1.97, +2.99]
➖	tcp_syslog_to_blackhole	ingress throughput	+0.32	[-20.90, +21.55]
➖	otel_to_otel_logs	ingress throughput	+0.06	[-0.30, +0.41]
➖	tcp_dd_logs_filter_exclude	ingress throughput	+0.00	[-0.04, +0.04]
➖	trace_agent_json	ingress throughput	-0.00	[-0.01, +0.01]
➖	trace_agent_msgpack	ingress throughput	-0.00	[-0.00, +0.00]
➖	uds_dogstatsd_to_api	ingress throughput	-0.01	[-0.22, +0.19]
➖	idle	memory utilization	-0.32	[-0.36, -0.28]
➖	pycheck_1000_100byte_tags	% cpu utilization	-3.18	[-7.65, +1.30]

Explanation

A regression test is an A/B test of target performance in a repeatable rig, where "performance" is measured as "comparison variant minus baseline variant" for an optimization goal (e.g., ingress throughput). Due to intrinsic variability in measuring that goal, we can only estimate its mean value for each experiment; we report uncertainty in that value as a 90.00% confidence interval denoted "Δ mean % CI".

For each experiment, we decide whether a change in performance is a "regression" -- a change worth investigating further -- if all of the following criteria are true:

Its estimated |Δ mean %| ≥ 5.00%, indicating the change is big enough to merit a closer look.
Its 90.00% confidence interval "Δ mean % CI" does not contain zero, indicating that if our statistical model is accurate, there is at least a 90.00% chance there is a difference in performance between baseline and comparison variants.
Its configuration does not mark it "erratic".

…agnose

…y tests in diagnose/README.md

…agnose

…iagnose

…agnose

codecov · 2024-05-17T14:45:17Z

Codecov Report

Attention: Patch coverage is 46.93878% with 26 lines in your changes missing coverage. Please review.

Project coverage is 55.80%. Comparing base (797b120) to head (e542628).
Report is 3429 commits behind head on main.

Files with missing lines	Patch %	Lines
pkg/diagnose/connectivity/endpoint_info.go	0.00%	17 Missing ⚠️
pkg/diagnose/connectivity/core_endpoint.go	35.71%	7 Missing and 2 partials ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##             main   #24377       +/-   ##
===========================================
+ Coverage   44.66%   55.80%   +11.14%     
===========================================
  Files        2281      886     -1395     
  Lines      263602    75424   -188178     
===========================================
- Hits       117726    42093    -75633     
+ Misses     136416    30852   -105564     
+ Partials     9460     2479     -6981

Flag	Coverage Δ
amzn_aarch64	`56.66% <46.93%> (+11.21%)`	⬆️
centos_x86_64	`56.56% <46.93%> (+11.21%)`	⬆️
ubuntu_aarch64	`56.67% <46.93%> (+11.21%)`	⬆️
ubuntu_x86_64	`56.67% <46.93%> (+11.23%)`	⬆️
windows_amd64	`55.27% <46.93%> (+4.28%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

feat(endpoint_info.go): try to add endpoint with no success

5573522

louis-cqrl added changelog/no-changelog team/agent-shared-components labels Apr 4, 2024

louis-cqrl and others added 2 commits April 4, 2024 13:20

Merge branch 'main' into louis-cqrl/add-all-endpoints-connectivity-di…

382d9ad

…agnose

remove endpoint that fail during diagnose command

20de192

louis-cqrl and others added 19 commits April 5, 2024 13:25

Update working endpoints in endpoints.go

911cc34

Merge branch 'main' into louis-cqrl/add-all-endpoints-connectivity-di…

89adcc7

…agnose

Update endpoints in defaultforwarder/endpoints.go and add connectivit…

06fc382

…y tests in diagnose/README.md

Merge branch 'main' into louis-cqrl/add-all-endpoints-connectivity-di…

5daa1a4

…agnose

Add subdomain field in endpoint struct to allow other subdomains

b00f337

Refactor endpoint structs and update connectivity diagnose tests

763f33a

add endpoints in endpoints.go

6394fe9

Refactor endpoint struct and update connectivity diagnose tests

bd660ed

Refactor endpoint struct and update connectivity diagnose tests

164737f

Crash test some endpoints

44556b1

Refactor endpoint struct and update connectivity diagnose tests

3e50333

Make 400 with specific body message from processes endpoint work in d…

afad245

…iagnose

Revert commented out endpoint in endpoint_info.go

5d44b42

Merge branch 'main' into louis-cqrl/add-all-endpoints-connectivity-di…

89f1c18

…agnose

fix linter

7cb67f8

fix endpoint creator

3483195

add other subdomain test

3f2b9f1

fix linter

9f9d970

Merge branch 'main' into louis-cqrl/add-all-endpoints-connectivity-di…

e542628

…agnose

dd-devflow bot closed this Nov 17, 2024

dd-devflow bot deleted the louis-cqrl/add-all-endpoints-connectivity-diagnose branch November 17, 2024 00:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add all possible endpoint to diagnose command #24377

Add all possible endpoint to diagnose command #24377

louis-cqrl commented Apr 4, 2024

pr-commenter bot commented Apr 4, 2024 •

edited

Loading

pr-commenter bot commented Apr 4, 2024 •

edited

Loading

Fine details of change detection per experiment

Explanation

codecov bot commented May 17, 2024 •

edited

Loading

Add all possible endpoint to diagnose command #24377

Add all possible endpoint to diagnose command #24377

Conversation

louis-cqrl commented Apr 4, 2024

What does this PR do?

Motivation

Additional Notes

Possible Drawbacks / Trade-offs

Describe how to test/QA your changes

pr-commenter bot commented Apr 4, 2024 • edited Loading

Test changes on VM

pr-commenter bot commented Apr 4, 2024 • edited Loading

Regression Detector

Regression Detector Results

No significant changes in experiment optimization goals

Fine details of change detection per experiment

Explanation

codecov bot commented May 17, 2024 • edited Loading

Codecov Report

pr-commenter bot commented Apr 4, 2024 •

edited

Loading

pr-commenter bot commented Apr 4, 2024 •

edited

Loading

codecov bot commented May 17, 2024 •

edited

Loading