[CONTINT-3558] Add e2e tests for APM origin detection on k8s #22969

Merged
dd-mergequeue merged 21 commits into main from ali/e2e-test-apm-origin
Mar 8, 2024

Conversation

AliDatadog
Contributor

@AliDatadog AliDatadog commented Feb 19, 2024

What does this PR do?

This PR adds e2e tests for APM origin detection on Kubernetes.

Motivation

Increase test coverage.

Additional Notes

N/A

Possible Drawbacks / Trade-offs

N/A

Describe how to test/QA your changes

Run the tests with `inv new-e2e-tests.run --targets=./tests/containers --run TestKindSuite`:

=== RUN   TestKindSuite/TestTraceTCP
=== RUN   TestKindSuite/TestTraceUDP

@AliDatadog AliDatadog added the team/containers, changelog/no-changelog, [deprecated] qa/skip-qa - use other qa/ labels ([DEPRECATED] Please use qa/done or qa/no-code-change to skip creating a QA card), and qa/done (QA done before merge and regressions are covered by tests) labels Feb 19, 2024
@AliDatadog AliDatadog added this to the 7.51.0 milestone Feb 19, 2024
@AliDatadog AliDatadog requested review from a team as code owners February 19, 2024 16:21
@pr-commenter

pr-commenter bot commented Feb 19, 2024

Bloop Bleep... Dogbot Here

Regression Detector Results

Run ID: 7beef838-ee85-4b76-91ac-af28fbf4b2cb
Baseline: 9898528
Comparison: 0cc41ee

Performance changes are noted in the perf column of each table:

  • ✅ = significantly better comparison variant performance
  • ❌ = significantly worse comparison variant performance
  • ➖ = no significant change in performance

No significant changes in experiment optimization goals

Confidence level: 90.00%
Effect size tolerance: |Δ mean %| ≥ 5.00%

There were no significant changes in experiment optimization goals at this confidence level and effect size tolerance.

Experiments ignored for regressions

Regressions in experiments with settings containing erratic: true are ignored.

| perf | experiment | goal | Δ mean % | Δ mean % CI |
|------|------------|------|----------|-------------|
|  | file_to_blackhole | % cpu utilization | -2.06 | [-8.56, +4.44] |

Fine details of change detection per experiment

| perf | experiment | goal | Δ mean % | Δ mean % CI |
|------|------------|------|----------|-------------|
|  | otel_to_otel_logs | ingress throughput | +1.31 | [+0.68, +1.93] |
|  | process_agent_real_time_mode | memory utilization | +0.12 | [+0.08, +0.15] |
|  | file_tree | memory utilization | +0.04 | [-0.03, +0.11] |
|  | process_agent_standard_check_with_stats | memory utilization | +0.04 | [+0.01, +0.07] |
|  | trace_agent_json | ingress throughput | +0.03 | [+0.01, +0.06] |
|  | trace_agent_msgpack | ingress throughput | +0.03 | [+0.01, +0.04] |
|  | uds_dogstatsd_to_api | ingress throughput | +0.00 | [-0.00, +0.00] |
|  | tcp_dd_logs_filter_exclude | ingress throughput | -0.00 | [-0.00, +0.00] |
|  | process_agent_standard_check | memory utilization | -0.03 | [-0.05, -0.00] |
|  | tcp_syslog_to_blackhole | ingress throughput | -0.14 | [-0.19, -0.09] |
|  | idle | memory utilization | -0.40 | [-0.43, -0.37] |
|  | uds_dogstatsd_to_api_cpu | % cpu utilization | -0.48 | [-1.92, +0.95] |
|  | basic_py_check | % cpu utilization | -0.62 | [-2.81, +1.56] |
|  | file_to_blackhole | % cpu utilization | -2.06 | [-8.56, +4.44] |

Explanation

A regression test is an A/B test of target performance in a repeatable rig, where "performance" is measured as "comparison variant minus baseline variant" for an optimization goal (e.g., ingress throughput). Due to intrinsic variability in measuring that goal, we can only estimate its mean value for each experiment; we report uncertainty in that value as a 90.00% confidence interval denoted "Δ mean % CI".

For each experiment, we classify a change in performance as a "regression" -- a change worth investigating further -- if all of the following criteria are true:

  1. Its estimated |Δ mean %| ≥ 5.00%, indicating the change is big enough to merit a closer look.

  2. Its 90.00% confidence interval "Δ mean % CI" does not contain zero, indicating that if our statistical model is accurate, there is at least a 90.00% chance there is a difference in performance between baseline and comparison variants.

  3. Its configuration does not mark it "erratic".
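
The three criteria above can be sketched as a small predicate. This is only an illustration of the stated rule, not the Regression Detector's actual implementation: the `experiment` type, its field names, and the `isRegression` function are hypothetical, and the 5.00% tolerance is the value quoted in this report.

```go
package main

import "math"

// experiment is a hypothetical record for one row of the results table.
type experiment struct {
	Name         string
	DeltaMeanPct float64 // estimated Δ mean %
	CILow        float64 // lower bound of the Δ mean % confidence interval
	CIHigh       float64 // upper bound of the Δ mean % confidence interval
	Erratic      bool    // true when the experiment is configured as erratic
}

// effectSizeTolerance is the |Δ mean %| threshold quoted in the report.
const effectSizeTolerance = 5.0

// isRegression applies the three criteria from the explanation:
// a large enough effect, a confidence interval that excludes zero,
// and an experiment that is not marked erratic.
func isRegression(e experiment) bool {
	bigEnough := math.Abs(e.DeltaMeanPct) >= effectSizeTolerance
	ciExcludesZero := e.CILow > 0 || e.CIHigh < 0
	return bigEnough && ciExcludesZero && !e.Erratic
}
```

Applied to the rows in this report, otel_to_otel_logs (+1.31, CI [+0.68, +1.93]) has an interval that excludes zero but falls short of the 5.00% tolerance, so it is not flagged; file_to_blackhole fails on both effect size and interval, and is marked erratic in any case.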

@AliDatadog
Contributor Author

/trigger-ci --variable RUN_ALL_BUILDS=true --variable RUN_KITCHEN_TESTS=true --variable RUN_E2E_TESTS=auto

@dd-devflow

dd-devflow bot commented Feb 21, 2024

🚂 Gitlab pipeline started

Started pipeline #28643033

@kacper-murzyn kacper-murzyn changed the milestone from 7.51.0 to 7.53.0 Feb 27, 2024
@AliDatadog AliDatadog marked this pull request as draft February 27, 2024 16:09
@AliDatadog AliDatadog marked this pull request as ready for review February 27, 2024 16:18
@AliDatadog AliDatadog marked this pull request as draft March 1, 2024 14:21
@AliDatadog AliDatadog marked this pull request as ready for review March 1, 2024 14:27
@AliDatadog
Contributor Author

/trigger-ci --variable RUN_ALL_BUILDS=true --variable RUN_KITCHEN_TESTS=true --variable RUN_E2E_TESTS=auto

@dd-devflow

dd-devflow bot commented Mar 1, 2024

🚂 Gitlab pipeline started

Started pipeline #29296220

test/new-e2e/tests/containers/k8s_test.go Outdated (5 resolved review threads)
AliDatadog and others added 5 commits March 4, 2024 14:31
Co-authored-by: Lénaïc Huard <L3n41c@users.noreply.github.com>
Co-authored-by: Lénaïc Huard <L3n41c@users.noreply.github.com>
Co-authored-by: Lénaïc Huard <L3n41c@users.noreply.github.com>
Co-authored-by: Lénaïc Huard <L3n41c@users.noreply.github.com>
Co-authored-by: Lénaïc Huard <L3n41c@users.noreply.github.com>
Base automatically changed from ali/bump-test-infra-def to main March 7, 2024 18:00
@AliDatadog
Contributor Author

/trigger-ci --variable RUN_E2E_TESTS=true

@dd-devflow

dd-devflow bot commented Mar 8, 2024

🚂 Gitlab pipeline started

Started pipeline #29787400

@AliDatadog AliDatadog changed the title from "[CONTINT-3558] Add e2e tests for APM origin detection" to "[CONTINT-3558] Add e2e tests for APM origin detection on k8s" Mar 8, 2024
@AliDatadog
Contributor Author

/merge

@dd-devflow

dd-devflow bot commented Mar 8, 2024

🚂 MergeQueue

Pull request added to the queue.

This build is next! (estimated merge in less than 28m)

Use /merge -c to cancel this operation!

@dd-mergequeue dd-mergequeue bot merged commit 5d68596 into main Mar 8, 2024
185 checks passed
@dd-mergequeue dd-mergequeue bot deleted the ali/e2e-test-apm-origin branch March 8, 2024 13:47
julien-lebot pushed a commit that referenced this pull request Mar 11, 2024
* implement e2e test for APM origin detection

* remove test of cgroup version

* only test the namespace workload-tracegen

* simplify the logic with two separate tests for TCP and UDS

* iterate from the most recent traces

* bump test infra definitions

* fix image_tag issue

* add docker_image and image_tag to ecs suite

* rebump

* hardcode the image tag

* Update test/new-e2e/tests/containers/k8s_test.go

Co-authored-by: Lénaïc Huard <L3n41c@users.noreply.github.com>

* Update test/new-e2e/tests/containers/k8s_test.go

Co-authored-by: Lénaïc Huard <L3n41c@users.noreply.github.com>

* Remove the `require` func

Co-authored-by: Lénaïc Huard <L3n41c@users.noreply.github.com>

* Update error message

Co-authored-by: Lénaïc Huard <L3n41c@users.noreply.github.com>

* Iterate in normal order

Co-authored-by: Lénaïc Huard <L3n41c@users.noreply.github.com>

* Rename test

---------

Co-authored-by: Lénaïc Huard <L3n41c@users.noreply.github.com>
Labels
changelog/no-changelog, [deprecated] qa/skip-qa - use other qa/ labels ([DEPRECATED] Please use qa/done or qa/no-code-change to skip creating a QA card), qa/done (QA done before merge and regressions are covered by tests), team/containers
3 participants