Flaky test list #3186

Closed
3 tasks done
apostasie opened this issue Jul 5, 2024 · 7 comments
Labels
area/ci e.g., CI failure bug Something isn't working

Comments

@apostasie
Contributor

apostasie commented Jul 5, 2024

CI has improved with the last few PRs, but there are still sources of flakiness.

Floodgates are open here: #3189

This ticket is meant to list them all, so that we can get to the bottom of each one and fix it.

Feel free to add here any transient failure you see in the comments, with a copy of the logs and a link to the failing build (first failure in the run ONLY).

Fixed by #3192:

@apostasie apostasie added the kind/unconfirmed-bug-claim Unconfirmed bug claim label Jul 5, 2024
@apostasie
Contributor Author

These issues look a lot like #3092

Something really weird is going on.

@AkihiroSuda AkihiroSuda added area/ci e.g., CI failure bug Something isn't working and removed kind/unconfirmed-bug-claim Unconfirmed bug claim labels Jul 5, 2024
@apostasie
Contributor Author

apostasie commented Jul 6, 2024

Canary PR: 10 failures

Build link:

https://github.com/containerd/nerdctl/actions/runs/9815148442?pr=3189

@apostasie
Contributor Author

apostasie commented Jul 6, 2024

Canary PR: 7 failures (3 counts TestTailFollowRotateLogs, 1 count TestIPFSAddress).

3 "new":

  • TestBuildAttestation (2 counts) - the log is not very informative - "error: failed to solve: not found" sounds like a BuildKit message to me
  • TestRunRestart - "OCI runtime start failed: cannot start a container that has stopped: unknown" - a bit worrisome - probably the same underlying issue as TestComposeUp above

Build link:

https://github.com/containerd/nerdctl/actions/runs/9815520719?pr=3189

@apostasie
Contributor Author

apostasie commented Jul 6, 2024

Canary: 7 failures

  • TestRunWithSystemdTrueEnabled
  • TestBuildAttestation
  • TestImageEncryptJWE
  • TestTailFollowRotateLogs
  • TestIPFSComposeUp (2 counts)
  • TestVolumeLsFilter

@apostasie
Contributor Author

TestBuildAttestation & TestTailFollowRotateLogs are now top two.

@apostasie
Contributor Author

apostasie commented Jul 9, 2024

As of today, fixes for many of these issues have been merged.

Latest Canary build (with extra fixes for TestBuildAttestation) failed 4 times:

  • TestCopyToContainer
  • TestTailFollowRotateLogs is still failing (3x count)

@apostasie
Contributor Author

The latest (not yet merged) version of the test tooling will introduce a way to mark flaky tests in code, which allows separate runs for flaky tests (with retry) and non-flaky tests.

This ticket is getting outdated and unwieldy.

Closing for the better solution of in-code marking.
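
For readers unfamiliar with the idea, here is a minimal sketch of what in-code marking with separate runs could look like. It is not the actual nerdctl test tooling: `Case`, `Partition`, `RunWithRetry`, and `ErrTransient` are hypothetical names invented for illustration, and the retry count is an arbitrary assumption.

```go
package main

import (
	"errors"
	"fmt"
)

// ErrTransient is a hypothetical error standing in for a transient failure.
var ErrTransient = errors.New("transient failure")

// Case is a hypothetical test descriptor. The Flaky field is the
// in-code marker; it is an illustration, not the real tooling's API.
type Case struct {
	Name  string
	Flaky bool
	Run   func() error
}

// Partition splits cases into stable and flaky groups so that each
// group can be executed in its own run.
func Partition(cases []Case) (stable, flaky []Case) {
	for _, c := range cases {
		if c.Flaky {
			flaky = append(flaky, c)
		} else {
			stable = append(stable, c)
		}
	}
	return stable, flaky
}

// RunWithRetry runs c up to attempts times (at least once). It returns
// the number of attempts used and the error from the final attempt.
func RunWithRetry(c Case, attempts int) (int, error) {
	if attempts < 1 {
		attempts = 1
	}
	var err error
	for i := 1; i <= attempts; i++ {
		if err = c.Run(); err == nil {
			return i, nil
		}
	}
	return attempts, err
}

func main() {
	calls := 0
	cases := []Case{
		{Name: "StableExample", Run: func() error { return nil }},
		{Name: "FlakyExample", Flaky: true, Run: func() error {
			// Fails on the first two calls, then succeeds.
			calls++
			if calls < 3 {
				return ErrTransient
			}
			return nil
		}},
	}

	stable, flaky := Partition(cases)

	// Stable tests get exactly one attempt: any failure is a real failure.
	for _, c := range stable {
		n, err := RunWithRetry(c, 1)
		fmt.Printf("stable %s: attempts=%d err=%v\n", c.Name, n, err)
	}
	// Flaky tests get a retry budget (3 here, chosen arbitrarily).
	for _, c := range flaky {
		n, err := RunWithRetry(c, 3)
		fmt.Printf("flaky  %s: attempts=%d err=%v\n", c.Name, n, err)
	}
}
```

The point of the split is that a failure in the stable run is always treated as a real regression, while retries are confined to tests that were explicitly marked.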

@apostasie apostasie closed this as not planned Won't fix, can't repro, duplicate, stale Oct 3, 2024