Partition helix test runs using historical execution time data #62138

dibarbet · 2022-06-24T23:56:40Z

Resolves #62036

This does a few different things to fix our test partitioning

Run real test discovery to find the real set of tests to run and puts that in a json file in prepare-tests. We must do it here because discovery must be run before test payload minimization.
During partitioning, we lookup the test execution history for the appropriate stage to estimate the execution time of each test. We then use this info to partition the tests into work items that run under a certain time (currently 2:30). There is one test that does not run in under 2:30 consistently - Microsoft.CodeAnalysis.CSharp.UnitTests.OverloadResolutionPerfTests.NestedLambdas_01 but that can be fixed as a followup
Run tests via vstest.console.dll directly. This is what already gets called when using dotnet test and passing a specific dll in. We run directly on vstest.console.dll so we can utilize RSP files to output a test filter that contains all fully qualified test method names (there is a bug where RSP files passed to dotnet test do not get passed down correctly, see Long response files don't work with dotnet test & dotnet vstest microsoft/vstest#3513). RSP files get around max path limitations and avoid escaping issues on different platforms.

dibarbet · 2022-08-05T21:17:38Z

@jaredpar this is ready for round 2

src/Compilers/CSharp/Test/EndToEnd/Microsoft.CodeAnalysis.CSharp.EndToEnd.UnitTests.csproj

src/Tools/Source/RunTests/TestHistoryManager.cs

src/Tools/Source/RunTests/RunTests.csproj

RikkiGibson

great progress.

src/Compilers/Test/Core/RunInSinglePartitionAssemblyAttribute.cs

src/Tools/PrepareTests/TestDiscovery.cs

src/Tools/Source/RunTests/AssemblyInfo.cs

src/Tools/Source/RunTests/AssemblyScheduler.cs

src/Tools/Source/RunTests/ProcessRunner.cs

src/Tools/Source/RunTests/ProcessTestExecutor.cs

RikkiGibson · 2022-08-08T21:13:48Z

src/Tools/Source/RunTests/TestHistoryManager.cs

+        timer.Start();
+        for (var i = 0; i < totalTests; i += MaxTestsReturnedPerRequest)
+        {
+            var testResults = await GetTestResultsAsync(runForThisStage, i, MaxTestsReturnedPerRequest, testClient, cancellationToken);


It looks like this phase takes maybe 30 seconds on desktop. Should we tinker with firing the all the tasks off immediately, pushing them into an array or something, then awaiting them one by one?

There's no need to spend time on it in this PR, but thought we should keep it in our back pocket.

Yup - we can play around with it (will do that separately). We have to be careful not to run too many though that we get rate limited.

src/Compilers/CSharp/Test/EndToEnd/Microsoft.CodeAnalysis.CSharp.EndToEnd.UnitTests.csproj

jaredpar · 2022-08-15T22:26:17Z

Can we file a work item to track refactoring run tests and prepare tests to have the better separation of responsibilities? Essentially at this point I think we should just move all helix ops into prepare tests and dumb down run tests a bit (perhaps just remove it from our core infra loop).

dibarbet · 2022-08-15T22:31:06Z

Can we file a work item to track refactoring run tests and prepare tests to have the better separation of responsibilities? Essentially at this point I think we should just move all helix ops into prepare tests and dumb down run tests a bit (perhaps just remove it from our core infra loop).

Done - #63413

dibarbet · 2022-08-15T23:30:59Z

test failures look like a known outage uploading results
https://helix.dot.net/api/jobs/b0ef8a27-a0a8-466f-b918-0152d8be7506/workitems/Microsoft.Build.Tasks.CodeAnalysis.UnitTests_Microsoft.CodeAnalysis.CSharp.CodeStyle.UnitTests_1?api-version=2019-06-17

there is an ICM open for it right now. Will retry once it is closed.

dibarbet · 2022-08-16T21:22:00Z

looks like there are some new breakages that were not showing up before. They also aren't showing up in main so likely some main change/helix rollout + partitioning is breaking this.

[xUnit.net 00:02:10.41]     Microsoft.CodeAnalysis.Editor.UnitTests.CommentSelection.CommentUncommentSelectionCommandHandlerTests.Uncomment_AtBeginningOfEndOfBlockComment [FAIL]
  Failed Microsoft.CodeAnalysis.Editor.UnitTests.CommentSelection.CommentUncommentSelectionCommandHandlerTests.Uncomment_MatchesBlockComment [1 ms]
  Error Message:
   System.TypeInitializationException : The type initializer for 'Roslyn.Test.Utilities.StaTaskScheduler' threw an exception.
---- System.IO.FileLoadException : Could not load file or assembly 'file:///C:/h/w/B66909A9/w/A15A08F5/e/Microsoft.CodeAnalysis.EditorFeatures.UnitTests/Debug/net472/Microsoft.CodeAnalysis.XunitHook.DLL' or one of its dependencies. Operation is not supported. (Exception from HRESULT: 0x80131515)
-------- System.NotSupportedException : An attempt was made to load an assembly from a network location which would have caused the assembly to be sandboxed in previous versions of the .NET Framework. This release of the .NET Framework does not enable CAS policy by default, so this load may be dangerous. If this load is not intended to sandbox the assembly, please enable the loadFromRemoteSources switch. See http://go.microsoft.com/fwlink/?LinkId=155569 for more information.
  Stack Trace:
     at Roslyn.Test.Utilities.StaTaskScheduler.get_DefaultSta()

…WPFFact

dibarbet · 2022-08-18T10:51:24Z

Going to merge this. Will followup on why exactly https://github.com/dotnet/roslyn/blob/main/src/Compilers/CSharp/Test/WinRT/Metadata/WinMdMetadataTests.cs#L141 causes the xunit hook to fail to load when run before any wpffact test.

dotnet-issue-labeler bot added the Area-Infrastructure label Jun 24, 2022

dibarbet force-pushed the partition_tests branch 9 times, most recently from 607bcfc to 841d67c Compare June 30, 2022 00:47

dibarbet force-pushed the partition_tests branch 4 times, most recently from 6f5c82a to b6ada4d Compare July 8, 2022 21:16

RikkiGibson self-assigned this Jul 8, 2022

dibarbet force-pushed the partition_tests branch 8 times, most recently from ab3b550 to 793e9fd Compare July 11, 2022 22:53

dibarbet added 7 commits July 18, 2022 10:46

Add two failing tests

561d57f

Working on partition

45d49dd

more work

1bdef98

working

b428661

more work

eec06a9

some more work on prepare tests

e0a2316

Cleanup unneeded code

43991a3

dibarbet force-pushed the partition_tests branch from 5a57832 to 9fc71af Compare August 5, 2022 02:08

Comment updates

1bd0e1c

dibarbet requested a review from jaredpar August 5, 2022 05:38

jaredpar reviewed Aug 8, 2022

View reviewed changes

RikkiGibson reviewed Aug 8, 2022

View reviewed changes

Review feedback

5eda6de

dibarbet requested review from jaredpar and RikkiGibson August 9, 2022 04:08

Merge branch 'main' into partition_tests

c66fe12

RikkiGibson approved these changes Aug 15, 2022

View reviewed changes

jmarolf reviewed Aug 15, 2022

View reviewed changes

src/Compilers/CSharp/Test/EndToEnd/Microsoft.CodeAnalysis.CSharp.EndToEnd.UnitTests.csproj Show resolved Hide resolved

jmarolf approved these changes Aug 15, 2022

View reviewed changes

jaredpar approved these changes Aug 15, 2022

View reviewed changes

dibarbet mentioned this pull request Aug 15, 2022

Refactor / remove RunTests #63413

Open

Merge remote-tracking branch 'upstream/main' into partition_tests

0519c06

dibarbet added 3 commits August 16, 2022 16:31

Unblock downloaded files on windows

fb0c6fc

Merge remote-tracking branch 'upstream/main' into partition_tests

6b0d0b8

Run WinRT tests in their own work item as they can break tests using …

ff5c993

…WPFFact

dibarbet force-pushed the partition_tests branch from 8ec1499 to ff5c993 Compare August 18, 2022 00:43

dibarbet merged commit c5cf429 into dotnet:main Aug 18, 2022

ghost added this to the Next milestone Aug 18, 2022

dibarbet deleted the partition_tests branch August 22, 2022 18:10

dibarbet modified the milestones: Next, 17.4 P2 Sep 1, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Partition helix test runs using historical execution time data #62138

Partition helix test runs using historical execution time data #62138

dibarbet commented Jun 24, 2022 •

edited

Loading

dibarbet commented Aug 5, 2022

RikkiGibson left a comment

RikkiGibson Aug 8, 2022

dibarbet Aug 8, 2022

jaredpar commented Aug 15, 2022

dibarbet commented Aug 15, 2022

dibarbet commented Aug 15, 2022 •

edited

Loading

dibarbet commented Aug 16, 2022 •

edited

Loading

dibarbet commented Aug 18, 2022

Partition helix test runs using historical execution time data #62138

Partition helix test runs using historical execution time data #62138

Conversation

dibarbet commented Jun 24, 2022 • edited Loading

dibarbet commented Aug 5, 2022

RikkiGibson left a comment

Choose a reason for hiding this comment

RikkiGibson Aug 8, 2022

Choose a reason for hiding this comment

dibarbet Aug 8, 2022

Choose a reason for hiding this comment

jaredpar commented Aug 15, 2022

dibarbet commented Aug 15, 2022

dibarbet commented Aug 15, 2022 • edited Loading

dibarbet commented Aug 16, 2022 • edited Loading

dibarbet commented Aug 18, 2022

dibarbet commented Jun 24, 2022 •

edited

Loading

dibarbet commented Aug 15, 2022 •

edited

Loading

dibarbet commented Aug 16, 2022 •

edited

Loading