Supplement Results

1. RAUC-k metric

RAUC-k measures prioritization effectiveness when, in practice, the full prioritized test suite cannot be executed within a limited time budget, so only the top k tests are run. It is calculated from the prioritization result graph, which plots the number of executed tests on the x-axis against the number of detected bugs on the y-axis. RAUC is obtained by computing the area under the curve of the prioritization technique and comparing it with the area under the curve of the ideal prioritization, that is, the order in which the test cases would have been executed had all bugs been known beforehand. In our study, we evaluated the TCP technique on the top 25%, 50%, 75%, and 100% of the total number of tests, referred to as RAUC-25%, RAUC-50%, RAUC-75%, and RAUC-100%, respectively. A higher RAUC-k value indicates a better prioritization strategy. Bold entries in the table mark the best value.
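The computation described above can be sketched as follows. This is a minimal illustration, not the evaluation script used in the study, and it rests on several assumptions: each test is represented by a 0/1 flag saying whether it reveals a bug, the area under the curve is approximated by summing the cumulative bug counts over the first k tests, and the "contrast" with the ideal curve is taken as a ratio. The function name `rauc_k` and its parameters are hypothetical.

```python
from itertools import accumulate


def rauc_k(bug_flags, percent=100):
    """Sketch of RAUC-k for one prioritized test order.

    bug_flags: list of 0/1 values in prioritized order (assumption:
               1 means the test reveals a bug, 0 means it does not).
    percent:   share of the suite that can be executed (25, 50, 75, 100).
    """
    n = len(bug_flags)
    # Number of tests that fit in the budget, rounded up, at least one.
    k = max(1, (n * percent + 99) // 100)

    def area(flags):
        # Step-wise area: sum of cumulative bug counts over the top k tests.
        return sum(accumulate(flags[:k]))

    # Ideal order (assumption): all bug-revealing tests come first.
    ideal_area = area(sorted(bug_flags, reverse=True))
    if ideal_area == 0:
        return 0.0  # no bugs at all, so the ratio is undefined; report 0
    return area(bug_flags) / ideal_area
```

Under these assumptions, the order `[0, 1, 0, 1]` scores 4/7 for RAUC-100% and 1/3 for RAUC-50%, while an order that places every bug-revealing test first scores 1.0 at every cutoff.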

2. Templates of Model Generation for Three Libraries

PyTorch

Keras

ONNX
