Skip to content

Latest commit

 

History

History
26 lines (15 loc) · 1.66 KB

supplement_results.md

File metadata and controls

26 lines (15 loc) · 1.66 KB

Supplement Results

1. RAUC-k metric

$RAUC-k$ is a metric to measure the prioritization effectiveness when all prioritized tests can not be executed completely in a limited time practically. Therefore, RAUC-s are proposed to measure the prioritization effectiveness when only top k tests can be executed. Specifically, it is calculated based on the prioritization result graph with the number of tests as the x-axis and the bug number as the y-axis. The RAUC is determined by calculating the area under the curve of the prioritization technique and contrasting it with the area under the curve of the ideal prioritization, which represents the sequential order in which the test cases would have been executed had all bugs been known beforehand. In our study, we evaluated the performance of the TCP technique on different proportions of test cases, specifically 25%, 50%, 75%, and 100% of the total number of tests, which we referred to as RAUC-25%, RAUC-50%, RAUC-75%, and RAUC-100% respectively. A higher value of RAUC-k indicates better performance of the prioritization strategy. The bold in the table means the best value. image

2. Templates of Model Generation for Three Libraries

PyTorch

torch_template

Keras

keras_template

ONNX

onnx_template