[Draft] `kani-cov`: A coverage tool for Kani #3121

adpaco-aws · 2024-04-03T22:04:59Z

This PR adds a new tool to visualize coverage results stored in a .kanicov file generated with the new coverage implementation from #3119. At the moment, the tool reports only regions which are known to be covered with the green color.

For example, if we take the Rust code:

1 fn _other_function() {
2    println!("Hello, world!");
3 }
4
5 fn test_cov(val: u32) -> bool {
6     if val < 3 || val == 42 {
7         true
8     } else {
9         false
10    }
11 }
12
13 #[cfg_attr(kani, kani::proof)]
14 fn main() {
15    let test1 = test_cov(1);
16    let test2 = test_cov(2);
17    assert!(test1);
18    assert!(test2);
19 }

The tool prints this to terminal:

I'm expecting the tool to be used for testing purposes as well, with "colored regions" being replaced with markers ``` and '''.

The tool still contains a lot of duplicated code from kani-driver. It also needs a proper set of options to filter certain results. This is marked as a draft because it depends on #3119.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 and MIT licenses.

celinval

Are we planning to release this tool or is it only for developers?

Cargo.toml

tools/kanicov/Cargo.toml

adpaco-aws · 2024-04-09T19:52:55Z

Are we planning to release this tool or is it only for developers?

The plan is to release the tool. More details in the upcoming RFC.

adpaco-aws · 2024-04-20T15:46:03Z

Note to self: Consider highlighting uncovered spans in red as the default behavior. At the very least, discuss in the RFC.

This PR replaces the line-based coverage instrumentation we introduced in #2609 with the standard source-based code coverage instrumentation performed by the Rust compiler. As a result, we now insert code coverage checks in the `StatementKind::Coverage(..)` statements produced by the Rust compiler during compilation. These checks include coverage-relevant information[^note-internal] such as the coverage counter/expression they represent [^note-instrument]. Both the coverage metadata (`kanimap`) and coverage results (`kaniraw`) are saved into files after the verification stage. Unfortunately, we currently have a chicken-egg problem with this PR and #3121, where we introduce a tool named `kani-cov` to postprocess coverage results. As explained in #3143, `kani-cov` is expected to be an alias for the `cov` subcommand and provide most of the postprocessing features for coverage-related purposes. But, the tool will likely be introduced after this change. Therefore, we propose to temporarily print a list of the regions in each function with their associated coverage status (i.e., `COVERED` or `UNCOVERED`). ### Source-based code coverage: An example The main advantage of source-based coverage results is their precision with respect to the source code. The [Source-based Code Coverage](https://clang.llvm.org/docs/SourceBasedCodeCoverage.html) documentation explains more details about the LLVM coverage workflow and its different options. For example, let's take this Rust code: ```rust 1 fn _other_function() { 2 println!("Hello, world!"); 3 } 4 5 fn test_cov(val: u32) -> bool { 6 if val < 3 || val == 42 { 7 true 8 } else { 9 false 10 } 11 } 12 13 #[cfg_attr(kani, kani::proof)] 14 fn main() { 15 let test1 = test_cov(1); 16 let test2 = test_cov(2); 17 assert!(test1); 18 assert!(test2); 19 } ``` Compiling and running the program with `rustc` and the `-C instrument-coverage` flag, and using the LLVM tools can get us the following coverage result: ![Image](https://github.com/model-checking/kani/assets/73246657/9070e390-6e0b-4add-828d-d9f9caacad07) In contrast, the `cargo kani --coverage -Zsource-coverage` command currently generates: ``` src/main.rs (main) * 14:1 - 19:2 COVERED src/main.rs (test_cov) * 5:1 - 6:15 COVERED * 6:19 - 6:28 UNCOVERED * 7:9 - 7:13 COVERED * 9:9 - 9:14 UNCOVERED * 11:1 - 11:2 COVERED ``` which is a verification-based coverage result almost equivalent to the runtime coverage results. ### Benchmarking We have evaluated the performance impact of the instrumentation using the `kani-perf.sh` suite (14 benchmarks). For each test, we compare the average time to run standard verification against the average time to run verification with the source-based code coverage feature enabled[^note-line-evaluation]. The evaluation has been performed on an EC2 `m5a.4xlarge` instance running Ubuntu 22.04. The experimental data has been obtained by running the `kani-perf.sh` script 10 times for each version (`only verification` and `verification + coverage`), computing the average and standard deviation. We've split this data into `small` (tests taking 60s or less) and `large` (tests taking more than 60s) and drawn the two graphs below. #### Performance comparison - `small` benchmarks ![performance_comparison_small](https://github.com/user-attachments/assets/679cf412-0193-4b0c-a78c-2d0fb702706f) #### Performance comparison - `large` benchmarks ![performance_comparison_large](https://github.com/user-attachments/assets/4bb5a895-7f57-49e0-86b5-5fea67fad939) #### Comments on performance Looking at the small tests, the performance impact seems negligible in such cases. The difference is more noticeable in the large tests, where the time to run verification and coverage can take 2x or even more. It wouldn't be surprising that, as programs become larger, the complexity of the coverage checking grows exponentially as well. However, since most verification jobs don't take longer than 30min (1800s), it's OK to say that coverage checking represents a 100-200% slowdown in the worst case w.r.t. standard verification. It's also worth noting a few other things: * The standard deviation remains similar in most cases, meaning that the coverage feature doesn't have an impact on their stability. * We haven't tried any SAT solvers other than the ones used by default for each benchmark. It's possible that other solvers perform better/worse with the coverage feature enabled. ### Call-outs * The soundness issue documented in #3441. * The issue with saving coverage mappings for non-reachable functions documented in #3445. * I've modified the test cases in `tests/coverage/` to test this feature. Since this technique is simpler, we don't need that many test cases. However, it's possible I've left some test cases which don't contribute much. Please let me know if you want to add/remove a test case. [^note-internal]: The coverage mappings can't be accessed through the StableMIR interface so we retrieve them through the internal API. [^note-instrument]: The instrumentation replaces certain counters with expressions based on other counters when possible to avoid a part of the runtime overhead. More details can be found [here](https://github.com/rust-lang/rustc-dev-guide/blob/master/src/llvm-coverage-instrumentation.md#mir-pass-instrumentcoverage). Unfortunately, we can't avoid instrumenting expressions at the moment. [^note-line-evaluation]: We have not compared performance against the line-based code coverage feature because it doesn't seem worth it. The line-based coverage feature is guaranteed to include more coverage checks than the source-based one for any function. In addition, source-based results are more precise than line-based ones. So this change represents both a quantitative and qualitative improvement. By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 and MIT licenses.

This PR changes the condition we use to decide where to store the coverage data resulting from a coverage-enabled verification run. The previous condition would otherwise cause Kani to crash at the end of the run in some cases: ``` Source-based code coverage results: src/pair.rs (pair::Pair::new) * 6:5 - 8:6 COVERED src/pair.rs (pair::Pair::sum) * 9:5 - 11:6 COVERED src/pair.rs (pair::kani_tests::test_one_plus_two) * 29:5 - 32:6 COVERED Verification Time: 0.083455205s thread 'main' panicked at kani-driver/src/coverage/cov_session.rs:70:43: called `Option::unwrap()` on a `None` value note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace ``` The PR also adds a new `cargo-coverage` mode to `compiletest` which runs `cargo kani` with the source-based coverage feature enabled. This ensures that coverage-enabled `cargo kani` runs are also being tested, and will also be helpful for testing the upcoming coverage tool (#3121) with cargo packages. By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 and MIT licenses.

kanicov: A tool to visualize coverage in Kani

c487f2a

github-actions bot added the Z-BenchCI Tag a PR to run benchmark CI label Apr 3, 2024

celinval reviewed Apr 5, 2024

View reviewed changes

Cargo.toml Outdated Show resolved Hide resolved

celinval reviewed Apr 5, 2024

View reviewed changes

tools/kanicov/Cargo.toml Outdated Show resolved Hide resolved

Rename and more

eddb363

adpaco-aws changed the title ~~[Draft] kanicov: A tool to visualize coverage in Kani~~ [Draft] kani-cov: A tool to visualize coverage in Kani Apr 9, 2024

adpaco-aws changed the title ~~[Draft] kani-cov: A tool to visualize coverage in Kani~~ [Draft] kani-cov: A coverage tool for Kani Apr 9, 2024

feliperodri assigned adpaco-aws Jun 10, 2024

This was referenced Aug 15, 2024

Adopt Rust's source-based code coverage instrumentation #3119

Merged

Coverage metadata should include mappings for all source code #3445

Open

adpaco-aws added 3 commits August 21, 2024 15:01

Merge branch 'main' into kanicov-tool

935dace

Scaffold for kani-cov

13c26fd

Parse and combine raw results

bee8f2d

adpaco-aws added 13 commits August 28, 2024 20:27

Merge branch 'main' into kanicov-tool

e0b7a10

Add missing copyright notices

0aa25b9

Remove atty dependency

debeef4

Add license to Cargo.toml

9e9575a

Move unused code to report

dd53e38

Save combined results through merge

ad632f3

Start summary command

a659c1a

Some format fixes

76351fd

Fixes for clippy

304cfd2

Add description to Cargo.toml

c69f465

Parse function info with tree-sitter

d0bb478

Use file-function pairs to combine results

5512d28

Change combined results again

0e1dcc0

adpaco-aws added 11 commits September 13, 2024 15:37

Produce markdown table with summary

fabdf44

Sketch Json format for summary

855d0a6

Refator some code so it's shared with report

0df18b9

Init. version for terminal reports

50bb553

Simplify processing for markers

5b4d717

Remove some formatting code that's not needed anymore

e1d0160

Remove leftover debugging info

6d47c0b

Complete terminal highlight algorithm

ae48e21

Change coverage mode to get kani-cov reports

8c79baf

Bless expected files

39f5ac2

Complete blessing for coverage tests

778816a

adpaco-aws mentioned this pull request Sep 20, 2024

Fix storing coverage data in cargo projects #3527

Merged

adpaco-aws added 4 commits September 20, 2024 15:03

Merge branch 'main' into kanicov-tool

b33fd5c

Reformat

06b97f0

Clippy fixes

781ba58

Clean up: unused imports and commented out code

7d7ef6c

adpaco-aws added 7 commits September 20, 2024 18:22

Merge branch 'main' into kanicov-tool

a997656

Report format

5cd0687

Introduce format for reports

4694294

Add documentation for several functions

21f2bd8

Most clippy fixes

923e606

More documentation

812231e

Documentation for main methods

745a55f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Draft] `kani-cov`: A coverage tool for Kani #3121

[Draft] `kani-cov`: A coverage tool for Kani #3121

adpaco-aws commented Apr 3, 2024

celinval left a comment

adpaco-aws commented Apr 9, 2024

adpaco-aws commented Apr 20, 2024

[Draft] kani-cov: A coverage tool for Kani #3121

Are you sure you want to change the base?

[Draft] kani-cov: A coverage tool for Kani #3121

Conversation

adpaco-aws commented Apr 3, 2024

celinval left a comment

Choose a reason for hiding this comment

adpaco-aws commented Apr 9, 2024

adpaco-aws commented Apr 20, 2024

[Draft] `kani-cov`: A coverage tool for Kani #3121

[Draft] `kani-cov`: A coverage tool for Kani #3121