feat: identify internal function invocations in traces #8222

klkvr · 2024-06-21T01:05:36Z

Motivation

Introduces --decode-internal flag for forge test, cast run and cast call --trace which enables decoding of internal functions in traces

Example

Example trace of random Uniswap V3 swap:

Solution

To determine when we are jumping in/out of functions we are using source map Jump key. However, it is not really reliable, especially after optimizations. Almost in all cases there are mismatches between number of "in"s and "out"s, so we need additional processing to correctly display subset of functions which are correctly reported.

Main implementation of this tracing is in DebugTraceIdentifier: https://github.com/foundry-rs/foundry/blob/216e9da8a28fcc57bcba1c6c4986aa5353472cc5/crates/evm/traces/src/debug/mod.rs

The only issue with this approach is that we are losing data about entire stack of internal functions which were joined before revert

I've used default tracer from revm-inspectors instead of traces collected by Debugger to allow easier integration into printing logic. Using it required a small patch to inspectors: paradigmxyz/revm-inspectors#150

This approach is enough to implement flamegraphs in a similar way, and can probably be extended to smarter tracking of stack/memory/calldata to also resolve input and output parameters of internal functions

Printing logic is a bit ugly at the moment

Closes: #3999 + Closes: #4351

Adds `Step` variant for `LogCallOrder` enum and renames it to `TraceMemberOrder`. This is useful for printing logic which relies on execution steps as well, e.g. foundry-rs/foundry#8222

klkvr · 2024-06-24T15:11:30Z

Added tracking of inputs and ouputs as well. It is currently not able to decode user-defined types such as structs and enums, tracking those would probably require smarter AST analysis.

Currently --decode-internal is pretty expensive in terms of memory usage because each step is being tracked for each test. We are only interested in JUMPs and JUMPDESTs, so it might make sense to add a configuration option for TracingInspector to only collect steps with specific opcodes.

mattsse

only briefly skimmed parts of it.

I think this makes sense, I'd appreciate a few more docs, and I'll take a closer look

crates/cli/src/utils/cmd.rs

mattsse · 2024-06-25T19:08:48Z

crates/evm/evm/src/inspectors/stack.rs

+ if self.tracer.is_none() && yes ||
+ !self.tracer.as_ref().map_or(false, |t| t.config().record_steps) && debug


this is a bit hard to follow,
I wonder if we can encapsulate these two bools into an enum TracingKid or smth, because debug also implies tracing, right?

- Adds `record_returndata_snapshots` flag to config which enables snapshots of `interpreter.return_data_buffer` - Adds `record_opcodes_filter` parameter which allows to only record specific opcodes. ref foundry-rs/foundry#8222 (comment) - Adds `gas_used` field for `CallTraceStep` This should be enough to migrate foundry's debugger to using `TracingInspector` from here, I will open PR for this later today.

klkvr · 2024-06-27T08:25:21Z

This is pretty much ready. Memory usage is still very high when running a complex test suite with --decode-internal, but with filters it is not more than for debugger. cast run and cast call are also working fine.

Not sure what to do with printing logic. It is not great and I think it would be better to integrate this directly into TraceWriter once we have #8224. wdyt @mattsse @DaniPopes

mattsse

cool, only have nits

this is now blocked by the tracewriter pr?
cc @zerosnacks

crates/cli/src/utils/cmd.rs

crates/evm/evm/src/executors/invariant/replay.rs

crates/evm/traces/src/debug/sources.rs

ChinW · 2024-06-28T09:41:49Z

crates/evm/traces/src/debug/mod.rs

+ match ty {
+ // For `string` and `bytes` layout is a word with length followed by the data
+ DynSolType::String | DynSolType::Bytes => {
+ let length = U256::from_be_bytes::<32>(first_word.try_into().unwrap()).to::<usize>();


will the usize cause overflow problem, usize is 64 bits (8 bytes), where the length can be 32 bytes long

i am trying this new feature, seeing below error in my end

Message: Uint conversion error: Overflow(256, 10488988339550576416, 18446744073709551615) Location: crates/evm/traces/src/debug/mod.rs:305 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ BACKTRACE ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ ⋮ 14 frames hidden ⋮ 15: core::result::Result<T,E>::expect::h1e62bb9f5f4f9d0f at /rustc/d124af4d4a707212dfe7c081/library/core/src/result.rs:1034 16: ruint::from::<impl ruint::Uint<_,_>>::to::h7b50fa5729cd9809 at /Users/x/.cargo/registry/src/index.crates.io-6f17d22bba15001f/ruint-1.12.3/src/from.rs:236 234 │ T: Debug, 235 │ { 236 > self.uint_try_to().expect("Uint conversion error") 237 │ } 238 │ 17: foundry_evm_traces::debug::decode_from_memory::hb99e9e33dfb84c71 at /Users/x/foundry/crates/evm/traces/src/debug/mod.rs:305 303 │ // For `string` and `bytes` layout is a word with length followed by the data 304 │ DynSolType::String | DynSolType::Bytes => { 305 > let length = U256::from_be_bytes::<32>(first_word.try_into().unwrap()).to::<usize>(); 306 │ let data = memory.get(location + 32..location + 32 + length)?; 307 │ 18: foundry_evm_traces::debug::try_decode_args_from_step::{{closure}}::{{closure}}::h0912b74d67673459 at /Users/x/foundry/crates/evm/traces/src/debug/mod.rs:283 281 │ (DynSolType::Uint(8), Some(Storage::Memory | Storage::Storage)) => None,

hi @ChinW thanks for trying this out and reporting!

I think it's safe to assume that memory wouldn't be expanded for more than usize, though this analysis is not perfect and we might expect to find the length while there's actually some other value (i.e. number > usize::max), which I believe has happened in your case

any chance you could provide a repro for this? also, are you using via-ir here?

Philogy · 2024-06-29T18:35:57Z

Been waiting for this feature forever!

What do reverts look like? Is it able to give you the line you reverted on?

What about solidity native functions like abi.encode, require? I think it'd be really useful if the internal calls were shown with a line number

klkvr · 2024-06-29T23:03:54Z

What do reverts look like? Is it able to give you the line you reverted on?

Current approach relies on solc source map keys of jump type. For JUMPs we are sometimes provided with info on whether this is a jump in or out of the function, and by reading source code you can determine the name, location, input and output types of the function.

However, with optimizations those source maps are getting messed up and you are getting a lot of mismatched ins and outs.

Currently I've been mostly focusing on correctness of the identification, thus we currently only identify a match if we see an explicit JUMP in and JUMP out of the same function. This currently doesn't really work for REVERTs and RETURNs done in low-level assembly, because there are no JUMPs out, just a frame execution end.

We're still ending up with a stack of potentially correct internal fns in those cases, but when I tested this for some random cases this stack usually contained some invalid data which I wouldn't want to display.

So currently there is definitely space for improvement of identification, likely through more "guessing"-approach relying on multiple factors and guided by AST, source maps and bytecode analysis.

What about solidity native functions like abi.encode, require? I think it'd be really useful if the internal calls were shown with a line number

While I was reading source maps I've seen that solc marks JUMPs into abi.encode/abi.decode as jumps in, so it should be possible to identify those in the future. Though this is mostly suited for user-defined internal fns at the moment.

require is not treated by solc as a function, though REVERT instruction source mapping is usually pointing to the require source code, so those should be possible to identify as well (we are already doing this in coverage iirc)

IMO all of this is basically a better/more readable UX for the debugger, which can already be used to check the exact line of code where the revert occured

ref foundry-rs/foundry#8222 ref foundry-rs/foundry#8198 Adds structs and extends `TraceWriter` to support formatting of decoded trace steps. Currently two decoding formats are supported: - Internal calls. Similar to a decoded call trace, decoded internal function invocation which spans over multiple steps. Kept as decoded function name, inputs, outputs and index of the last step. - Arbitrary strings. This might be useful for formatting decoded opcodes (e.g. adding `├─ [sload] <slot>` to trace. It might make sense to extend it to something more configurable once we start implementing this

mattsse

lgtm

pending @DaniPopes

crates/forge/src/multi_runner.rs

crates/evm/traces/src/debug/sources.rs

crates/evm/evm/src/executors/trace.rs

crates/evm/traces/src/debug/sources.rs

klkvr · 2024-07-09T14:29:41Z

Updated --decode-internal flag to accept a regex similar to --debug. On large suites --decode-internal easily results in OOM, so I think it's better to restrict its usage in such way

DaniPopes · 2024-07-09T16:12:37Z

Yeah same problem as in the debugger caused by memory snapshots

DaniPopes · 2024-07-09T16:17:37Z

Maybe we can disable memory decoding by default to avoid the memory consumption issue?

klkvr · 2024-07-09T17:10:32Z

Maybe we can disable memory decoding by default to avoid the memory consumption issue?

Yeah, I though about disabling memory tracking if more than one test matched filters. Though not sure how to make this intuitive

Should it be two separate flags, one of which does not require the test function filter?

fix: small debugger updates

0460633

klkvr mentioned this pull request Jun 21, 2024

feat: Add Step to LogCallOrder paradigmxyz/revm-inspectors#150

Merged

[wip] feat: identify internal function invocations in traces

9fd779a

klkvr force-pushed the klkvr/internal-fns-in-traces branch from 3b2b1fe to 9fd779a Compare June 21, 2024 01:24

klkvr added 3 commits June 21, 2024 04:36

fmt

83c7a23

doc

b1a365f

correctly enable tracing

2d17d37

klkvr added 6 commits June 22, 2024 23:29

correctly enable tracing

4728d2e

collect contract definition locs

5ed1abf

feat: print traces in format of Contract::function

6518cb9

Merge branch 'master' into klkvr/internal-fns-in-traces

a038e05

wip

06dc30a

refactor

216e9da

klkvr force-pushed the klkvr/internal-fns-in-traces branch from 5f643a4 to 216e9da Compare June 23, 2024 05:17

clippy

d92f436

klkvr mentioned this pull request Jun 23, 2024

Support for flamegraph #7761

Open

klkvr added 3 commits June 23, 2024 08:28

fix doc

7fa698b

track input/output values

b3ef110

Merge branch 'master' into klkvr/internal-fns-in-traces

3d59b3f

clippy

5972083

klkvr mentioned this pull request Jun 24, 2024

feat: small updates for steps tracing paradigmxyz/revm-inspectors#152

Merged

zerosnacks mentioned this pull request Jun 25, 2024

Support for internal function jump trace #4351

Open

mattsse requested changes Jun 25, 2024

View reviewed changes

zerosnacks mentioned this pull request Jun 26, 2024

Best in class Gas Reporting #1795

Closed

klkvr added 3 commits June 26, 2024 12:36

clean up

e9e97a0

Merge branch 'master' into klkvr/internal-fns-in-traces

44e976d

TraceMode

08fd6c5

klkvr marked this pull request as ready for review June 27, 2024 08:13

klkvr requested review from DaniPopes and Evalir as code owners June 27, 2024 08:13

klkvr changed the title ~~[wip] feat: identify internal function invocations in traces~~ feat: identify internal function invocations in traces Jun 27, 2024

mattsse reviewed Jun 27, 2024

View reviewed changes

crates/cli/src/utils/cmd.rs Show resolved Hide resolved

crates/evm/evm/src/executors/invariant/replay.rs Outdated Show resolved Hide resolved

crates/evm/traces/src/debug/sources.rs Show resolved Hide resolved

ChinW reviewed Jun 28, 2024

View reviewed changes

klkvr added 4 commits June 28, 2024 15:47

safer decofing from stack and memory

7d362f7

use Into<Option<TraceMode>>

4d44529

TraceMode::None

ecb5f13

fmt

6503571

Merge branch 'master' into klkvr/internal-fns-in-traces

7976e27

Merge branch 'master' into klkvr/internal-fns-in-traces

3a01a97

zemse mentioned this pull request Jun 30, 2024

[WIP] Support for Flamegraph #8315

Draft

klkvr mentioned this pull request Jul 1, 2024

feat: add decoding for individual trace steps paradigmxyz/revm-inspectors#157

Merged

Merge branch 'master' into klkvr/internal-fns-in-traces

0031d77

mattsse approved these changes Jul 9, 2024

View reviewed changes

Merge branch 'master' into klkvr/internal-fns-in-traces

e9ec97e

DaniPopes reviewed Jul 9, 2024

View reviewed changes

klkvr added 2 commits July 9, 2024 17:00

review fixes

ec783d6

--decode-internal for single fn

ff2a11b

klkvr added 2 commits July 10, 2024 00:17

use Vec

e46c017

TraceMode builder

062d550

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: identify internal function invocations in traces #8222

feat: identify internal function invocations in traces #8222

klkvr commented Jun 21, 2024 •

edited by zerosnacks

Loading

klkvr commented Jun 24, 2024 •

edited

Loading

mattsse left a comment

mattsse Jun 25, 2024

klkvr commented Jun 27, 2024

mattsse left a comment

ChinW Jun 28, 2024

ChinW Jun 28, 2024

klkvr Jun 28, 2024 •

edited

Loading

Philogy commented Jun 29, 2024

klkvr commented Jun 29, 2024 •

edited

Loading

mattsse left a comment

klkvr commented Jul 9, 2024 •

edited

Loading

DaniPopes commented Jul 9, 2024

DaniPopes commented Jul 9, 2024

klkvr commented Jul 9, 2024

		if self.tracer.is_none() && yes \|\|
		!self.tracer.as_ref().map_or(false, \|t\| t.config().record_steps) && debug

feat: identify internal function invocations in traces #8222

Are you sure you want to change the base?

feat: identify internal function invocations in traces #8222

Conversation

klkvr commented Jun 21, 2024 • edited by zerosnacks Loading

Motivation

Example

Solution

klkvr commented Jun 24, 2024 • edited Loading

mattsse left a comment

Choose a reason for hiding this comment

mattsse Jun 25, 2024

Choose a reason for hiding this comment

klkvr commented Jun 27, 2024

mattsse left a comment

Choose a reason for hiding this comment

ChinW Jun 28, 2024

Choose a reason for hiding this comment

ChinW Jun 28, 2024

Choose a reason for hiding this comment

klkvr Jun 28, 2024 • edited Loading

Choose a reason for hiding this comment

Philogy commented Jun 29, 2024

klkvr commented Jun 29, 2024 • edited Loading

mattsse left a comment

Choose a reason for hiding this comment

klkvr commented Jul 9, 2024 • edited Loading

DaniPopes commented Jul 9, 2024

DaniPopes commented Jul 9, 2024

klkvr commented Jul 9, 2024

klkvr commented Jun 21, 2024 •

edited by zerosnacks

Loading

klkvr commented Jun 24, 2024 •

edited

Loading

klkvr Jun 28, 2024 •

edited

Loading

klkvr commented Jun 29, 2024 •

edited

Loading

klkvr commented Jul 9, 2024 •

edited

Loading