Full documentation for ROCprofiler-SDK is available at Click Here
- HSA API Tracing
- Kernel Dispatch Tracing
- Kernel Dispatch Counter Collection
- Instances are reported as single dimensions
- No serialization
- HIP API Tracing
- ROCTx Tracing
- Tracing ROCProf Tool V3
- Packaging Documentation
- ROCTx start/stop
- Memory Copy Tracing
- Kernel Dispatch Counter Collection – (includes serialization and multidimensional instances)
- Kernel serialization
- Serialization on/off handling
- ROCprof Tool Plugin Interface V3 for Counters and Dimensions
- List metrics support
- Correlation-id retirement
- HIP and HSA trace distinction
- --hip-runtime-trace For Collecting HIP Runtime API Traces
- --hip-compiler-trace For Collecting HIP Compiler generated code Traces
- --hsa-core-trace For Collecting HSA API Traces (core API)
- --hsa-amd-trace For Collecting HSA API Traces (AMD-extension API)
- --hsa-image-trace For Collecting HSA API Traces (Image-extension API)
- --hsa-finalizer-trace For Collecting HSA API Traces (Finalizer-extension API)
- Page Migration Reporting (API)
- Scratch Memory Reporting (API)
- Kernel Dispatch Callback Tracing (API)
- External Correlation ID Request Service (API)
- Buffered counter collection record headers (API)
- Remove HSA dependency from counter collection (API)
- rocprofv3 Multi-GPU support in single-process (tool)
- Agent/Device Counter Collection (API)
- Single JSON output format support (tool)
- Perfetto output format support (.pftrace) (tool)
- Input YAML support for counter collection (tool)
- Input JSON support for counter collection (tool)
- Application Replay (Counter collection)
- PC Sampling (Beta) (API)
- ROCProf V3 Multi-GPU Support:
- Multi-process (multiple files)
- SQ_ACCUM_PREV and SQ_ACCUM_PREV_HIRE overwriting issue
- The rocprofv3 tool now requires -- in front of the application. For detailed usage, please Click Here
- OTF2 Tool Support
- Kernel and Range Filtering
- Counter Collection Definitions in YAML
- Documentation updates (SQ Block, Counter Collection, Tracing, Tool Usage)
- Added rocprofv3 option --kernel-rename
- Added rocprofv3 options for perfetto settings (buffer size, etc.)
- Added CSV columns for kernel trace
- Thread_Id
- Dispatch_Id
- Added CSV column for counter_collection
- Miscellaneous bug fixes
- Support --marker-trace on application linked against old (roctracer) ROCTx (i.e. libroctx64.so)
- Replaced deprecated hipHostMalloc and hipHostFree functions with hipExtHostAlloc and hipFreeHost when ROCm version is greater than or equal to 6.3
- Creation of subdirectory when rocprofv3 --output-file contains a folder path
- Fix misaligned stores (undefined behavior) for buffer records
- Fix crash when only scratch reporting is enabled
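
Several entries above concern the rocprofv3 command line: the split trace flags, --marker-trace, --output-file, and the requirement that -- precede the application. The sketch below only assembles such a command as an argument list so the ordering is visible; it does not run the tool. The helper name `build_rocprofv3_command`, the application `./my_app` and its arguments, and the output path are hypothetical, and it assumes --output-file takes its value as the following argument.

```python
import shlex

# Trace flags named in the changelog entries above; each is a plain switch.
TRACE_FLAGS = (
    "--hip-runtime-trace",
    "--hip-compiler-trace",
    "--hsa-core-trace",
    "--hsa-amd-trace",
    "--hsa-image-trace",
    "--hsa-finalizer-trace",
    "--marker-trace",
)


def build_rocprofv3_command(app_argv, trace_flags=(), output_file=None):
    """Return an argv list of the form: rocprofv3 <options> -- <app> <app args>."""
    unknown = [flag for flag in trace_flags if flag not in TRACE_FLAGS]
    if unknown:
        raise ValueError(f"flags not listed in the changelog: {unknown}")
    if not app_argv:
        raise ValueError("an application command is required after --")

    cmd = ["rocprofv3", *trace_flags]
    if output_file is not None:
        # Assumption: the value follows the option as a separate argument.
        cmd += ["--output-file", output_file]
    # The -- separator marks where tool options end and the application begins.
    cmd.append("--")
    cmd += list(app_argv)
    return cmd


cmd = build_rocprofv3_command(
    ["./my_app", "--size", "1024"],  # hypothetical application and arguments
    trace_flags=["--hip-runtime-trace", "--marker-trace"],
    output_file="traces/run1",  # hypothetical path containing a folder
)
print(shlex.join(cmd))
```

Because everything after -- is passed to the application, an application flag such as the hypothetical `--size` cannot be mistaken for a rocprofv3 option.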