Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SYCL][Graph] Node Profiling #353

Closed
wants to merge 1,803 commits into from
Closed
Show file tree
Hide file tree
Changes from 7 commits
Commits
Show all changes
1803 commits
Select commit Hold shift + click to select a range
292b508
[AMDGPU] Use StringRef::consume_front (NFC)
kazutakahirata Jan 31, 2024
8a98091
[RISCV] Use disjoint flag in or_is_add.
topperc Jan 31, 2024
f2816ff
[InstCombine] Simplify and/or by replacing operands with constants (#…
dtcxzyw Jan 31, 2024
5bb99ed
[clang][Interp] Add inline descriptor to global variables (#72892)
tbaederr Jan 31, 2024
6f35f1d
[NFC] Update .git-blame-ignore-revs for compiler-rt builtins (#79803)
piggynl Jan 31, 2024
9594746
[NFC] Add compiler-rt:* to .github/new-prs-labeler.yml (#79872)
piggynl Jan 31, 2024
c83ec84
[clang][dataflow] Extend debug output for `Environment`. (#79982)
martinboehme Jan 31, 2024
3564666
[AMDGPU]: Fix type signatures for wmma intrinsics, NFC (#80087)
changpeng Jan 31, 2024
ee01a2c
[clang] static operators should evaluate object argument (reland) (#8…
SuperSodaSea Jan 31, 2024
82324bc
[clang][dataflow] In the CFG visualization, mark converged blocks. (#…
martinboehme Jan 31, 2024
b49b3dd
[ADT] Use a constexpr version of llvm::bit_ceil (NFC) (#79709)
kazutakahirata Jan 31, 2024
a3e2315
[SYCL][Fusion] Enable fusion of rounded-range kernels (#12492)
jopperm Jan 31, 2024
f292f90
[InstCombine] Fold select with signbit idiom into fabs (#76342)
dtcxzyw Jan 31, 2024
565490d
[SYCL] [NATIVECPU] Update OneAPI Construction Kit tag (#12543)
PietroGhg Jan 31, 2024
6ec040e
[SYCL][Joint matrix tests] Fix test execution env setting for two tes…
dkhaldi Jan 31, 2024
d71831a
[NFC] [clang-repl] Fix test failures due to incosistent target settings
ChuanqiXu9 Jan 31, 2024
7d492f8
[SYCL][Fusion] Improve error messages on incompatible ND-ranges (#12524)
victor-eds Jan 31, 2024
dc5dca1
[RISCV][Isel] Remove redundant vmerge for the scalable vwadd(u).wv (#…
sun-jacobi Jan 31, 2024
db49319
[mlir][memref] `memref.subview`: Verify result strides (#79865)
matthias-springer Jan 31, 2024
f852503
[CodeGen] Don't include aliases in RegisterClassInfo::IgnoreCSRForAll…
jayfoad Jan 31, 2024
96c907d
Revert "[mlir][memref] `memref.subview`: Verify result strides" (#80116)
matthias-springer Jan 31, 2024
2f20e37
Turn on LLVM_USE_SPLIT_DWARF by default for Linux Debug build (#12527)
jsji Jan 31, 2024
04ff5b8
[SYCL] Ensure that RTDeviceBinaryImage instances have a unique image …
uditagarwal97 Jan 31, 2024
64a849a
[clang][Interp] Support arbitrary precision constants (#79747)
tbaederr Jan 31, 2024
d439f36
Add support of param type for transform.structured.tile_using_forall …
jinchen62 Jan 31, 2024
dd73666
[SME] Stop RA from coalescing COPY instructions that transcend beyond…
sdesmalen-arm Jan 31, 2024
89f87c3
[RISCV][MC] Add MC layer support for the experimental zabha extension…
dtcxzyw Jan 31, 2024
488f88b
[mlir][transform] Add elementwise criteria to `match.structured.body`…
srcarroll Jan 31, 2024
95ef8e3
[mlir][ArmSME] Support 2-way widening outer products (#78975)
c-rhodes Jan 31, 2024
88610b7
[mlir][vector] Disable transpose -> shuffle lowering for scalable vec…
MacDue Jan 31, 2024
ce7cc72
[mlir][memref] `memref.subview`: Verify result strides
matthias-springer Jan 31, 2024
e121c88
[SYCL][COMPAT] Force device function to be inlined (#12550)
Naghasan Jan 31, 2024
9bf5d5c
Merge from 'main' to 'sycl-web' (46 commits)
KseniyaTikhomirova Jan 31, 2024
db1fbd6
[BDCE] Add tests for #80113 (NFC)
nikic Jan 31, 2024
44ba4c7
[GitHub][workflows] Add buildbot information comment to first merged …
DavidSpickett Jan 31, 2024
24a8041
[llvm][Support] Support bright colors in raw_ostream (#80017)
tbaederr Jan 31, 2024
b210cbb
[BDCE] Fix clearing of poison-generating flags
nikic Jan 31, 2024
da784a2
[mlir][IR] Add `RewriterBase::moveBlockBefore` and fix bug in `moveOp…
matthias-springer Jan 31, 2024
942cc9a
Revert "[CodeGen] Don't include aliases in RegisterClassInfo::IgnoreC…
jayfoad Jan 31, 2024
50e80e0
[ValueTracking] Merge `cannotBeOrderedLessThanZeroImpl` into `compute…
dtcxzyw Jan 31, 2024
c2c650f
[AMDGPU] Stop combining arbitrary offsets into PAL relocs (#80034)
jayfoad Jan 31, 2024
0217d2e
[clang][AMDGPU] Remove trialing whitespace in doc
DavidSpickett Jan 31, 2024
b897152
[SYCL][Bindless] Unique sampler addressing modes per dimension (#12109)
Seanst98 Jan 31, 2024
fbbc822
Moves tests to Explicit directory
mfrancepillois Jan 31, 2024
7ff2327
Merge branch 'sycl' into maxime/profiling_v2
mfrancepillois Jan 31, 2024
78e0cca
[mlir] Fix debug output for passes that modify top-level operation. (…
Jezurko Jan 31, 2024
e624648
[mlir][EmitC] Add `verbatim` op (#79584)
simon-camp Jan 31, 2024
5a07774
[SPIR-V] Improve how lowering of formal arguments in SPIR-V Backend i…
VyacheslavLevytskyy Jan 31, 2024
53b9d47
[X86] i256-add - replace i386 triple X32 check prefixes with X86 and …
RKSimon Jan 30, 2024
8d450b4
[X86] mmx-arith.ll - replace X32 check prefixes with X86 + strip cfi …
RKSimon Jan 30, 2024
00a6817
[X86] v4f32-immediate.ll - replace X32 check prefixes with X86
RKSimon Jan 31, 2024
929503e
[X86] v2f32.ll - replace X32 check prefixes with X86 (and add common …
RKSimon Jan 31, 2024
3f5fcb5
Fix MSVC "not all control paths return a value" warning. NFC.
RKSimon Jan 31, 2024
b437014
[OpenMPIRBuilder] Do not call host runtime for GPU teams codegen (#79…
DominikAdamski Jan 31, 2024
cb6240d
[BDCE] Also drop poison-generating metadata
nikic Jan 31, 2024
5cc87b4
[AsmParser] Add missing globals declarations in incomplete IR mode (#…
nikic Jan 31, 2024
a74e9ce
[OpenMP] atomic compare weak : Parser & AST support (#79475)
SunilKuravinakop Jan 31, 2024
3abf55a
[AArch64][SME] Fix inlining bug introduced in #78703 (#79994)
sdesmalen-arm Jan 31, 2024
d309261
[llvm][InstCombine] bitcast bfloat half castpair bug (#79832)
nasherm Jan 31, 2024
d55d72e
[llvm-rc] Support ARM64EC resource generation (#78908)
bylaws Jan 31, 2024
d74619a
[clang] [MinGW] Handle linking ARM64EC code (#78912)
bylaws Jan 31, 2024
042800a
[mlir][ArmSME] Add initial SME vector legalization pass (#79152)
MacDue Jan 31, 2024
912cdd2
[DAG] AddNodeIDCustom - call ShuffleVectorSDNode::getMask once instea…
RKSimon Jan 31, 2024
a82ca1c
[X86] insertps-from-constantpool.ll - replace X32 check prefixes with…
RKSimon Jan 31, 2024
e4af212
[X86] divrem.ll - replace X32 check prefixes with X86
RKSimon Jan 31, 2024
ed11f25
[X86] divide-by-constant.ll - replace X32 check prefixes with X86
RKSimon Jan 31, 2024
824d073
[X86] fold-vector-sext - replace X32 check prefixes with X86
RKSimon Jan 31, 2024
1d8c8f1
[X86] cfguard - replace X32 check prefixes with X86
RKSimon Jan 31, 2024
648eb7c
[X86] divrem8_ext.ll - replace X32 check prefixes with X86
RKSimon Jan 31, 2024
b8f9c8b
[SYCL][Fusion] Silence warning (#12555)
jopperm Jan 31, 2024
cf828ae
[AArch64] Convert concat(uhadd(a,b), uhadd(c,d)) to uhadd(concat(a,c)…
Rin18 Jan 31, 2024
e3c9327
[X86][CodeGen] Set isReMaterializable = 1 for AVX broadcast load
KanRobert Jan 31, 2024
f96e85b
[AMDGPU][GFX12] Add tests for unsupported builtins (#78729)
mariusz-sikora-at-amd Jan 31, 2024
d9e875d
[X86][MC] Support encoding/decoding for APX variant LZCNT/TZCNT/POPCN…
XinWang10 Jan 31, 2024
817d0cb
[InstCombine] Simplify commutative compares of symmetric pairs (#80134)
dtcxzyw Jan 31, 2024
9536a62
[VPlan] Preserve original induction order when creating scalar steps.
fhahn Jan 31, 2024
ab87426
Fix after #79152
JoelWee Jan 31, 2024
c672b34
[mlir][IR] Send missing notifications when inlining a block (#79593)
matthias-springer Jan 31, 2024
7e45cfd
[mlir] Fix ab874268f636bdfc83b567429ca2d2483f7cc831
JoelWee Jan 31, 2024
121a0ef
[mlir][EmitC] Remove unused attribute from verbatim op (#80142)
simon-camp Jan 31, 2024
cec24f0
[VPlan] Update stale test after 9536a6286, fix formatting.
fhahn Jan 31, 2024
c2675ba
[mlir][IR] Send missing notification when splitting a block (#79597)
matthias-springer Jan 31, 2024
de75e50
[ARM][NEON] Add constraint to vld2 Odd/Even Pseudo instructions. (#79…
AlfieRichardsArm Jan 31, 2024
e538486
[Driver] Fix erroneous warning for -fcx-limited-range and -fcx-fortra…
zahiraam Jan 31, 2024
4f32f5d
[AA][JumpThreading] Don't use DomTree for AA in JumpThreading (#79294)
nikic Jan 31, 2024
74bf0b1
[mlir] Lower math dialect later in gpu-lower-to-nvvm-pipeline (#78556)
grypp Jan 31, 2024
9bf4e54
[clang] Represent array refs as `TemplateArgument::Declaration` (#80050)
bolshakov-a Jan 31, 2024
b7738e2
[MIRPrinter] Don't print space when there is no successor (#80143)
DianQK Jan 31, 2024
70fb96a
Revert "[mlir][complex] Prevent underflow in complex.abs (#79786)"
d0k Jan 31, 2024
470e378
[SYCL][Fusion] Handle GEPs that were canonicalized to byte offsets (#…
jopperm Jan 31, 2024
4679132
[flang] Lower ASYNCHRONOUS variables and IO statements (#80008)
jeanPerier Jan 31, 2024
47df391
[clang][Interp] Handle std::move etc. builtins (#70772)
tbaederr Jan 31, 2024
6e6aa44
Revert "[Clang][Sema] fix outline member function template with defau…
erichkeane Jan 31, 2024
31fc0a1
[mlir][mesh] Refactoring code organization, tests and docs (#79606)
sogartar Jan 31, 2024
1bab570
Move the PowerPC/PPCMergeStringPool work to initializer (#77352)
scui-ibm Jan 31, 2024
c651b2b
Fix: CMake Error at cmake/modules/LLVMExternalProjectUtils.cmake:86 (…
hiraditya Jan 31, 2024
8d1b1c9
[AST] Add dump() method to TypeLoc (#65484)
sam-mccall Jan 31, 2024
8841846
[AArch64] MI Scheduler LDP combine follow up (#79003)
sjoerdmeijer Jan 31, 2024
e33dc6b
Add a release note for TypeLoc::dump() support; NFC
AaronBallman Jan 31, 2024
5d7d89d
[AArch64] Use add_and_or_is_add for CSINC (#79552)
davemgreen Jan 31, 2024
32c0048
[clang][Interp] Handle casts between complex types (#79269)
tbaederr Jan 31, 2024
dfd5a64
[clang][Interp] Remove wrong * operator
tbaederr Jan 31, 2024
f2df4bf
[AsmParser] Support non-consecutive global value numbers (#80013)
nikic Jan 31, 2024
0cd8348
[gn build] Port 8d1b1c9b97de
llvmgnsyncbot Jan 31, 2024
5c2da28
[clang][dataflow] fix assert in `Environment::getResultObjectLocation…
paulsemel Jan 31, 2024
bd8bec2
[Flang] Support NULL(procptr): null intrinsic that has procedure poin…
DanielCChen Jan 31, 2024
e34fd2e
[CodeGen] Simplify RegisterClassInfo BitVector comparisons. NFC.
jayfoad Jan 31, 2024
baf1b19
[CodeGen] Use regunits instead of MCRegUnitIterator in RegisterClassI…
jayfoad Jan 31, 2024
98dbc68
Revert "[mlir] Lower math dialect later in gpu-lower-to-nvvm-pipeline…
d0k Jan 31, 2024
2907c63
Revert "[AArch64] Convert concat(uhadd(a,b), uhadd(c,d)) to uhadd(con…
Rin18 Jan 31, 2024
6720e3a
[bazel] Port 31fc0a12e1552e6bcea63ae740f284eaf74f4c17
d0k Jan 31, 2024
d04ae1b
[AArch64] Use DAG->isAddLike in add_and_or_is_add (#79563)
davemgreen Jan 31, 2024
b929be2
[Clang][test] Add fPIC when building shared library (#80065)
jsji Jan 31, 2024
16c4843
[bazel] Add missing header file for 31fc0a12e1552e6bcea63ae740f284eaf…
d0k Jan 31, 2024
8241106
[Exegesis] Print epsilon value in the sched model inconsistency repor…
mshockwave Jan 31, 2024
08c0eb1
[lldb][DataFormatter][NFC] Use GetFirstValueOfLibCXXCompressedPair th…
Michael137 Jan 31, 2024
57c66b3
[lldb] Add support for large watchpoints in lldb (#79962)
jasonmolenda Jan 31, 2024
35a0089
[gn build] Port 57c66b35a885
llvmgnsyncbot Jan 31, 2024
2542876
[Libomptarget] Remove handling of old ctor / dtor entries (#80153)
jhuber6 Jan 31, 2024
372f7dd
[libc++abi] Add temporary workaround to unblock Chrome
ldionne Jan 31, 2024
dad50fe
Add extra printing to TestWatchpointCount.py to debug CI fail
jasonmolenda Jan 31, 2024
151559c
[DirectX][docs] Architecture and design philosophy of DXIL support
bogner Jan 31, 2024
0bf2e66
[SYCL][ESIMD] Implement unified memory API for scatter(usm, ...) (#12…
sarnex Jan 31, 2024
775c285
[lld] enable fixup chains by default (#79894)
rmaz Jan 31, 2024
cf2533e
Collecting more logging to debug CI bots
jasonmolenda Jan 31, 2024
09fc333
[NFC] Fold an `if` statement into `return` of bool expression
shiltian Jan 31, 2024
d6e1ae2
Add logging to WatchpointAlgorithm
jasonmolenda Jan 31, 2024
16a368c
[CI][NFC] Unify naming scheme for SYCL workflows. (#12525)
bader Jan 31, 2024
fa42589
Enable verbose watch log channel to debug x86-64-debian bot
jasonmolenda Jan 31, 2024
1b5daa8
Revert "[CI][NFC] Unify naming scheme for SYCL workflows." (#12567)
bader Jan 31, 2024
7348207
[SYCL][ESIMD][E2E] Disable two LSC tests on DG2 (#12565)
sarnex Jan 31, 2024
4cb13f2
[clang-tidy] Remove cert-dcl21-cpp check (#80181)
carlosgalvezp Jan 31, 2024
51e0d1b
[lldb][progress][NFC] Add unit test for progress reports (#79533)
chelcassanova Jan 31, 2024
c84f2ba
Add one more verbose watchpoint logging for arm-ubuntu
jasonmolenda Jan 31, 2024
dc15524
[flang] DEALLOCATE(pointer) should use PointerDeallocate() (#79702)
klausler Jan 31, 2024
dbf547f
[flang][runtime] Add limit check to MOD/MODULO (#80026)
klausler Jan 31, 2024
6086007
[flang][preprocessor] Replace macros in some #include directives (#80…
klausler Jan 31, 2024
2ba94bf
[flang] Downgrade a too-strong error message to a warning (#80095)
klausler Jan 31, 2024
209fe1f
Revert "[lldb][progress][NFC] Add unit test for progress reports (#79…
chelcassanova Jan 31, 2024
b478d2f
[SYCL] Fix resource leak related to SYCL_FALLBACK_ASSERT (#12532)
aelovikov-intel Jan 31, 2024
99852c0
Fix conflict resolution fa36da7bfefb
jsji Jan 31, 2024
9d41fba
Revert "Add one more verbose watchpoint logging for arm-ubuntu"
jasonmolenda Jan 31, 2024
19f429a
Revert "Enable verbose watch log channel to debug x86-64-debian bot"
jasonmolenda Jan 31, 2024
e95250c
Revert "Add logging to WatchpointAlgorithm"
jasonmolenda Jan 31, 2024
46643e0
Revert "Collecting more logging to debug CI bots"
jasonmolenda Jan 31, 2024
cc4af03
Revert "Add extra printing to TestWatchpointCount.py to debug CI fail"
jasonmolenda Jan 31, 2024
d347c56
Revert "[lldb] Add support for large watchpoints in lldb (#79962)"
jasonmolenda Jan 31, 2024
85e461e
[SYCL][E2E] Disable USM/usm_pooling.cpp on gpu-intel-dg2 (#12564)
aelovikov-intel Jan 31, 2024
742f88e
[gn build] Port d347c564299e
llvmgnsyncbot Jan 31, 2024
c003d85
[clang][DependencyScanner] Remove unused -fmodule-map-file arguments …
Bigcheese Jan 31, 2024
5282202
[LSR] Add a test case mentioned in review
preames Jan 31, 2024
9107904
[Github] Build PGO optimized toolchain in container (#80096)
boomanaiden154 Jan 31, 2024
ebe8733
[ORC] Merge MaterializationResponsibility notifyEmitted and addDepend…
lhames Jan 31, 2024
22773e5
[libc] Fix condition ordering in scanf (#80083)
michaelrj-google Jan 31, 2024
a03a6e9
[AIX] [XCOFF] Add support for common and local common symbols in the …
syzaara Jan 31, 2024
56e241a
[analyzer] Unbreak [[clang::suppress]] on checkers without decl-with-…
haoNoQ Jan 31, 2024
1d14323
[AArch64][SVE2] Generate urshr rounding shift rights (#78374)
UsmanNadeem Jan 31, 2024
4eee045
AMDGPU/NFC: Add predicate for supporting buffer/flat/global f64 atomi…
kzhuravl Jan 31, 2024
0f728a0
[flang][hlfir][NFC] Fix typo
clementval Jan 31, 2024
5e3ae4c
[NVPTX] improve Boolean ISel (#80166)
AlexMaclean Jan 31, 2024
ff53d50
[RISCV] Improve legalization of e8 m8 VL>256 shuffles (#79330)
preames Jan 31, 2024
db68e92
[lldb][NFCI] Remove m_being_created from Breakpoint classes (#79716)
bulbazord Jan 31, 2024
f264da4
[lsr][term-fold] Restrict transform to low cost expansions (#74747)
preames Jan 31, 2024
7c2e32d
Partial revert "[HIP] Fix -mllvm option for device lld linker" (#80202)
yxsamliu Jan 31, 2024
435845b
[CI][NFC] Unify naming scheme for SYCL workflows. (#12568)
bader Jan 31, 2024
a5a8cbb
Reland "[lldb][progress][NFC] Add unit test for progress reports (#79…
chelcassanova Jan 31, 2024
5561bea
[WebAssembly] avoid to enable explicit disabled feature (#80094)
HerrCai0907 Jan 31, 2024
40ebe52
Revert "Reland "[lldb][progress][NFC] Add unit test for progress repo…
chelcassanova Jan 31, 2024
cf401f7
[RISCV] Use Zacas for AtomicRMWInst::Nand i32 and XLen. (#80119)
topperc Jan 31, 2024
0e0d155
[libc][docs] fix stdbit.h docs (#80070)
nickdesaulniers Jan 31, 2024
0e8eb44
[libc] Fix read under msan (#80203)
michaelrj-google Feb 1, 2024
8ba018d
[mlir][Vector] Add support for sub-byte transpose emulation (#80110)
dcaballe Feb 1, 2024
730f498
[mlir][arith] Improve `truncf` folding (#80206)
kuhar Feb 1, 2024
f8be7f2
[llvm-objcopy][test] Use llvm-readelf instead for clearer visualizati…
kongy Feb 1, 2024
a8279a8
[clang][NFC] Move isSimpleTypeSpecifier() from Sema to Token (#80101)
owenca Feb 1, 2024
908fd09
[clang-format] Simplify the AfterPlacementOperator option (#79796)
owenca Feb 1, 2024
994493c
[clang][dataflow][NFC] Rename a confusingly named variable. (#80182)
martinboehme Feb 1, 2024
0c36127
[clang][dataflow] Display line numbers in the HTML logger timeline. (…
martinboehme Feb 1, 2024
147d7a6
[lldb] Add support for large watchpoints in lldb (#79962)
jasonmolenda Feb 1, 2024
19a10c1
[gn build] Port 147d7a64f849
llvmgnsyncbot Feb 1, 2024
995d21b
[SelectOpt] Print instruction instead of pointer
wangpc-pp Jan 31, 2024
5a8f290
[llvm-gsymutil] Print one-time DWO file missing warning under --quiet…
kusmour Feb 1, 2024
3b76b86
Add debug prints to diagnose a crash on arm-ubuntu bot
jasonmolenda Feb 1, 2024
c82a645
[X86][NFC] Simplify the code for memory fold
KanRobert Feb 1, 2024
8eea582
[C++20] [Modules] Introduce -fskip-odr-check-in-gmf (#79959)
ChuanqiXu9 Feb 1, 2024
b777bb7
[clang-tidy] Add AllowStringArrays option to modernize-avoid-c-arrays…
PiotrZSL Feb 1, 2024
9b68c09
[clang-format] Allow decltype in requires clause (#78847)
rymiel Feb 1, 2024
fdd98e5
Skip 2 of the three test sets to narrow down the arm-ubuntu
jasonmolenda Feb 1, 2024
a8f317a
[clang][Interp] complex binary operators aren't always initializing
tbaederr Jan 31, 2024
5d9ffcd
[Github] Build stage2-clang-bolt target for CI container
boomanaiden154 Feb 1, 2024
6ff431b
[clang][Interp] Handle imaginary literals (#79130)
tbaederr Feb 1, 2024
1395e58
[X86][CodeGen] Set mayLoad = 1 for LZCNT/POPCNT/TZCNTrm_(EVEX|NF)
KanRobert Feb 1, 2024
021a2b4
Uncomment the 2GB max tests and see if that works on arm-ubuntu
jasonmolenda Feb 1, 2024
a9e8309
[clang][Interp] Protect Inc/Dec ops against dummy pointers
tbaederr Jan 31, 2024
fa98e28
[clang][Interp][NFC] Remove unused RecordScope
tbaederr Feb 1, 2024
48f8b74
[clang][Interp] Support GenericSelectionExprs
tbaederr Jan 31, 2024
54f324f
Trying to refine which test is crashing on arm-ubuntu.
jasonmolenda Feb 1, 2024
65066c0
[mlir] Use `create` instead of `createOrFold` for ConstantOp as foldi…
nujaa Feb 1, 2024
7ec996d
[GlobalISel][TableGen] Support Intrinsics in MIR Patterns (#79278)
Pierre-vh Feb 1, 2024
e851278
[IR] Use range-based for loops (NFC)
kazutakahirata Feb 1, 2024
39fa304
[llvm] Use StringRef::starts_with (NFC)
kazutakahirata Feb 1, 2024
b67ce7e
[clang] Use StringRef::starts_with (NFC)
kazutakahirata Feb 1, 2024
eaa3d5e
Done iterating with arm-ubuntu bot, I see the problem test.
jasonmolenda Feb 1, 2024
90e6808
Skip two WatchpointAlgorithm tests for 32-bit lldb's
jasonmolenda Feb 1, 2024
5fdf8c6
[mlir][Transforms] `GreedyPatternRewriteDriver`: Hash ops separately …
matthias-springer Feb 1, 2024
84564e1
[flang][NFC] Cache derived type translation in lowering (#80179)
jeanPerier Feb 1, 2024
ae931b4
[Clang][test] Limit library search when linking shared lib (#80253)
apolloww Feb 1, 2024
e7d40a8
[mlir][EmitC] Add func, call and return operations and conversions (…
marbre Feb 1, 2024
d0dbd50
[bazel] Add missing header for 7ec996d4c5c30083b070be4898140440094e6b97
d0k Feb 1, 2024
468b239
[bazel] Merge TableGenGlobalISel into the tablegen target
d0k Feb 1, 2024
395c817
[bazel] Put back the pieces of TableGenGlobalISel that unittests depe…
d0k Feb 1, 2024
415bf20
[llvm-exegesis] Replace --num-repetitions with --min-instructions (#7…
boomanaiden154 Feb 1, 2024
ca7fd25
[bazel] Fix a typo from e7d40a87ff230528131541f6ac17a2e1a7dc78e1
d0k Feb 1, 2024
e9e0167
[flang][HLFIR] Relax verifiers of intrinsic operations (#80132)
tblah Feb 1, 2024
1bbb797
[Clang][AArch64] Add ACLE macros for FEAT_PAuth_LR (#80163)
pratlucas Feb 1, 2024
f34418c
[HWASAN] Remove DW_OP_LLVM_tag_offset from DIExpression::isImplicit (…
OCHyams Feb 1, 2024
96a3d05
[GitHub][workflows] Reflow some text in buildbot info PR comment
DavidSpickett Feb 1, 2024
b5c0b67
[AMDGPU] Check wavefrontsize for GFX11 WMMA builtins (#79980)
jayfoad Feb 1, 2024
da43733
[SCEVExp] Keep NUW/NSW if both original inc and isomporphic inc agree…
fhahn Feb 1, 2024
7d78ccf
[libc++][memory] P2652R2: Disallow Specialization of `allocator_trait…
H-G-Hristov Feb 1, 2024
ea29842
[ARM] Add ctpop codegen tests
RKSimon Feb 1, 2024
af448b0
[SYCL][Fusion] Handle fusion leading to synchronization issues (#12538)
victor-eds Feb 1, 2024
c105848
[IndVars] Add tests for #79861 (NFC)
nikic Feb 1, 2024
e402523
[UR][CUDA] Use new variant of the enableCUDATracing function (#12521)
pasaulais Feb 1, 2024
f9e4f10
[SYCL][CUDA] Improved joint_matrix layout test coverage. (#12483)
JackAKirk Feb 1, 2024
178719e
[RISCV][NFC] Simplify calls.ll and autogenerate checks for tail-calls.ll
wangpc-pp Feb 1, 2024
4bdd647
Merge commit '178719e86043c1e830a7e24de027a84f6f8ea28f' into llvmspir…
sys-ce-bb Feb 1, 2024
b0c60b0
Update LLVM version from 18 to 19 (#2315)
svenvh Jan 29, 2024
6f35f7c
add support for out of bounds load/store (#2277)
VyacheslavLevytskyy Jan 29, 2024
f0ac661
add API to query error message by an error code (#2304)
VyacheslavLevytskyy Jan 29, 2024
cccbd9e
Support llvm.frexp intrinsic translation (#2252)
vmaksimo Jan 29, 2024
6732fee
Fix SPIRVRegularizeLLVMBase::regularize fix for shl i1 and lshr i1 (#…
bwlodarcz Jan 30, 2024
805f842
add initial support for CooperativeMatrixConstructCheckedINTEL (#2331)
VyacheslavLevytskyy Feb 1, 2024
96812b9
Updates UR branch
mfrancepillois Feb 1, 2024
f589d9b
Merge branch 'sycl-upstream' into maxime/profiling_v2
mfrancepillois Feb 1, 2024
f7a360d
[SYCL][NFC] Fix some 'startswith/endswith' related to SYCL (#12573)
jinge90 Feb 1, 2024
21e703a
Revert "Update add-ir-annotations tests after 5518a9d7673b"
sys-ce-bb Feb 1, 2024
0af4ac7
[Driver] Allow for -O3 on Windows using clang-cl (#12504)
mdtoguchi Feb 1, 2024
4fdcb58
[SYCL] Fix compiler crash. (#12324)
zahiraam Feb 1, 2024
8427bd2
[SYCL][HIP][CUDA] Use new version of piMemGetNativeHandle and add tes…
hdelan Feb 1, 2024
6c1dde4
[SYCL][libdevice] Add sqrt with rounding mode supported in sycl::ext:…
jinge90 Feb 1, 2024
0dc97ec
LLVM and SPIRV-LLVM-Translator pulldown (WW05)
bb-sycl Feb 1, 2024
8bfc56f
[SYCL][ESIMD] Fix a few issues with scatter(usm, ...) (#12585)
sarnex Feb 1, 2024
71eee2c
[SYCL] [NATIVECPU] Add OCK subdirectory with EXCLUDE_FROM_ALL (#12579)
PietroGhg Feb 2, 2024
262b44a
Merge branch 'sycl-upstream' into maxime/profiling_v2
mfrancepillois Feb 2, 2024
9b2e77a
[SYCL] Disable dynamic_address_cast test on FPGA (#12561)
Pennycook Feb 2, 2024
46bce9c
Updates function name
mfrancepillois Feb 2, 2024
35f9696
[CI] Modify Nightly task to run opencl:cpu testing on different CPUs …
aelovikov-intel Feb 2, 2024
faad41d
[UR][CL] Handle INTERMEDIATE binary type (#12562)
kbenzie Feb 2, 2024
30ab2fe
Merge branch 'sycl-upstream' into maxime/profiling_v2
mfrancepillois Feb 2, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
51 changes: 49 additions & 2 deletions sycl/doc/extensions/experimental/sycl_ext_oneapi_graph.asciidoc
Original file line number Diff line number Diff line change
Expand Up @@ -426,6 +426,13 @@ public:
void ext_oneapi_graph(command_graph<graph_state::executable>& graph);
}

// New methods added to the sycl::event class
class event {
public:
template <typename Param>
typename Param::return_type ext_oneapi_get_profiling_info(node Node) const;
}

} // namespace sycl
----

Expand Down Expand Up @@ -1107,6 +1114,44 @@ Exceptions:

|===

==== New Event Member Functions

Table {counter: tableNumber}. Additional member functions of the `sycl::event` class.
[cols="2a,a"]
|===
|Member function|Description

|
[source,c++]
----
template <typename Param>
typename Param::return_type
event::ext_oneapi_get_profiling_info(node Node) const;
----

| Queries the profiling information of a SYCL Graph node for the graph
execution associated with this SYCL event. If the requested info is
not available when this member function is called due to incompletion of
command groups associated with the event, then the call to this member
function will block until the requested info is available.

Parameters:

* `Node` - Handle to the node for which the profiling information is queried.
mfrancepillois marked this conversation as resolved.
Show resolved Hide resolved

Exceptions:

EwanC marked this conversation as resolved.
Show resolved Hide resolved
* Throws synchronously with error code `invalid` if this SYCL event is not
associated to a graph execution.
mfrancepillois marked this conversation as resolved.
Show resolved Hide resolved
* Throws synchronously with error code `invalid` if the queue on which
the graph is submitted was not constructed with
mfrancepillois marked this conversation as resolved.
Show resolved Hide resolved
the `property::queue::enable_profiling` property.
* Throws synchronously with error code `invalid` if `Node` is not associated
to the graph that was executed by event.
mfrancepillois marked this conversation as resolved.
Show resolved Hide resolved

|===


=== Thread Safety

The new functions in this extension are thread-safe, the same as member
Expand Down Expand Up @@ -1810,8 +1855,10 @@ if used in application code.
. Using `handler::memadvise` in a graph node.
. Using reductions in a graph node.
. Using sycl streams in a graph node.
. Profiling an event returned from graph submission with
`event::get_profiling_info()`.
. Profiling information is not available for graphs that contain host-task nodes.
Bensuo marked this conversation as resolved.
Show resolved Hide resolved
. Profiling a node from an event returned from graph submission with
`event::get_profiling_info(ext::node)` is only avalaible for
EwanC marked this conversation as resolved.
Show resolved Hide resolved
the level-zero backend.
. Level Zero immediate command-lists are not supported, and
`sycl::ext::intel::property::queue::no_immediate_command_list`
should be set on construction to any queues an executable
Expand Down
1 change: 1 addition & 0 deletions sycl/include/sycl/detail/pi.def
Original file line number Diff line number Diff line change
Expand Up @@ -181,6 +181,7 @@ _PI_API(piextCommandBufferFillUSM)
_PI_API(piextCommandBufferPrefetchUSM)
_PI_API(piextCommandBufferAdviseUSM)
_PI_API(piextEnqueueCommandBuffer)
_PI_API(piextSyncPointGetProfilingInfo)

_PI_API(piextUSMPitchedAlloc)

Expand Down
15 changes: 15 additions & 0 deletions sycl/include/sycl/detail/pi.h
EwanC marked this conversation as resolved.
Show resolved Hide resolved
Original file line number Diff line number Diff line change
Expand Up @@ -2571,6 +2571,21 @@ piextEnqueueCommandBuffer(pi_ext_command_buffer command_buffer, pi_queue queue,
pi_uint32 num_events_in_wait_list,
const pi_event *event_wait_list, pi_event *event);

/// API to get the profiling information of a graph node.
/// A Node is identified by a sync-point in a command-buffer.
/// The sync-point passed in parameter corresponds therefore to the node from
/// which we want to get the profiling information. returns an error if the node
/// is found. \param event PI event that has been return from the command-buffer
/// submission. \param sync_point The sync-point corresponding to the node from
/// which we want to get the profiling information. \param param_name The name
/// of the profiling property to query depends on. \param param_value_size Size
/// in bytes of the profiling property value. \param param_value Value of the
/// profiling property. \param param_value_size_ret pointer to the actual size
/// in bytes returned in param_value of the profiling property.
EwanC marked this conversation as resolved.
Show resolved Hide resolved
__SYCL_EXPORT pi_result piextSyncPointGetProfilingInfo(
pi_event event, pi_ext_sync_point sync_point, pi_profiling_info param_name,
size_t param_value_size, void *param_value, size_t *param_value_size_ret);

/// API to destroy bindless unsampled image handles.
///
/// \param context is the pi_context
Expand Down
20 changes: 20 additions & 0 deletions sycl/include/sycl/event.hpp
Original file line number Diff line number Diff line change
Expand Up @@ -14,6 +14,7 @@
#include <sycl/detail/info_desc_helpers.hpp> // for is_event_info_desc, is_...
#include <sycl/detail/owner_less_base.hpp> // for OwnerLessBase
#include <sycl/detail/pi.h> // for pi_native_handle
#include <sycl/ext/oneapi/experimental/graph.hpp>

#ifdef __SYCL_INTERNAL_API
#include <sycl/detail/cl.h>
Expand Down Expand Up @@ -130,6 +131,25 @@ class __SYCL_EXPORT event : public detail::OwnerLessBase<event> {
typename detail::is_event_profiling_info_desc<Param>::return_type
get_profiling_info() const;

/// Queries the profiling information of a SYCL Graph node for the graph
/// execution associated with this SYCL event.
///
/// If this SYCL event is not associated to a graph execution, an
mfrancepillois marked this conversation as resolved.
Show resolved Hide resolved
/// invalid_object_error SYCL exception is thrown. If the requested info is
/// not available when this member function is called due to incompletion of
/// command groups associated with the event, then the call to this member
/// function will block until the requested info is available. If the queue
/// which submitted the command group this event is associated with was not
/// constructed with the property::queue::enable_profiling property, an
/// invalid_object_error SYCL exception is thrown.
///
/// \param Node is the handle to the node for which the profiling information
/// is queried.
mfrancepillois marked this conversation as resolved.
Show resolved Hide resolved
/// \return depends on template parameter.
template <typename Param>
typename detail::is_event_profiling_info_desc<Param>::return_type
ext_oneapi_get_profiling_info(ext::oneapi::experimental::node Node) const;

/// Returns the backend associated with this platform.
///
/// \return the backend associated with this platform
Expand Down
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
__SYCL_PARAM_TRAITS_SPEC_PARAMT(event_profiling, command_submit, ext::oneapi::experimental::node, uint64_t, PI_PROFILING_INFO_COMMAND_SUBMIT)
__SYCL_PARAM_TRAITS_SPEC_PARAMT(event_profiling, command_start, ext::oneapi::experimental::node, uint64_t, PI_PROFILING_INFO_COMMAND_START)
__SYCL_PARAM_TRAITS_SPEC_PARAMT(event_profiling, command_end, ext::oneapi::experimental::node, uint64_t, PI_PROFILING_INFO_COMMAND_END)
9 changes: 9 additions & 0 deletions sycl/include/sycl/info/info_desc.hpp
Original file line number Diff line number Diff line change
Expand Up @@ -36,6 +36,11 @@ namespace info {
struct Desc { \
using return_type = ReturnT; \
};
#define __SYCL_PARAM_TRAITS_SPEC_PARAMT(DescType, Desc, ParamType, ReturnT, \
PiCode) \
struct Desc { \
using return_type = ReturnT; \
};
// A.1 Platform information desctiptors
namespace platform {
// TODO Despite giving this deprecation warning, we're still yet to implement
Expand Down Expand Up @@ -155,7 +160,11 @@ namespace event {
namespace event_profiling {
#include <sycl/info/event_profiling_traits.def>
} // namespace event_profiling
namespace ext_oneapi_event_profiling {
#include <sycl/info/ext_oneapi_graph_node_profiling_traits.def>
} // namespace ext_oneapi_event_profiling
#undef __SYCL_PARAM_TRAITS_SPEC
#undef __SYCL_PARAM_TRAITS_SPEC_PARAMT

// Provide an alias to the return type for each of the info parameters
template <typename T, T param> class param_traits {};
Expand Down
8 changes: 8 additions & 0 deletions sycl/plugins/cuda/pi_cuda.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -1185,6 +1185,14 @@ pi_result piextEnqueueCommandBuffer(pi_ext_command_buffer CommandBuffer,
CommandBuffer, Queue, NumEventsInWaitList, EventWaitList, Event);
}

pi_result piextSyncPointGetProfilingInfo(
pi_event Event, pi_ext_sync_point SyncPoint, pi_profiling_info ParamName,
size_t ParamValueSize, void *ParamValue, size_t *ParamValueSizeRet) {
return pi2ur::piextSyncPointGetProfilingInfo(Event, SyncPoint, ParamName,
ParamValueSize, ParamValue,
ParamValueSizeRet);
}

pi_result piextPluginGetOpaqueData(void *opaque_data_param,
void **opaque_data_return) {
return pi2ur::piextPluginGetOpaqueData(opaque_data_param, opaque_data_return);
Expand Down
8 changes: 8 additions & 0 deletions sycl/plugins/hip/pi_hip.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -1193,6 +1193,14 @@ pi_result piextEnqueueCommandBuffer(pi_ext_command_buffer CommandBuffer,
CommandBuffer, Queue, NumEventsInWaitList, EventWaitList, Event);
}

pi_result piextSyncPointGetProfilingInfo(
pi_event event, pi_ext_sync_point SyncPoint, pi_profiling_info param_name,
size_t param_value_size, void *param_value, size_t *param_value_size_ret) {
return pi2ur::piEventGetProfilingInfo(Event, SyncPoint, ParamName,
ParamValueSize, ParamValue,
ParamValueSizeRet);
}
mfrancepillois marked this conversation as resolved.
Show resolved Hide resolved

pi_result piextPluginGetOpaqueData(void *opaque_data_param,
void **opaque_data_return) {
return pi2ur::piextPluginGetOpaqueData(opaque_data_param, opaque_data_return);
Expand Down
8 changes: 8 additions & 0 deletions sycl/plugins/level_zero/pi_level_zero.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -1351,6 +1351,14 @@ pi_result piextEnqueueCommandBuffer(pi_ext_command_buffer CommandBuffer,
CommandBuffer, Queue, NumEventsInWaitList, EventWaitList, Event);
}

pi_result piextSyncPointGetProfilingInfo(
pi_event Event, pi_ext_sync_point SyncPoint, pi_profiling_info ParamName,
size_t ParamValueSize, void *ParamValue, size_t *ParamValueSizeRet) {
return pi2ur::piextSyncPointGetProfilingInfo(Event, SyncPoint, ParamName,
ParamValueSize, ParamValue,
ParamValueSizeRet);
}

const char SupportedVersion[] = _PI_LEVEL_ZERO_PLUGIN_VERSION_STRING;

pi_result piPluginInit(pi_plugin *PluginInit) { // missing
Expand Down
8 changes: 8 additions & 0 deletions sycl/plugins/native_cpu/pi_native_cpu.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -1176,6 +1176,14 @@ pi_result piextEnqueueCommandBuffer(pi_ext_command_buffer CommandBuffer,
CommandBuffer, Queue, NumEventsInWaitList, EventWaitList, Event);
}

pi_result piextSyncPointGetProfilingInfo(
pi_event Event, pi_ext_sync_point SyncPoint, pi_profiling_info ParamName,
size_t ParamValueSize, void *ParamValue, size_t *ParamValueSizeRet) {
return pi2ur::piextSyncPointGetProfilingInfo(Event, SyncPoint, ParamName,
ParamValueSize, ParamValue,
ParamValueSizeRet);
}

pi_result piextPluginGetOpaqueData(void *opaque_data_param,
void **opaque_data_return) {
return pi2ur::piextPluginGetOpaqueData(opaque_data_param, opaque_data_return);
Expand Down
8 changes: 8 additions & 0 deletions sycl/plugins/opencl/pi_opencl.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -1129,6 +1129,14 @@ pi_result piextEnqueueCommandBuffer(pi_ext_command_buffer CommandBuffer,
CommandBuffer, Queue, NumEventsInWaitList, EventWaitList, Event);
}

pi_result piextSyncPointGetProfilingInfo(
pi_event Event, pi_ext_sync_point SyncPoint, pi_profiling_info ParamName,
size_t ParamValueSize, void *ParamValue, size_t *ParamValueSizeRet) {
return pi2ur::piextSyncPointGetProfilingInfo(Event, SyncPoint, ParamName,
ParamValueSize, ParamValue,
ParamValueSizeRet);
}

pi_result piextPluginGetOpaqueData(void *opaque_data_param,
void **opaque_data_return) {
return pi2ur::piextPluginGetOpaqueData(opaque_data_param, opaque_data_return);
Expand Down
12 changes: 3 additions & 9 deletions sycl/plugins/unified_runtime/CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -56,15 +56,9 @@ endif()
if(SYCL_PI_UR_USE_FETCH_CONTENT)
include(FetchContent)

set(UNIFIED_RUNTIME_REPO "https://github.com/oneapi-src/unified-runtime.git")
# commit 8d3233af35ed5515dd966c08d1af04b76113d3a8
# Merge: eddda123 2bd41d7b
# Author: Kenneth Benzie (Benie) <k.benzie@codeplay.com>
# Date: Wed Jan 24 14:27:54 2024 +0000
# Merge pull request #1247 from PietroGhg/pietro/empty_cmd_buffer_fill
# [NATIVECPU] Empty implementation for command buffer fill
set(UNIFIED_RUNTIME_TAG 8d3233af35ed5515dd966c08d1af04b76113d3a8)

set(UNIFIED_RUNTIME_REPO "https://github.com/bensuo/unified-runtime.git")
set(UNIFIED_RUNTIME_TAG cmdbuf-node-profiling)

if(SYCL_PI_UR_OVERRIDE_FETCH_CONTENT_REPO)
set(UNIFIED_RUNTIME_REPO "${SYCL_PI_UR_OVERRIDE_FETCH_CONTENT_REPO}")
endif()
Expand Down
37 changes: 37 additions & 0 deletions sycl/plugins/unified_runtime/pi2ur.hpp
Original file line number Diff line number Diff line change
Expand Up @@ -4137,6 +4137,43 @@ inline pi_result piEventGetProfilingInfo(pi_event Event,
return PI_SUCCESS;
}

inline pi_result piextSyncPointGetProfilingInfo(
pi_event Event, pi_ext_sync_point SyncPoint, pi_profiling_info ParamName,
size_t ParamValueSize, void *ParamValue, size_t *ParamValueSizeRet) {

PI_ASSERT(Event, PI_ERROR_INVALID_EVENT);

ur_event_handle_t UREvent = reinterpret_cast<ur_event_handle_t>(Event);

ur_profiling_info_t PropName{};
switch (ParamName) {
case PI_PROFILING_INFO_COMMAND_QUEUED: {
PropName = UR_PROFILING_INFO_COMMAND_QUEUED;
break;
}
case PI_PROFILING_INFO_COMMAND_SUBMIT: {
PropName = UR_PROFILING_INFO_COMMAND_SUBMIT;
break;
}
case PI_PROFILING_INFO_COMMAND_START: {
PropName = UR_PROFILING_INFO_COMMAND_START;
break;
}
case PI_PROFILING_INFO_COMMAND_END: {
PropName = UR_PROFILING_INFO_COMMAND_END;
break;
}
default:
return PI_ERROR_INVALID_PROPERTY;
}

HANDLE_ERRORS(urSyncPointGetProfilingInfoExp(UREvent, SyncPoint, PropName,
ParamValueSize, ParamValue,
ParamValueSizeRet));

return PI_SUCCESS;
}

inline pi_result piEventCreate(pi_context Context, pi_event *RetEvent) {

ur_context_handle_t UrContext =
Expand Down
Loading
Loading