rocFFT and hipFFT examples (part II) #160

Beanavil · 2024-09-06T12:16:21Z

This pull request contains the second batch of the new rocFFT and hipFFT examples. Added samples:

rocFFT
- complex_complex
- complex_real
- real_complex
hipFFT
- multi_gpu

evetsso · 2024-09-06T17:45:40Z

Common/example_utils.hpp

+void print_nd_data(const std::vector<Tdata> data,
+                   std::vector<Tsize>       np,
+                   const int                column_width     = 4,
+                   const bool               reverse_indexing = false)


I think this would be easier to understand and use if it was either renamed to row_major (since the deffault looks like column-major), or if this was replaced by an enum that spelled out exactly what the caller wanted.

Hm yes, this was added to support printing data with either layout. IINM the default is row-major (np[0] is assumed to be the size of the outermost dimension), and when reversing the ordering of the dimension sizes we assume a column-major layout. Changed this parameter to column_major.

malcolmroberts · 2024-09-06T21:28:38Z

I'm having some trouble getting the example to work with the cuda backend; cmake issues show up.

malcolmroberts

cuda compilation doesn't seem to work. CI coverage is probably needed here.

Beanavil · 2024-09-09T14:22:53Z

cuda compilation doesn't seem to work. CI coverage is probably needed here.

@malcolmroberts what errors are you getting? On our end (internal CI, and I also built it locally to double-check) it builds without problems

evetsso · 2024-09-17T14:43:49Z

Common/example_utils.hpp

+/// layout, the \p column_major parameter must be set to \p true for a correct interpretation
+/// of the dimensions' sizes.
+template<class Tdata, class Tsize>
+void print_nd_data(const std::vector<Tdata> data,


data should be a const reference?

Could be, yes. Updated this!

Beanavil · 2024-10-07T16:27:26Z

@malcolmroberts @evetsso what's the status of the review for this? I think the only issues left to tackle were the ones mentioned in this comment about the CUDA build, but this is not showing up on our end so without further details I cannot move forward on that

malcolmroberts · 2024-10-11T21:43:58Z

Sorry for the delay - jumping back into this again.

When I configure with
cmake -DCMAKE_CXX_COMPILER=hipcc -DGPU_RUNTIME=CUDA ..
I get errors like

[ 16%] Building CUDA object multi_gpu/CMakeFiles/hipfft_multi_gpu.dir/main.cpp.o
In file included from /home/AMD/marobert/repo/rocm-examples/Libraries/hipFFT/multi_gpu/main.cpp:24:
/home/AMD/marobert/repo/rocm-examples/Libraries/hipFFT/multi_gpu/../../../Common/example_utils.hpp:50:10: fatal error: hip/hip_runtime.h: No such file or directory
   50 | #include <hip/hip_runtime.h>
      |          ^~~~~~~~~~~~~~~~~~~
```

Beanavil · 2024-10-14T19:18:18Z

@malcolmroberts Oh I see, but IINM that error should be happening for other samples because it seems the HIP headers are not found. I can reproduce this on my side if for instance I rename the /opt/rocm/include/hip folder to something else. Could you verify if you have the hip headers present on your machine?

malcolmroberts · 2024-10-16T21:52:03Z

Hey, @Beanavil ; I do have /opt/rocm/include/hip populated on my test machine.

However, if there isn't a find_package( hip REQUIRED ) then I wouldn't expect it to include or link the hip runtime. The files are .cpp, so the cmake setup isn't detecting the hip programming language, and I don't think that the runtime is going to be automatically included. I'm configuring/compiling from the Libraries/hipFFT subdirectory.

evetsso · 2024-10-16T22:13:23Z

@Beanavil Just to be totally clear here, are you also building these examples under CUDA?

Beanavil · 2024-10-28T18:09:28Z

@evetsso @malcolmroberts sorry for the delayed response, I got totally caught up on other tasks. Yes, we build the examples for CUDA and HIP backend. In fact, we use the dockerfiles provided by this repo for this, so I think if you use a container built from the cuda image the build should be successful.

I'm trying to figure out what could be triggering the failures in your end, I don't think a find_package( hip REQUIRED ) is what's missing here because IINM when we link to hipFFT we are already including the dependency on HIP. On my end, for instance, I can see that when I invoke cmake it does add the compiler option for including the hip headers (-isystem=/opt/rocm/include):

developer@2c1b9a595f95:~/rocm-examples/Libraries/hipFFT/build$ cmake --build . -v
...
[ 16%] Building CUDA object multi_gpu/CMakeFiles/hipfft_multi_gpu.dir/main.cpp.o
cd /home/developer/rocm-examples/Libraries/hipFFT/build/multi_gpu && /usr/local/cuda/bin/nvcc 
-forward-unknown-to-host-compiler  -I/home/developer/rocm-examples/Libraries/hipFFT/multi_gpu/../../../Common 
+ -isystem=/opt/rocm/include --generate-code=arch=compute_52,code=[compute_52,sm_52] -std=c++17 -MD -MT 
multi_gpu/CMakeFiles/hipfft_multi_gpu.dir/main.cpp.o -MF CMakeFiles/hipfft_multi_gpu.dir/main.cpp.o.d -x cu 
-c /home/developer/rocm-examples/Libraries/hipFFT/multi_gpu/main.cpp -o CMakeFiles/hipfft_multi_gpu.dir/main.cpp.o
...

Could you also show how is the compiler being run on your end? This is done with the -v option that I show in the previous code.

Beanavil self-assigned this Sep 6, 2024

Beanavil marked this pull request as ready for review September 6, 2024 12:26

Beanavil requested review from a team and dgaliffiAMD as code owners September 6, 2024 12:26

Beanavil requested review from malcolmroberts and evetsso September 6, 2024 12:27

evetsso reviewed Sep 6, 2024

View reviewed changes

malcolmroberts requested changes Sep 6, 2024

View reviewed changes

Beanavil force-pushed the fft-multigpu-plan branch 3 times, most recently from c08fc9f to b88aa6e Compare September 17, 2024 09:20

evetsso reviewed Sep 17, 2024

View reviewed changes

Naraenda and others added 4 commits September 18, 2024 16:17

feat(hipFFT/multi_gpu): added hipFFT multi GPU example

f2f9dd0

Resolve "rocFFT plan_1d_{d2z,z2z} Example"

7d868b9

Updated URLs in README.md

e8ff778

Added VS files for hipFFT/multi_gpu example

e5f6539

Beanavil force-pushed the fft-multigpu-plan branch 2 times, most recently from f0b29b0 to a7478f1 Compare September 18, 2024 14:25

Snektron and others added 5 commits September 18, 2024 17:08

fix warning in hipfft multi gpu example

a169a91

update formatting for rocm 6.2

5026899

Fixed markdown linting

0a94e70

Fixed CMake linting

95b6880

Clarified use of print_nd_data utility

24d38bd

Beanavil force-pushed the fft-multigpu-plan branch from a7478f1 to 24d38bd Compare September 18, 2024 15:09

dgaliffiAMD requested review from malcolmroberts and evetsso October 2, 2024 20:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rocFFT and hipFFT examples (part II) #160

rocFFT and hipFFT examples (part II) #160

Beanavil commented Sep 6, 2024

evetsso Sep 6, 2024

Beanavil Sep 10, 2024

malcolmroberts commented Sep 6, 2024

malcolmroberts left a comment

Beanavil commented Sep 9, 2024

evetsso Sep 17, 2024

Beanavil Sep 18, 2024

Beanavil commented Oct 7, 2024

malcolmroberts commented Oct 11, 2024

Beanavil commented Oct 14, 2024

malcolmroberts commented Oct 16, 2024

evetsso commented Oct 16, 2024

Beanavil commented Oct 28, 2024

rocFFT and hipFFT examples (part II) #160

Are you sure you want to change the base?

rocFFT and hipFFT examples (part II) #160

Conversation

Beanavil commented Sep 6, 2024

evetsso Sep 6, 2024

Choose a reason for hiding this comment

Beanavil Sep 10, 2024

Choose a reason for hiding this comment

malcolmroberts commented Sep 6, 2024

malcolmroberts left a comment

Choose a reason for hiding this comment

Beanavil commented Sep 9, 2024

evetsso Sep 17, 2024

Choose a reason for hiding this comment

Beanavil Sep 18, 2024

Choose a reason for hiding this comment

Beanavil commented Oct 7, 2024

malcolmroberts commented Oct 11, 2024

Beanavil commented Oct 14, 2024

malcolmroberts commented Oct 16, 2024

evetsso commented Oct 16, 2024

Beanavil commented Oct 28, 2024