Replies: 1 comment 1 reply
-
When you wrap your code in `jit`, the compiler decides on the most effective memory layout for your particular sequence of computations. For that reason, the best advice is to use whatever layout makes the logic of your algorithm clearest.
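A minimal sketch of this point (the function and shapes are illustrative, not from the thread): under `jax.jit`, you write the computation against logical shapes, and XLA is free to pick the physical layout when it compiles the trace — the numerical result is the same either way:

```python
import jax
import jax.numpy as jnp

# A toy computation written once; XLA chooses the physical layout
# when it compiles the traced program, regardless of how you
# ordered the logical dimensions.
@jax.jit
def batched_norm(x):  # x: (N, dims), or whatever order reads best
    return jnp.sqrt(jnp.sum(x * x, axis=-1))

x = jnp.arange(12.0).reshape(4, 3)     # batch-first: (N=4, dims=3)
ref = jnp.sqrt(jnp.sum(x * x, axis=-1))  # un-jitted reference

# Results match; layout decisions happen inside the compiler.
assert jnp.allclose(batched_norm(x), ref)
```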
-
Hi!
I am trying to understand the optimal way of defining tensors, in particular the order of their dimensions when we have batches.
I remember that when running code on the CPU, the optimal approach was to have the N batch elements as the last dimension, using tensor shapes of the form (dims..., N), since the CPU operates on the batch elements one by one, and it's better if it can load and cache the data it needs while working on a specific batch instance. I then learned, while trying to optimize my code, that the opposite is true on the GPU, where shapes of the form (N, dims...) are preferred. I guess this is because the GPU loads the data for all N batch elements at once, so it's worth having them in consecutive memory? Either way, this much makes sense to me.
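The layout difference can be made concrete with array strides (a NumPy sketch with made-up sizes): in row-major order, batch-first (N, dims) keeps each example contiguous in memory, while batch-last (dims, N) spreads one example's values apart:

```python
import numpy as np

dims, N = 3, 4

batch_first = np.zeros((N, dims))  # (N, dims): each example is one contiguous row
batch_last = np.zeros((dims, N))   # (dims, N): one example's values are strided apart

# Strides are in bytes; float64 is 8 bytes.
# batch_first: consecutive values of one example are 8 bytes apart,
# and jumping to the next example skips dims * 8 = 24 bytes.
assert batch_first.strides == (dims * 8, 8)
# batch_last: consecutive values of example j are N * 8 = 32 bytes apart.
assert batch_last.strides == (N * 8, 8)
```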
What doesn't make sense to me is why, in an RNN, the shapes we use are (T, N, dims) when the time dimension is accessed step by step. Doesn't the same idea apply, i.e. that the batch dimension N should come first and the shapes should be (N, T, dims) or something?
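One concrete reason for time-major shapes in JAX (a sketch; the RNN cell below is a trivial placeholder, not a real recurrent layer): `jax.lax.scan` iterates over the leading axis of its input, so a (T, N, dims) array naturally hands the cell one whole-batch (N, dims) slice per time step:

```python
import jax
import jax.numpy as jnp

T, N, dims = 5, 4, 3
xs = jnp.ones((T, N, dims))  # time-major: scan walks axis 0

def cell(carry, x_t):
    # x_t has shape (N, dims): the whole batch for one time step.
    new_carry = carry + x_t.sum(axis=-1)  # toy "hidden state" update
    return new_carry, new_carry

init = jnp.zeros(N)
final, hs = jax.lax.scan(cell, init, xs)

# scan stacks the per-step outputs back along a leading time axis.
assert hs.shape == (T, N)
assert final.shape == (N,)
```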
Thanks for any help!
EDIT: I am even more confused now, since as far as I can tell JAX by default uses a row-major format. So isn't having N as the first dimension the opposite of optimal?