I was wondering how MGPUSim deals with cache coherence. For example, imagine a scenario where the same data sits in two different L2 caches on two different GPUs, and a write happens on one of the GPUs. How does MGPUSim tell the other cache to invalidate its data? The same situation could arise with L1 caches as well.
MGPUSim does not maintain cache coherence. This is not a missing feature; GPUs do not need coherence.

Since L2 caches are memory-side caches, a piece of data never appears in two L2 caches. Each L2 cache is mapped to a range of memory, so the address of a cache line determines which L2 cache it lives in. This rule applies to both single-GPU and multi-GPU configurations.

For L1 caches, the problem is more realistic, as a piece of data can be present in multiple L1 caches at the same time. However, the L1 caches are write-through caches, so every write updates both the L1 and the L2 cache. If another CU needs the data, it reads it directly from the L2 cache and gets the updated value, mitigating the coherence problem.

Still, write-through caches alone cannot fully solve the problem. It is still possible for two L1 caches to hold the same data, with one CU writing an update while the other CU reads the stale copy (let's call this one-write-one-read).

To understand why not supporting coherence is not a problem for MGPUSim, we first need to look at the GPU programming model. GPU programs are typically written so that each thread is responsible for generating part of the results, and the results generated by different threads must not overlap. Also, results written to main memory should never be consumed (read) by another thread within the same kernel. Violating these rules causes wrong or undefined behavior. If a GPU thread needs data written by another thread, the only options are atomic instructions or launching another kernel.

Following these rules, the one-write-one-read problem never happens within a kernel. If it does happen, the results are undefined both on real GPUs and in MGPUSim. So we only need to consider the one-write-one-read problem across kernels. MGPUSim's solution is to flush (invalidate all lines in) the L1 caches at kernel boundaries. At the beginning of a kernel, all memory reads therefore go to the L2 caches, ensuring they fetch the updated data.

For now, we do not support atomic instructions. Please let me know if you can think of other situations that are not covered.
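To make the memory-side L2 point concrete, here is a minimal Go sketch of address-interleaved L2 mapping. This is not MGPUSim's actual code or API; the cache-line size, the number of L2 slices, and the interleaving scheme are assumptions for illustration. The key property is that the slice index is a pure function of the address, so a given cache line can only ever live in one L2.

```go
package main

import "fmt"

// Assumed values for illustration only; the real line size and the number
// of L2 slices depend on the simulated GPU configuration.
const (
	cacheLineSize uint64 = 64 // bytes per cache line (assumed)
	numL2Slices   uint64 = 8  // L2 slices, e.g., one per memory controller (assumed)
)

// l2SliceFor returns the index of the L2 slice that owns the given physical
// address, using simple cache-line interleaving. Because the mapping depends
// only on the address, the same line always resolves to the same L2 slice.
func l2SliceFor(addr uint64) uint64 {
	lineID := addr / cacheLineSize
	return lineID % numL2Slices
}

func main() {
	// Two accesses to the same cache line map to the same L2 slice,
	// no matter which GPU or CU issued them.
	fmt.Println(l2SliceFor(0x1000), l2SliceFor(0x1008)) // same line -> same slice
	fmt.Println(l2SliceFor(0x1040))                     // next line -> next slice
}
```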
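The kernel-boundary flush can be sketched the same way. Again, this is an illustrative toy in Go, not MGPUSim's implementation: the `l1Cache` type and `launchKernel` helper are hypothetical, and only show the ordering of invalidate-then-run that makes the first read of each kernel go to the L2.

```go
package main

import "fmt"

// l1Cache is a toy stand-in for an L1 cache that only tracks which line
// tags are currently valid.
type l1Cache struct {
	valid map[uint64]bool
}

// invalidateAll models the flush performed at a kernel boundary: every line
// is dropped, so the first read in the next kernel must go to the L2.
func (c *l1Cache) invalidateAll() {
	c.valid = make(map[uint64]bool)
}

// launchKernel invalidates all L1 caches before running the kernel body, so
// no stale line written by a previous kernel through another CU's L1 can be
// read by this kernel.
func launchKernel(l1s []*l1Cache, kernel func()) {
	for _, c := range l1s {
		c.invalidateAll()
	}
	kernel()
}

func main() {
	l1s := []*l1Cache{
		{valid: map[uint64]bool{0x40: true}}, // line left over from a previous kernel
		{valid: map[uint64]bool{}},
	}
	launchKernel(l1s, func() { fmt.Println("kernel starts with cold L1 caches") })
	fmt.Println(len(l1s[0].valid)) // 0: the old line is gone
}
```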