ChunkedList: store pointers to next/last elements instead of indices … #48

Merged

psychocoderHPC merged 3 commits into ComputationalRadiationPhysics:dev from michaelsippel:topic-chunkedlist

Dec 13, 2023

Member

michaelsippel commented Dec 12, 2023

…to save overhead of address calculation and allow more efficient packing of ItemAccess

this also reintroduces the chunk-size as template parameter instead of storing it as member
initialize AtomicList with chunk_capacity, not allocation size
AtomicList: construct Item/ItemControlBlock from memory::Block
ChunkedList: construct Chunk from memory::Block
reorder ChunkedList::Item struct for more efficient alignment
encode has_item of ItemAccess into one 8-byte pointer using bitmagic
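
The last point, packing `has_item` into the pointer itself, could look roughly like the sketch below. It is a minimal illustration and not the code from this PR: the `TaggedPtr` name and its members are hypothetical, and it assumes 64-bit pointers whose most significant bit is never set for valid addresses (typical for user-space addresses on x86-64 and AArch64, but not guaranteed by the C++ standard).

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for the packed field of ItemAccess.
// Assumption: pointers are 64 bit wide and bit 63 is unused by valid addresses.
template <typename T>
struct TaggedPtr
{
    static_assert(sizeof(void*) == 8, "sketch assumes 64-bit pointers");

    // Mask selecting the most significant bit (MSB), which carries the flag.
    static constexpr std::uintptr_t flag_mask = std::uintptr_t(1) << 63;

    std::uintptr_t bits = 0;

    // Store the address and the flag together in one 8-byte word.
    void set(T* ptr, bool has_item)
    {
        auto raw = reinterpret_cast<std::uintptr_t>(ptr);
        assert((raw & flag_mask) == 0); // the MSB must be free for the flag
        bits = raw | (has_item ? flag_mask : 0);
    }

    // Recover the plain address by clearing the flag bit.
    T* ptr() const { return reinterpret_cast<T*>(bits & ~flag_mask); }

    // Read the flag stored in the MSB.
    bool has_item() const { return (bits & flag_mask) != 0; }
};
```

Compared with a separate `bool` member next to the pointer, this keeps the whole access record at 8 bytes, at the cost of one mask operation on every dereference.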

michaelsippel added the optimization label

michaelsippel requested a review from psychocoderHPC

December 12, 2023 13:25

Member Author

michaelsippel commented Dec 12, 2023

Here I'm still getting warnings which I don't understand:

warning: 'short unsigned int __atomic_fetch_sub_2(volatile void*, short unsigned int, int)' writing 2 bytes into a region of size 0 overflows the destination [-Wstringop-overflow=]
  645 |       { return __atomic_fetch_sub(&_M_i, __i, int(__m)); }

michaelsippel force-pushed the topic-chunkedlist branch from a0d1479 to e2fd434 Compare

December 12, 2023 13:39

Member

psychocoderHPC commented Dec 12, 2023

> Here I'm still getting warnings which I don't understand
>
> warning: 'short unsigned int __atomic_fetch_sub_2(volatile void*, short unsigned int, int)' writing 2 bytes into a region of size 0 overflows the destination [-Wstringop-overflow=]
>   645 |       { return __atomic_fetch_sub(&_M_i, __i, int(__m)); }

Could it be a hint that some data are unaligned?

Member

psychocoderHPC commented Dec 12, 2023

@michaelsippel Is this PR a replacement for #45?

psychocoderHPC requested changes

Member

psychocoderHPC left a comment

IMO we still have the double-free issue I tried to solve with #45, and more than one thread can still append a new chunk.

redGrapes/util/chunked_list.hpp Outdated

               #include <limits>
               #include <memory>
               #include <optional>
               #include <redGrapes/util/trace.hpp>
               #include <redGrapes/memory/allocator.hpp>
-              #include <redGrapes/memory/bump_allocator.hpp>
+              //#include <redGrapes/memory/bump_allocator.hpp>

Member

psychocoderHPC Dec 12, 2023

Why is this commented out?

Member Author

michaelsippel Dec 12, 2023

Commented out because it's not needed: the allocator is a template parameter. Will be removed.

Member

psychocoderHPC Dec 12, 2023

please remove the line.

redGrapes/util/chunked_list.hpp Outdated

                       }
                       ~Chunk()
                       {
-                          for( unsigned i = 0; i < last_idx; ++i )
-                              items()[i].~Item();
+                          for( Item * item = first_item; item != last_item; item++ )

Member

psychocoderHPC Dec 12, 2023

last_item points by default to the element before the first item. If we create a chunked_list but never allocate memory, does that mean we loop over non-existent memory?

Member

psychocoderHPC Dec 12, 2023

Maybe `for( Item * item = first_item; item <= last_item; item++ )`?

Member Author

michaelsippel Dec 12, 2023

True. Currently this bug doesn't manifest because a chunk is only allocated if at least one element exists, but when we switch to the algorithm where the thread that gets the last slot allocates the new chunk, a chunk can remain empty, which would create an endless loop.
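
The difference between the two loop bounds can be checked on a small model. This is only a sketch under stated assumptions: `ChunkModel`, `push` and `visit_inclusive` are hypothetical names, and slot 0 of the buffer is a sentinel so that "one before the first item" is a valid address. With an inclusive `last_item`, an `item != last_item` loop starting at `first_item` begins past the bound when the chunk is empty and never reaches equality, whereas `item <= last_item` visits zero items in the empty case and includes the final item otherwise.

```cpp
#include <cassert>
#include <cstddef>

struct Item
{
    int value;
};

// Simplified model of a chunk, not the redGrapes Chunk type.
// storage[0] is a sentinel slot, items live in storage[1..Capacity].
template <std::size_t Capacity>
struct ChunkModel
{
    Item storage[Capacity + 1];
    Item* first_item = storage + 1;
    Item* last_item = storage; // empty chunk: one before first_item

    // Append one item; last_item always points at the last valid item.
    void push(int v)
    {
        assert(last_item < storage + Capacity); // chunk must not be full
        *++last_item = Item{v};
    }

    // Walk the items with the inclusive bound suggested in the review
    // and return how many items were visited.
    std::size_t visit_inclusive() const
    {
        std::size_t visited = 0;
        for (Item const* item = first_item; item <= last_item; ++item)
            ++visited;
        return visited;
    }
};
```

In a destructor, the loop body would call `item->~Item()` instead of counting.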

redGrapes/util/chunked_list.hpp Outdated

                       typename memory::AtomicList< Chunk, Allocator >::MutBackwardIterator chunk;
+                      /* this pointer packs the address of the current element
+                       * and the `has_element` bit in its MSB.

Member

psychocoderHPC Dec 12, 2023

MSB ?? What is this?

Member Author

michaelsippel Dec 12, 2023

MSB = most significant bit

Member

psychocoderHPC Dec 12, 2023

please add this to the first usage

redGrapes/util/chunked_list.hpp

@@ -441,7 +438,7 @@ struct ChunkedList
                   void release_chunk( typename memory::AtomicList< Chunk, Allocator >::MutBackwardIterator chunk )
                   {
                       if( chunk->item_count.fetch_sub(1) == 0 )
-                          chunks.erase( chunk );
+                          chunks.erase( chunk );

Member

psychocoderHPC Dec 12, 2023

If one thread enters this line while another thread is pushing an item, increments the counter, sees that no item is free, and then releases the memory again, we will have a double free.

Member Author

michaelsippel Dec 12, 2023

In this version we increment item_count only after we are sure we have obtained a slot. The scenario is only relevant if item_count is incremented first and then decremented again.

redGrapes/util/chunked_list.hpp Outdated

+                                  }
+                              }
+                              release_chunk(chunk);
                           }
                           auto prev_chunk = chunks.allocate_item();

Member

psychocoderHPC Dec 12, 2023

How do we prevent more than one thread from creating new memory?

Member Author

michaelsippel commented Dec 12, 2023 (edited)

Yes, you are right. This PR does not yet include the fix for the double allocation / memory leak; #45 is still necessary, and one of the two needs to be rebased.
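
One possible way to ensure that only a single thread appends a chunk follows the idea mentioned earlier in this review, where the thread that gets the last slot allocates the new chunk. The sketch below is not the fix from #45 or from any other PR; `SlotCounter` and `Claim` are hypothetical names. It relies only on the fact that an atomic `fetch_add` hands out every index exactly once, so exactly one caller observes the last index of a chunk.

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>

// Outcome of trying to claim a slot in a chunk (illustrative only).
enum class Claim
{
    slot,                   // got a slot, nothing else to do
    slot_and_allocate_next, // got the last slot, caller allocates the next chunk
    full                    // no slot left in this chunk
};

template <std::size_t Capacity>
struct SlotCounter
{
    static_assert(Capacity > 0, "a chunk needs at least one slot");

    std::atomic<std::size_t> next{0};

    Claim claim()
    {
        // fetch_add returns the previous value, so every caller sees a
        // distinct index even when several threads call this concurrently.
        std::size_t idx = next.fetch_add(1, std::memory_order_relaxed);
        if (idx + 1 == Capacity)
            return Claim::slot_and_allocate_next; // unique per chunk
        if (idx < Capacity)
            return Claim::slot;
        return Claim::full;
    }
};
```

A caller that receives `Claim::full` would have to wait for, or retry on, the chunk appended by the single thread that received `Claim::slot_and_allocate_next`. The sketch covers only the allocation side and does not address when a chunk may be freed.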

Member

psychocoderHPC commented Dec 12, 2023 (edited)

I opened a PR against your PR to solve the double free and double alloc issue: michaelsippel#1
My local tests work well; please have a look.
As I wrote before, #45 does not need to be merged if this PR solves the big issue with the double alloc and free.

michaelsippel force-pushed the topic-chunkedlist branch from e2fd434 to 6933ef3 Compare

December 12, 2023 17:24


          ChunkedList: store pointers to next/last elements instead of indices …

…to save overhead of address calculation and allow more efficient packing of ItemAccess

* this also reintroduces the chunk-size as template parameter instead of storing it as member
* initialize AtomicList with `chunk_capacity`, not allocation size
* AtomicList: construct Item/ItemControlBlock from `memory::Block`
* ChunkedList: construct Chunk from `memory::Block`
* reorder ChunkedList::Item struct for more efficient alignment
* encode `has_item` of ItemAccess into one 8-byte pointer using bitmagic

michaelsippel force-pushed the topic-chunkedlist branch from 6933ef3 to 8897117 Compare

December 12, 2023 17:28


          fix double free and double alloc

fb64cd8

Like ComputationalRadiationPhysics#45, this PR should solve the possible double free and double alloc.

psychocoderHPC mentioned this pull request

redGrapes is currently broken :-( #49

Closed


          Merge pull request #1 from psychocoderHPC/fix-doubleFreeAndAlloc

b22c4a9

fix double free and double alloc

michaelsippel mentioned this pull request

fix memory leak #45

Closed

3 tasks

psychocoderHPC approved these changes

psychocoderHPC merged commit 794431a into ComputationalRadiationPhysics:dev

1 check passed
