New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Draft: Coroutines article #300

Open

koniarik wants to merge 11 commits into memfault:master from koniarik:master

+376 −0

Contributor

koniarik commented Jan 30, 2023

As discussed in email, I am making a PR with a draft of article about coroutines, I am looking forward for cooperation!

P.S: the article is intentionally set to 2024, feel free to suggest better date.

koniarik and others added 10 commits

April 29, 2022 20:06


          draft for testing library

06f7ea1


          Rename post for preview to work

5e41967


          Added paragraph about source repository

74103a0


          Merge branch 'master' of github.com:koniarik/interrupt

a8c3bc3


          Grammarlied the testing library post

3fb8f7d


          Vanilla version of artcile about coroutines

14c0b42


          updated coroutines

a7640e9


          Merge remote-tracking branch 'upstream/master'

79e38aa


          cleaned up coroutines article with grammarly

59b3fe8


          removed the beta version of testing lib

daa3b34

bahildebrand reviewed

View reviewed changes

Member

bahildebrand left a comment

Overall I like the idea of the article. I think our readers would be interested in a more focused look at how to implement coroutines for asynchronous peripheral IO, and what the benefits over traditional threaded concurrency would look like. Maybe you could spin up an asynchronous driver for an MCU peripheral and use it as an example?

Let me know if you have any question about any of my feedback.

_posts/2024-04-20-coroutines.md


		Approach 1) has multiple potential issues, it might take a lot of code to implement a complex exchange of data in this way (a lot of state variables) and the approach is problematic, as there is a chance that one of the steps might take longer than expected and we can't prevent that.

		Approach 2) has another set of issues. Each thread requires its own stack space (which might not scale), and we got all the problems of parallelism - exchanging data between can suffer various potential concurrency issues.

Member

bahildebrand Apr 28, 2023

I'm not sure I agree with the thought that using coroutines avoids concurrency issues. Coroutines man not be parallel, but they are certainly concurrent. I think the main advantage would be the first focus of this sentence, savings on stack size and context switch penalties.

Contributor Author

koniarik Aug 15, 2023

Eeeeeh, it does, point of coroutines is that you make a context swithc at specific points in the code that are controlled by you. While in normal freertos-like threading, the context switch can happen anytime the main timer fires and switches the threads.

This means that you do not have to think problems about stuff like strict atomic work with shared variables, let's assume that i is shared variable between parts of the system that execute concurrently. This is unsafe in threads, but is perfectly ok in case of coroutines:

int j = i;
j += 1;
i = j;

That is, with threads: context switch can happen at any point in time, at any instruction, and you DO have to think about it and take care
with coros: context switch happens ONLY when you explicitly write in the code that the coroutine should interrupt itself.

For me this is one of the biggest motivations to use coroutines over threads in general, as the mental overhead of managing concurrent access to stuff with this explicit cooperation is much smaller than with threads.

Anyway, I will try to re-think this part and think how to express it in better way :)

_posts/2024-04-20-coroutines.md


		That is, `coroutine` is just a `function` that can interrupt itself and be resumed by the caller.

		### How does it look like

Member

bahildebrand Apr 28, 2023

It would be good to see some references throughout this section.

Contributor Author

koniarik Aug 15, 2023

I added general reference to cppreference coroutines, I will think about this some more whenever I want to go into more detail. (Generally, cppreference is a good reference)

_posts/2024-04-20-coroutines.md

Comment on lines +206 to +207

		Coroutines have `allocator` support, we can provide the coroutine with an allocator that can be used to get memory for the coroutine and hence avoid the dynamic allocation. (Approach that I can suggest)
		This is done by implementing custom `operator new` and `operator delete` on the `promise_type` which allocates the `entire frame`.

Member

bahildebrand Apr 28, 2023

Is the suggestion to roll our own allocator? I'll admit I'm not very familiar with C++ coroutines. More examples and references here would be appreciated.

Contributor Author

koniarik Aug 15, 2023

Understood, I will think about expanding this part more clearly, generally: you can use whatever you want to get/free memory, that is: malloc/free, your own memroy resource, allocators or anything.

_posts/2024-04-20-coroutines.md

Comment on lines +209 to +210

		An alternative is to rely on `halo` optimization (Heap Allocation Elision Optimization) if the coroutine is implemented correctly and the parent function executes the entire coroutine in its context.
		The compiler can optimize away the dynamic allocation and just store the frame on the stack of the coroutine's parent.

Member

bahildebrand Apr 28, 2023

To me this seems cleaner than a custom allocator. Could you elaborate on why you prefer the custom allocator solution?

Contributor Author

koniarik Aug 15, 2023

I did updated this in the article

_posts/2024-04-20-coroutines.md

+              What I would suggest (that is what I do), is to use coroutines for the long-term process and just build them during the initialization of the device.
+              Or just live with the dynamic memory.
+              (Note: if I could dream a bit here, I would want an explicit way of forcing the coroutine to live on the stack, which would have clearly defined behaviour of when and how it should happen, and with proper compiler errors if one fails to do so)

Member

bahildebrand Apr 28, 2023

Yeah, this would be the preferred way :(. It might help to flesh out the halo optimization more and why it isn't as ideal, since by your description it looks to accomplish this.

Contributor Author

koniarik Aug 15, 2023

I will ok this more to make sure it is explained properly, halo is optimization... you can't force compiler to do it, comiler usually won't tell you why it did it or did not do it, or it won't even tell you how exactly it decides

_posts/2024-04-20-coroutines.md


		#### Frame size is big

		One more invisible caveat that appeared is that GCC is not yet smart enough with the frame size.

Member

bahildebrand Apr 28, 2023

Just a GCC problem, or do other compilers have this issue as well?

Contributor Author

koniarik Aug 15, 2023

I will try to check this

_posts/2024-04-20-coroutines.md Outdated

Comment on lines 285 to 286

		Given the bus nature of `i2c`, it is also quite easy to achieve sharing of the `i2c bus` with multiple `device drivers` for various devices on the bus itself.
		We can just implement the `i2c_coroutine round_robin_run(std::span<i2c_coroutine> coros)` coroutine that uses round robin to share access to the peripheral between multiple coroutines (devices).

Member

bahildebrand Apr 28, 2023

I think this section would be very interesting to elaborate. Other strategies for handling concurrency for this scenario, and how you would implement them.

Contributor Author

koniarik Aug 15, 2023

I added some round robin implementation, wil lthink about others


          fixes

4db2ff3

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet