-
I ran into the same problem. Hope someone can fix this.
-
Hi all. I'm trying to do something like the following:
However, this gives me

    File "/somepath/python3.8/site-packages/deepspeed/runtime/zero/stage3.py", line 1111, in __reduce_and_partition_ipg_grads
        assert len(set(p.ds_id for p in self.params_in_ipg_bucket)) == len(self.params_in_ipg_bucket)
    AssertionError

Does anyone have an idea how to resolve this? I'm using DeepSpeed 0.10.0 with ZeRO stage 3 and gradient checkpointing. Thanks!
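For anyone reading the traceback: the failing assertion only checks that no `ds_id` appears more than once in `params_in_ipg_bucket`. The snippet below is a minimal sketch of that check in plain Python, not DeepSpeed code. `FakeParam` and `bucket_has_unique_ids` are hypothetical names made up for illustration; only the expression inside the function is copied from the traceback. One plausible reading (an assumption, not confirmed anywhere in this thread) is that the same parameter ends up in the bucket twice during a single backward pass.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FakeParam:
    """Hypothetical stand-in for a parameter; only ds_id matters here."""
    ds_id: int


def bucket_has_unique_ids(bucket):
    # Same expression as the assertion in the traceback, applied to a plain list.
    return len(set(p.ds_id for p in bucket)) == len(bucket)


a, b = FakeParam(0), FakeParam(1)

print(bucket_has_unique_ids([a, b]))     # True: each parameter appears once
print(bucket_has_unique_ids([a, b, a]))  # False: `a` was added twice
```

The second call is the situation in which the real assertion would raise `AssertionError`: the set of ids is smaller than the bucket itself.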