-
Notifications
You must be signed in to change notification settings - Fork 88
Issues: microsoft/tutel
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
How to convert checkpoint files that adapt to different distributed world sizes
#246
opened Aug 27, 2024 by
swjtulinxi
[Question] Why use datatype ncclInt8 in nccl_all_to_all_scatter_async.
#220
opened Dec 18, 2023 by
cicirori
How to implement Fairseq-MoE training checkpoint like Swin-MoE?
#219
opened Nov 10, 2023 by
withinmiaov
Non-surface function utilities only work for contiguous input data
#218
opened Nov 6, 2023 by
lyd126
ImportError: cannot import name 'tutel_custom_kernel' from 'tutel.impls.jit_compiler'
environmental issue
#198
opened Mar 30, 2023 by
zhaojiancheng007
tutel/jit_kernels/sparse.py torch.float16 There is a bug in the calculation: the cuda calculation result is inconsistent with the CPU calculation result and the array is out of bounds
invalid
This doesn't seem right
#196
opened Mar 8, 2023 by
WsqRichards1
How the experts' gradients are handled under data parallelism?
#192
opened Dec 26, 2022 by
yzs981130
[installation errors] fatal error: nccl.h: No such file or directory
#189
opened Oct 19, 2022 by
qianyuzqy
My code seems to hang when skip_remainder_batch=False.
application patch
#182
opened Aug 9, 2022 by
Fragile-azalea
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.