alpa-projects / alpa (3.1k stars)
Training and serving large-scale neural networks with auto parallelization.
Topics: machine-learning, deep-learning, compiler, distributed-computing, high-performance-computing, distributed-training, jax, alpa, auto-parallelization, llm
Language: Python. Updated Dec 9, 2023.

alibaba / TePDist (88 stars)
TePDist (TEnsor Program DISTributed) is an HLO-level automatic distributed system for DL models.
Topics: distributed-systems, machine-learning, deep-learning, compiler, distributed-computing, rhino, high-performance-computing, distributed-training, auto-parallelization, disthlo
Language: C++. Updated Apr 22, 2023.