rel_2022.4 release
Pre-release
Initial pre-production release of Intel Extension for DeepSpeed. It supports training a 3.6-billion-parameter GPT-2 large language model, with the following features verified on Intel GPUs:
• DeepSpeed ZeRO stage 2
• Model parallel size is 1
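
For orientation, here is a minimal sketch of a DeepSpeed configuration that turns on ZeRO stage 2. It is not taken from this release. The helper name `build_zero2_config` is hypothetical and the batch-size values are placeholders. Only the key names (`train_micro_batch_size_per_gpu`, `gradient_accumulation_steps`, `zero_optimization.stage`) come from the standard DeepSpeed config format. The model parallel size is typically set by the training script rather than in this file, so it is not shown.

```python
import json


def build_zero2_config(micro_batch_size=1, grad_accum_steps=1):
    """Return a minimal DeepSpeed-style config dict with ZeRO stage 2 enabled.

    The batch-size defaults are placeholders, not values verified by this release.
    """
    return {
        "train_micro_batch_size_per_gpu": micro_batch_size,
        "gradient_accumulation_steps": grad_accum_steps,
        # Stage 2 partitions optimizer states and gradients across data-parallel ranks.
        "zero_optimization": {"stage": 2},
    }


config = build_zero2_config()
config_json = json.dumps(config, indent=2)
print(config_json)
```

The printed JSON can be saved to a file and passed to a training script that accepts a DeepSpeed config path.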