Huawei's Ascend 950 series chips have significantly improved performance, with computing power reaching 1-2 PFLOPS.

511
The new-generation Ascend 950 series chipset utilizes a hybrid SIMD/SIMT architecture for the first time. SIMD inherits the characteristics of the NPU and is suitable for regular matrix and vector calculations, while SIMT, consistent with GPGPU, emphasizes thread-level parallelism, enabling greater flexibility in large-scale training. This design attempts to balance energy efficiency and versatility, but from an engineering perspective, it is a complex compromise that requires resolving conflicts between resource scheduling and compilation optimization at the hardware level.