it is not H200. They are not comparable. They are in fact built on different philosophies. Why would anyone compare Ascend-950 with a vastly downgraded chip like H20? I understand that you probably copy and pasted something you saw someone else post, but if they write stuff like this, maybe you should not repost it.Huawei Unveils AI Accelerator Ascend 950 with 3x H20 Performance: Achieving NVIDIA H200 Levels Possible with 3nm and HBM3 Application
The Ascend 950, combining a 3nm process with HBM3, has become powerful enough to be called the **"Chinese version of the H200"**.
Carbon Nanotube (CNT) Semiconductor: This is the most groundbreaking aspect. It has already been verified at the laboratory level and is currently being optimized for SMIC's production lines. If successful, it could become a "game changer" capable of overcoming the physical limitations of existing silicon chips and demonstrating energy efficiency exceeding that of the H200.
Ascend-950 is probably using N+2 processDo you have a source for this? I can't find this online and I'm skeptical that Ascend 950 is on a 3nm process considering that it seems SMIC only got to 6nm fairly recently and I'm kinda skeptical Huawei would be able to trick TSMC into fabbing a lot of chips for them again.
again for Nvidia. Sparsity vs Dense, make sure you know what you are comparing. Nvidia likes to advertise sparsity figures.He/she is spreading semi-false information, the reality is 1.56PFLOPS PF4, ~40% H100, while the power is 600 watt, which is quite disappointing considering the power of H100 SXM is around 700 watt; besides, the bandwith is dissppointing as well with 1.4TB/S, ~47% H100 SXM. So we may infer from it that this chip is made with HBM2e or equivalent VRAM chip and its power efficiency is roughly the same as Ascend 910B, which is quite strange because it even cannot match the level of Ascend 910C. I wonder if it is because they are shifting away from NPU so that the version 1 has low power efficiency.
If you cannot understand why Ascend-950 has lower compute than 910C, then you are probably not qualified to comment. Also we don't know the power consumption of Ascend-950. We know that Ascend950PR is rated as 1 PFLOPS FP8/MXPF8 & using HIBL 1.0 with 128 GB of memory. We know the chip to chip interconnect is 2 TB/s

Do you understand why Ascend-950 has lower per chip compute than 910C? This is critical to understand the design decisions behind 950 series.Ascend 950 is not meant to compete with Nvidia cards on a card level performance. Instead they are focusing on cluster level performance (ie, bigger clusters than Nvidia). On a single card level it actually has less compute than Huawei's previous product Ascend 910C.
Numbers for Ascend 950 and several subsequent products are in Huawei's public roadmap for next several years. Ascend 950 is most likely 7nm chip with HBM2E for the inference version and HBM3 for the training version. From the roadmap it's very clear they are not trying to match Nvidia cards one-on-one any time soon. Their focus is bigger clusters and better networking.







