Amazon AWS releases Inferentia 2 chip to accelerate large-scale model inference

2024-12-26 07:13
Amazon AWS has released the Inferentia 2 chip, which triples compute performance and quadruples total accelerator memory over its predecessor. Inferentia 2 supports distributed inference and can serve models with up to 175 billion parameters, making it a strong contender for large-scale model inference.
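The need for distributed inference at this scale follows from simple arithmetic: the weights of a 175-billion-parameter model do not fit on a single accelerator. A minimal sketch, assuming FP16 weights (2 bytes per parameter) and a hypothetical 32 GB per-device memory budget (illustrative figures, not Inferentia 2 specifications):

```python
# Back-of-envelope: why a 175B-parameter model needs distributed inference.
# Assumptions (illustrative, not Inferentia 2 specs):
#   - weights stored in FP16, i.e. 2 bytes per parameter
#   - 32 GB of usable memory per accelerator
params = 175e9
bytes_per_param = 2  # FP16
model_gb = params * bytes_per_param / 1e9  # weights alone, no activations/KV cache

per_device_gb = 32
min_devices = -(-model_gb // per_device_gb)  # ceiling division

print(f"Weights: {model_gb:.0f} GB -> at least {min_devices:.0f} accelerators")
```

Even ignoring activations and the KV cache, 350 GB of FP16 weights must be sharded across many devices, which is exactly the workload distributed inference support targets.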