Alibaba Cloud releases "Aegaeon" computing pooling solution

2025-10-21 09:43
 476
At the SOSP 2025 conference in Seoul, South Korea, Alibaba Cloud launched the "Aegaeon" computing pooling solution, which effectively reduces GPU resource waste in AI model serving, reducing the number of GPUs required for large language models by 82%. This technology is now used in the Alibaba Cloud Bailian platform.