塑造人工智能开放基础设施的未来.pdf

编号:1011815 PDF 18页 4.16MB 下载积分:VIP专享
下载报告请您先登录!

塑造人工智能开放基础设施的未来.pdf

1、Ian BuckVP of HPC and HyperscaleNVIDIAShaping the Future of Open Infrastructure for AIGiga-Scale AI is Transforming Data CentersDriving extreme co-design from chip to grid with open collaborationNVIDIA Giga-Scale Reference DesignsPowerCoolingNetworkingComputeMechanicalScale-Up Spectrum-X EthernetOpe

2、n CollaborationCPXPower Smoothing45C Liquid CoolingMGX010,00020,00030,00040,00050,00060,0000100200300400500600GPT-OSS LaunchInferenceMAXTensorRT-LLM+Spec DecodeAug 2025GPT-OSS LaunchTodayCost per Million TokensBlackwell Optimizations Achieve 5X Throughput in 2 MonthsMulti-fold reduction in token cos

3、tsThroughputTPS per GPUInteractivityTPS per UserGPT-OSS-120B$0.11$0.02 5X100030,000 TPS/GPU5x Throughput in 2 monthsH200 NVL8GB200 NVL72Non-GPU CostsGPU CostsProfitExtreme Hardware-Software Co-Design for Inference Performance$5M GB200 NVL72 investment can generate$75M token revenue02,5005,0007,50010

4、,00012,500105090Measured DeepSeek-R1ThroughputTPS per GPUInteractivityTPS per User15xNVL72FP4DynamoTRT-LLMTRT Model OptimizerCUDA GraphsH200GB200AI Factory ROI$75M Revenue$5M$5M CostRevenue estimates assume 3-year operation on 72 GPUs at 50 TPS/User with DeepSeek R1 and$1.45/M token cost,based on In

5、ferenceMAX results and SemiAnalysis TCO model;actual ROI may vary.Inference Complexity is ExplodingMore parameters,experts,reasoning,kernels&shapes,and contextDS-R1,GPT OSS,Kimi K2,Llama4,Qwen3,Cosmos,Gemini,LTM-2-mini,Sora2Mixture of ExpertsDense TransformersDense LLMsInferenceComplexityBERTLlama32

6、024201820232025Massive Context(Video generation,software application development)1Expert10KKernels,Shapes300+Experts10MKernels,Shapes1M+Context Tokens(2,000 x vs.BERT)Next Generation Vera Rubin for Giga-Scale AIOCP MGX compatible infrastructureVera Rubin NVL144Vera Rubin CPXComputeMemoryBandwidthNVL

友情提示

1、下载报告失败解决办法
2、PDF文件下载后,可能会被浏览器默认打开,此种情况可以点击浏览器菜单,保存网页到桌面,就可以正常下载了。
3、本站不支持迅雷下载,请使用电脑自带的IE浏览器,或者360浏览器、谷歌浏览器下载即可。
4、本站报告下载后的文档和图纸-无水印,预览文档经过压缩,下载后原文更清晰。

本文(塑造人工智能开放基础设施的未来.pdf)为本站 (明日何其多) 主动上传,三个皮匠报告文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。 若此文所含内容侵犯了您的版权或隐私,请立即通知三个皮匠报告文库(点击联系客服),我们立即给予删除!

温馨提示:如果因为网速或其他原因下载失败请重新下载,重复下载不扣分。
客服
商务合作
小程序
服务号
折叠