当前位置:首页 > 报告详情

20260218_A-102_Sever.PDF

上传人: 彩旗 编号:1158858 2026-03-02 12页 1.94MB

1、1/Chiplet Sumit 2026.proteanTecs.All rights reserved.February 2026Electronics Visibility.From Within.Nir SeverSenior Director of Business DevelopmentIn-Field Monitoring for Chiplet Designs via ML-Driven On-Die Telemetry2 /Chiplet Sumit 2026.proteanTecs.All rights reserved.The New Frontier:Chiplet Ba

2、sed AI ArchitecturesThe rapid shift to chiplet-based AI acceleration is driven by four core pillars:Heterogeneous Dies:Mixing process nodes for optimized logic and I/OCompute Dies:Complex,ultra high speed,multi-core,and mission criticalStacked Memories(HBM):Massive bandwidth directly adjacent to the

3、 computeUltra-Fast Interconnects:Delivering the low-latency communication requiredfor AI workloads3 /Chiplet Sumit 2026.proteanTecs.All rights reserved.AI Systems Challenges While performance and power have soared,system complexity has introduced critical physical blind spots Traditional test and va

4、lidation methods alone can no longer guarantee field reliability,power profiles,or efficient bring-upThermal StressLocalized hotspots from 3D stacking and high-density AI cores.Process VariationInconsistent performance across identical chipletsAccelerated AgingFaster degradation due to continuous,hi

5、gh-intensity workloadsDynamic Voltage DroopPerformance degradation due to continuous,high-intensity workloadsSDC is on the Rise1 Upasani,G.,Vera,X.,&Gonzalez,A.(2015).A case for acoustic wave detectors for soft errors.IEEE Transactions on Computers,65(1),518.Power Consumption is on the RiseUp to 60%

6、workload loss due to hardware failure 5 /Chiplet Sumit 2026.proteanTecs.All rights reserved.Its All About Timing VisibilityCombination logic(replication of the CP set by S1:SnCanary CircuitBest known methods often monitor design margins by proxy,and cannot provide accurate data about actual performa

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
1. **Chiplet AI架构挑战**:异构芯片、高算力核心、堆叠内存(HBM)、超快互连四大支柱驱动发展,但面临热应力、工艺变异、加速老化、电压跌落等问题,导致硬件故障损失高达60%工作负载。 2. **实时时序可见性**:Proteantec通过ML驱动片上遥测,实时监控关键路径时序裕度,覆盖工作负载、缺陷、电压跌落等,实现动态功耗降低(客户案例:网络/超大规模/移动客户分别节省12.5%/9.0%/11.9%)。 3. **故障预防与诊断**:通过性能指数预警时序失效,结合本地/远程诊断,实现主动故障避免、可靠性监控及精准故障定位,提升系统韧性与能效。
**芯片盲点何在?** **AI芯片如何长寿?** **功耗如何优化?**
客服
商务合作
小程序
服务号
折叠