Use Cases for CXL RAS Firmware-First Error Handling

Intel Corporation
Harapanahalli, Manjunaatha B - Server Firmware BIOS Architect
Chen, Arvin - Platform Validation Engineer

Outline
1. Introduction
2. Firmware-First in CXL RAS: Real-World Lessons on Mailbox Design
3. Impact of SMI Latency on System Performance
4. Common Error Signaling Protocols
5. UUID to GUID: Translating CXL Errors with Correct Format
6. CXL Communication Errors: Boot-Time and Runtime Handling
7. Error Pollution with CXL Error

Introduction

The CXL ecosystem comprises a multitude of component vendors (SoC, memory, storage, networking, etc.). The explosive growth of internet content, and the resulting data storage and computation requirements, has led to the deployment of heterogeneous and complex solutions in very large-scale data centers. These warehouse-sized buildings are packed with server, storage, and network hardware.

Specifically, if hardware detects an uncorrected fatal error that poses a containment risk, the system needs to be reset and restarted, if possible, to enable continued operation. If the error affects the entire CXL device, a persistent memory device is considered to have experienced a dirty shutdown.

The idea behind the primary and secondary mailboxes is to keep FW and the OS from stepping on each other. Firmware-first support requires the CXL memory device to implement a secondary mailbox. In the initial stages, it was a challenge to get this support from CXL IHVs. To support the engineering/debug effort, the Intel FW team added an option to use the primary mailbox during runtime, so that the MEFN feature could be validated with IHVs whose devices lacked secondary mailbox support. The primary and secondary mailbox queues are different, but the Device Status Register is commo