© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.

"Everything fails, all the time"
Werner Vogels, CTO, A

STG324
Building resilient block storage through chaos engineering and observability

Kirill Davydychev (he/him)
Principal Storage Solutions Architect
Amazon Web Services

Parnika Singh (she/her)
Senior Product Manager Technical
Amazon Web Services

Agenda
    Overview
    Defining resilience
    Observability
    Resilience testing
    Best practices

Defining resilience

Resilience
Ability of a workload to recover from infrastructure or service disruptions
    High availability: Resistance to common failures through design and operational mechanisms at a primary site
    Disaster recovery: Returning to normal operation within specific RTO/RPO for failures that cannot be handled by HA
    Continuous improvement: CI/CD, observability, moving beyond pre-deployment testing towards chaos engineering patterns

How to build resiliency
    Understand risks and potential disruptions
    Plan and automate recovery
    Minimize scope of impact

Disruption types
    Application data
        Data deletion
        Data corruption
        Software bugs
    Infrastructure
        Component failure: Hard drive, Power supply
        In AWS may look like: Amazon EC2 event, Amazon EBS event
    Availability Zone
        Natural disaster: Fire