《利用 Amazon ElastiCache 中的语义缓存优化智能体 AI 应用.pdf》由会员分享,可在线阅读,更多相关《利用 Amazon ElastiCache 中的语义缓存优化智能体 AI 应用.pdf(60页珍藏版)》请在三个皮匠报告上搜索。
1、 2025,Amazon Web Services,Inc.or its affiliates.All rights reserved.2025,Amazon Web Services,Inc.or its affiliates.All rights reserved.D A T 4 5 1Optimizing agentic AI apps with semantic caching in Amazon ElastiCacheSanjit Misra(he/him)Product,Non-Relational DatabasesAWSAllen Samuels(he/him)Principa
2、l Engineer,In-Memory DatabasesAWS 2025,Amazon Web Services,Inc.or its affiliates.All rights reserved.The production moment of Agentic AIPresentNow:ProductizationEnterprise Adoption(Governance,risk,latency,cost containment)2022-2024Multimodal&QualityBetter text and image handlingBetter performance an
3、d accuracy2024-2025Agents:Chat to ActionTool/function calling(APIs,workflows,and more)Multi-agent architectures2022Conversational AIConversational UX making AI usefulMass-Market Adoption2014-2022New Language ModelsBreakthrough image generatorsReinforcement Learning from Human Feedback(RLHF)Foundatio
4、ns 2025,Amazon Web Services,Inc.or its affiliates.All rights reserved.The friction from demo to deploymentScaleSpeedCost 2025,Amazon Web Services,Inc.or its affiliates.All rights reserved.Agentic AI can drive up latencyQuery ProcessingPlanning&inferenceTool ExecutionAPI calls+waitResponse OutputResu
5、lts readyContextContext engineeringTotal Latency:(query+tool+output+context)N turnsWhere N represents the number of loop iterations required to complete the task.2025,Amazon Web Services,Inc.or its affiliates.All rights reserved.Increasing intelligence drives cost50K100K150K200K250K300K350K400K450K5
6、00KCosts($)UsersAnnual LLM Costs($)for Agentic AI ApplicationBaseline 2025,Amazon Web Services,Inc.or its affiliates.All rights reserved.Increasing intelligence drives cost50K100K150K200K250K300K350K400K450K500KCosts($)UsersAnnual LLM Costs($)for Agentic AI ApplicationBaselineIncreasing complexity 2