
Tinker Tailor LLM Spy: Investigate & Respond to Attacks on GenAI Chatbots.pdf

Uploaded by: 竿*** | ID: 982117 | 2025-11-29 | PDF | 58 pages | 10.20 MB

Part of the collection: 2025 Cybersecurity Conference (SECTOR, a Black Hat event, 2025) speaker presentation slides


Tinker Tailor LLM Spy: Investigate & Respond to Attacks on GenAI Chatbots
linktr.ee/meoward

- NYC's MyCity chatbot (chat.nyc.gov)
- ChrisJBakke: "I just bought a 2024 Chevy Tahoe for $1."
- JFrogSecurity: CVE-2024-5565, Prompt Injection Code Execution in Vanna.AI
  1. Ask a question
  2. Vanna converts it to SQL
  3. It runs on the DB
  4. Vanna sends back the data plus a Plotly chart

Oh no, not another GenAI/LLM talk.
I'm not an expert.
Hi, I'm Allyn

Risk Levels
- Low: Provides general information
- Med: Provides personalized information
- High: Performs actions

Incident Types
- Brand damage
- Privacy breach
- Unauthorized access & execution

Incident Scenario #1
- Chatbot: weather chatbot; risk level: low
- User: "What's the weather like in Austin, Texas?"
- Chatbot: "It's so hot and humid out here, even Taylor Swift would write a breakup song about it."

Investigate: Implement logging
- Diagram: Input -> LLM -> Output
- Logged fields: timestamp, chatbot_version, user_prompt, msg_thread_id, session_id, chatbot_output, model
- Example log entry:
  timestamp: 2025-02-18T14:40:00Z
  model: gpt-4
  chatbot_version: weather_2.1
  user_prompt: Give me a Taylor Swift-themed weather report.
  chatbot_output: Cold and snowy, looks like we're in our Evermore era
  session_id: 123456789
  msg_thread_id: 123456789

Investigate: User input's influence on LLM
- Diagram: Input -> LLM -> Output, with Training Data feeding the LLM
- User feedback shown: "Good job, Liam!"
- Training Data paths: fine-tuning, user feedback

Contain: Block impacting inputs
- Diagram: Input -> LLM -> Output
- Input: "Give me a weather report themed by the popular music artist famous for her Eras Tour."
- Blocked term: Taylor

Contain: Block impacting inputs & outputs
- Diagram: Input -> LLM -> Output
- Blocked term on both sides: Taylor

Chatbot Guardrails
- Rule-based metrics
- LLM-as-a-judge
- System prompt

LLM-as-a-Judge
- LLM App Args: Input, Output, Context
- LLM Evaluation Metric: Scorer -> Score, Reason -> Passes Threshold? (Metric: Yes/No)
- Judge prompt: "Evaluate the quality of the following weather report on a scale of 0 to 1, where 0 is poor and 1 is excellent. Consider accuracy, completeness, releva
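The LLM-as-a-judge flow in the slides (scorer, score, reason, threshold check) can be sketched roughly as follows. This is a minimal illustration, not the speaker's implementation: the `Judge` callable, `MetricResult`, `evaluate_output`, the 0.5 default threshold, and `stub_judge` are all hypothetical names and values, and the stub stands in for a real model call. The prompt text is the portion of the slide prompt that survives in the preview.

```python
from dataclasses import dataclass
from typing import Callable, Tuple

# Hypothetical judge interface: takes the full judge prompt, returns (score, reason).
Judge = Callable[[str], Tuple[float, str]]

# Prompt wording taken from the slide preview; the rest of the slide prompt is truncated.
JUDGE_PROMPT = (
    "Evaluate the quality of the following weather report on a scale of 0 to 1, "
    "where 0 is poor and 1 is excellent.\n\nReport:\n{output}"
)


@dataclass
class MetricResult:
    score: float
    reason: str
    passed: bool


def evaluate_output(output: str, judge: Judge, threshold: float = 0.5) -> MetricResult:
    """Score one chatbot output with the judge and compare it to the threshold."""
    score, reason = judge(JUDGE_PROMPT.format(output=output))
    score = max(0.0, min(1.0, score))  # clamp to the 0..1 scale the prompt asks for
    return MetricResult(score=score, reason=reason, passed=score >= threshold)


def stub_judge(prompt: str) -> Tuple[float, str]:
    """Assumption: a keyword stub standing in for a real LLM call."""
    if "taylor" in prompt.lower():
        return 0.2, "Report contains off-topic celebrity references."
    return 0.9, "Report is on topic."


if __name__ == "__main__":
    print(evaluate_output("Sunny and 30 C in Austin.", stub_judge))
    print(evaluate_output("Even Taylor Swift would write a breakup song about it.", stub_judge))
```

Keeping the judge behind a plain callable means the threshold logic can be exercised without network access, and a real model client can be swapped in later.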


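This talk covers the types of attacks on generative AI (GenAI) chatbots, how to respond to them, and how to defend against them. Key points:

1. **Attack types**: brand damage, privacy breach, and unauthorized access and execution.
2. **Response strategy**: implement logging, investigate the influence of user input, refine the system prompt, and apply guardrails.
   - Core data: log fields such as "timestamp", "model", "chatbot_version", "user_prompt", and "chatbot_output".
3. **Defensive measures**: block impacting inputs and outputs, secure tools, and investigate data sources.
   - Key measures: the guardrail decision score (guardrail_score) and the guardrail reason (guardrail_reason).
4. **Data sources and the RAG pipeline**: the talk stresses securing training data and protecting sensitive information.
5. **Incident response playbook**: a five-step investigation and analysis process for handling chatbot incidents.

The talk emphasizes understanding the risks, implementing logging, and preparing a guardrail toolbox in order to stop and respond to attacks on GenAI chatbots.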
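As a rough sketch of how the logged fields and the guardrail decision could sit in one record, the example below checks both the user prompt and the chatbot output against a blocklist and writes the result as a JSON line. The field names come from the summary and slides. The `BLOCKED_PATTERNS` list, the 1.0/0.0 scoring convention, and the function names `check_exchange` and `build_log_record` are assumptions made for illustration only.

```python
import json
import re
from datetime import datetime, timezone
from typing import Optional, Tuple

# Hypothetical blocklist; a real deployment would derive patterns from its own incidents.
BLOCKED_PATTERNS = [re.compile(r"\btaylor\b", re.IGNORECASE)]


def check_exchange(user_prompt: str, chatbot_output: str) -> Tuple[float, str]:
    """Rule-based check of input and output.

    Assumed convention: 1.0 means blocked, 0.0 means allowed.
    """
    for label, text in (("input", user_prompt), ("output", chatbot_output)):
        for pattern in BLOCKED_PATTERNS:
            if pattern.search(text):
                return 1.0, f"{label} matched blocked pattern {pattern.pattern!r}"
    return 0.0, "no blocked pattern matched"


def build_log_record(
    user_prompt: str,
    chatbot_output: str,
    *,
    model: str,
    chatbot_version: str,
    session_id: str,
    msg_thread_id: str,
    now: Optional[datetime] = None,
) -> str:
    """Return one JSON log line holding the exchange and the guardrail decision."""
    score, reason = check_exchange(user_prompt, chatbot_output)
    stamp = (now or datetime.now(timezone.utc)).strftime("%Y-%m-%dT%H:%M:%SZ")
    record = {
        "timestamp": stamp,
        "model": model,
        "chatbot_version": chatbot_version,
        "user_prompt": user_prompt,
        "chatbot_output": chatbot_output,
        "session_id": session_id,
        "msg_thread_id": msg_thread_id,
        "guardrail_score": score,
        "guardrail_reason": reason,
    }
    return json.dumps(record, ensure_ascii=False)


if __name__ == "__main__":
    print(build_log_record(
        "Give me a weather report themed by the popular music artist famous for her Eras Tour.",
        "Cold and snowy, Taylor would say we're in our Evermore era",
        model="gpt-4", chatbot_version="weather_2.1",
        session_id="123456789", msg_thread_id="123456789",
    ))
```

The paraphrased prompt from the slides contains no blocked term, so an input-only check would let it through. Checking the output as well is what flags that exchange here.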
