当前位置:首页 >英文主页 >中英对照 > 报告详情

CSET:2024AI安全的关键概念:机器学习中可靠的不确定性量化方法分析报告(英文版)(13页).pdf

上传人: Kell****reet 编号:182870 2024-11-27 13页 446.96KB

下载:

1、Issue BriefJune 2024Key Concepts in AI SafetyReliable Uncertainty Quantification in Machine LearningAuthorsTim G.J.RudnerHelen Toner Center for Security and Emerging Technology|1 This paper is the fifth installment in a series on“AI safety,”an area of machine learning research that aims to identify

2、causes of unintended behavior in machine learning systems and develop tools to ensure these systems work safely and reliably.Other papers in the series describe three categories of AI safety issuesproblems of robustness,assurance,and specification.This paper introduces the idea of uncertainty quanti

3、fication,i.e.,training machine learning systems that“know what they dont know.”Introduction The last decade of progress in machine learning research has given rise to systems that are surprisingly capable but also notoriously unreliable.The chatbot ChatGPT,developed by OpenAI,provides a good illustr

4、ation of this tension.Users interacting with the system after its release in November 2022 quickly found that while it could adeptly find bugs in programming code and author Seinfeld scenes,it could also be confounded by simple tasks.For example,one dialogue showed the bot claiming that the fastest

5、marine mammal was the peregrine falcon,then changing its mind to the sailfish,then back to the falcondespite the obvious fact that neither of these choices is a mammal.This kind of uneven performance is characteristic of deep learning systemsthe type of AI systems that have seen most progress in rec

6、ent yearsand presents a significant challenge to their deployment in real-world contexts.An intuitive way to handle this problem is to build machine learning systems that“know what they dont know”that is,systems that can recognize and account for situations where they are more likely to make mistake

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
本文介绍了人工智能安全领域中的一个关键概念——可靠的不确定性量化。主要内容包括: 1. 人工智能系统在实际应用中存在不可靠性,如ChatGPT在简单任务上出现错误。 2. 为了提高系统的可靠性,需要构建能够识别并处理自身局限性的机器学习系统,即不确定性量化。 3. 实现不确定性量化面临的主要挑战是数据分布偏移,即系统在实际部署时遇到的数据与训练数据不同。 4. 目前存在四种主要的不确定性量化方法:确定性方法、模型集成、规范预测和贝叶斯推断。每种方法都有其优缺点,但都无法保证在所有情况下提供可靠的不确定性估计。 5. 尽管不确定性量化是一个开放的研究问题,但其在确保人工智能系统安全可靠方面具有重要意义。
如何可靠地量化机器学习模型的不确定性? 机器学习模型如何“知道它们不知道什么”? 面对分布偏移,如何确保不确定性量化模型的可靠性?
客服
商务合作
小程序
服务号
折叠