当前位置:首页 >英文主页 >中英对照 > 报告详情

CSET:2025可解释人工智能的实证检验:对AI评估方法的批判性审视报告(英文版)(34页).pdf

上传人: Kell****reet 编号:618528 2025-03-20 34页 1.03MB

下载:

1、Issue BriefFebruary 2025Putting Explainable AI to the TestA Critical Look at AI Evaluation ApproachesAuthorsMina NarayananChristian SchoeberlTim G.J.RudnerPutting Explainable AI to the TestA Critical Look at AI Evaluation ApproachesAuthorsMina NarayananChristian SchoeberlTim G.J.RudnerCenter for Sec

2、urity and Emerging Technology|1 Executive Summary Policymakers frequently invoke explainability and interpretability as key principles that responsible and safe AI systems should uphold.However,it is unclear how evaluations of explainability and interpretability methods are conducted in practice.To

3、examine evaluations of these methods,we conducted a literature review of studies that focus on the explainability and interpretability of recommendation systemsa type of AI system that often uses explanations.Specifically,we analyzed how researchers(1)describe explainability and interpretability and

4、(2)evaluate their explainability and interpretability claims in the context of AI-enabled recommendation systems.We focused on evaluation approaches in the research literature because data on AI developers evaluation approaches is not always publicly available,and researchers approaches can guide th

5、e types of evaluations that AI developers adopt.We find that researchers describe explainability and interpretability in variable ways across papers and do not clearly differentiate explainability from interpretability.We also identify five evaluation approaches that researchers adoptcase studies,co

6、mparative evaluations,parameter tuning,surveys,and operational evaluationsand observe that research papers strongly favor evaluations of system correctness over evaluations of system effectiveness.These evaluations serve important but distinct purposes.Evaluations of system correctness test whether

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
本文主要研究了AI系统中可解释性和可理解性的评估方法。研究发现,研究人员在描述可解释性和可理解性时,并没有明确区分这两个概念,而是采用了相似的描述方式。此外,研究人员采用了五种评估方法,包括案例研究、比较评估、参数调整、调查和运营评估,其中案例研究和比较评估最为常见。然而,这些评估方法主要关注系统的正确性,而不是系统的有效性。因此,政策制定者需要投资于AI安全评估的标准,并建立一个能够评估这些评估有效性的专业人才队伍。
解释AI系统的可解释性和可理解性有何不同? 研究人员如何评估AI系统的可解释性和可理解性? 政策制定者如何确保AI系统的可解释性评估有效?
客服
商务合作
小程序
服务号
折叠