当前位置:首页 >英文主页 >中英对照 > 报告详情

兰德公司:2026衡量AI智能体的生物能力与风险:基于智能体评估的证据生成与解读研究报告(英文版)(33页).pdf

上传人: 1****1 编号:1123825 2026-02-12 33页 1.29MB

下载:

1、PATRICIA PASKOV,JEFFREY LEE,KYLE BRADY,ALYSSA WORLANDMeasuring Biological Capabilities and Risks of AI AgentsGenerating and Interpreting Evidence from Agentic EvaluationsPerspectiveThis publication has completed RANDs research quality-assurance process but was not professionally copyedited.For more

2、information on this publication,visit www.rand.org/t/PEA4710-1.About RANDRAND is a research organization that develops solutions to public policy challenges to help make communities throughout the world safer and more secure,healthier and more prosperous.RAND is nonprofit,nonpartisan,and committed t

3、o the public interest.To learn more about RAND,visit www.rand.org.Research IntegrityOur mission to help improve policy and decisionmaking through research and analysis is enabled through our core values of quality and objectivity and our unwavering commitment to the highest level of integrity and et

4、hical behavior.To help ensure our research and analysis are rigorous,objective,and nonpartisan,we subject our research publications to a robust and exacting quality-assurance process;avoid both the appearance and reality of financial and other conflicts of interest through staff training,project scr

5、eening,and a policy of mandatory disclosure;and pursue transparency in our research engagements through our commitment to the open publication of our research findings and recommendations,disclosure of the source of funding of published research,and policies to ensure intellectual independence.For m

6、ore information,visit www.rand.org/about/research-integrity.RANDs publications do not necessarily reflect the opinions of its research clients and sponsors.Published by the RAND Corporation,Santa Monica,Calif.2026 RAND Corporation is a registered trademark.Limited Print and Electronic Distribution R

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
1. **AI生物风险与评估需求**:AI科学家(如Google、FutureHouse的模型)加速科研,但也带来生物武器开发风险(如LLMs降低非专家门槛,Dev等2025)。 2. **代理评估(Agentic Evaluations)的重要性**:需超越静态知识测试,通过多步任务(如设计-合成-实验)评估AI在真实生物工作流中的能力(Brady & Lee等2026)。 3. **评估设计的核心考量**: - **跨学科协作**:整合生物学、工程与生物安全专家(Paskov等2025)。 - **任务分解**:将复杂流程(如生物工具使用)拆解为子任务,避免掩盖瓶颈(图3)。 - **人机交互模型**:明确人类参与程度(如自主/协作),影响结果解读(Shi & Tang等2024)。 - **资源与假设**:工具、算力等需匹配威胁场景(如国家行为体vs.个人)。 4. **透明与风险平衡**:需详细记录评估条件(如提示模板、评分标准),同时管理信息泄露风险(Frontier Model Forum 2025)。
AI生物风险? 评估AI能力? 如何设计评估?
客服
商务合作
小程序
服务号
折叠