Microsoft: AI Large Models: Gemini - A Highly Capable Multimodal Model (2023) (English version) (50 pages).pdf

Document no. 148097, 50 pages, 1.63 MB

Gemini: A Family of Highly Capable Multimodal Models

Gemini Team, Google

2023-12-06

This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks show that our most-capable Gemini Ultra model advances the state-of-the-art in 30 of 32 of these benchmarks, notably being the first model to achieve human-expert performance on the well-studied exam benchmark MMLU, and improving the state of the art in every one of the 20 multimodal benchmarks we examined. We believe that the new capabilities of Gemini models in cross-modal reasoning and language understanding will enable a wide variety of use cases and we discuss our approach toward deploying them responsibly to users.

1. Introduction

We present Gemini, a family of highly capable multimodal models developed at Google. We trained Gemini jointly across image, audio, video, and text data for the purpose of building a model with both strong generalist capabilities across modalities alongside cutting-edge understanding and reasoning performance in each respective domain.

Gemini 1.0, our first version, comes in three sizes: Ultra for highly-complex tasks, Pro for enhanced performance and deployability at scale, and Nano for on-device applications. Each size is specifically tailored to address different computational limitations and application requirements. We evaluate the performance of Gemini models on a comprehensive suite of internal and external benchmarks covering a wide range of language, coding, reasoning, and multimodal tasks. Gemini advances state-of-the-art in large-scale langua
