当前位置：首页 >英文主页 >中英对照 > 中译版报告详情

DeepSeek Coder V2技术报告（中译版）（19页）.pdf

上传人：淘*** 编号：650880 2025-04-07 PDF PDF 中文版中文版中文版 DOCX DOCX DOCX 19页 387.03KB 16张图表

下载：

文档加载中……请稍候！
如果长时间未打开，您也可以点击刷新试试。

下载报告到电脑，查找使用更方便

VIP专享文档

书签

分享

收藏

已收藏

版权投诉

/19

立即下载

《DeepSeek Coder V2技术报告（英文版）（19页）.pdf》由会员分享，可在线阅读，更多相关《DeepSeek Coder V2技术报告（英文版）（19页）.pdf（19页珍藏版）》请在三个皮匠报告上搜索。

1、DeepSeek-Coder-V2:Breaking the Barrier of Closed-SourceModels in Code IntelligenceQihao Zhu*,Daya Guo*,Zhihong Shao*,Dejian Yang*,Peiyi Wang,Runxin Xu,Y.WuYukun Li,Huazuo Gao,Shirong Ma,Wangding Zeng,Xiao Bi,Zihui Gu,Hanwei Xu,Damai DaiKai Dong,Liyue Zhang,Yishi Piao,Zhibin Gou,Zhenda Xie,Zhewen Hao

2、,Bingxuan WangJunxiao Song,Deli Chen,Xin Xie,Kang Guan,Yuxiang You,Aixin Liu,Qiushi Du,Wenjun GaoXuan Lu,Qinyu Chen,Yaohui Wang,Chengqi Deng,Jiashi Li,Chenggang ZhaoChong Ruan,Fuli Luo,Wenfeng LiangDeepSeek-AIhttps:/ present DeepSeek-Coder-V2,an open-source Mixture-of-Experts(MoE)code languagemodel

3、that achieves performance comparable to GPT4-Turbo in code-specific tasks.Specifically,DeepSeek-Coder-V2 is further pre-trained from an intermediate checkpoint of DeepSeek-V2with additional 6 trillion tokens.Through this continued pre-training,DeepSeek-Coder-V2substantially enhances the coding and m

4、athematical reasoning capabilities of DeepSeek-V2,while maintaining comparable performance in general language tasks.Compared to DeepSeek-Coder-33B,DeepSeek-Coder-V2 demonstrates significant advancements in various aspects ofcode-related tasks,as well as reasoning and general capabilities.Additional

5、ly,DeepSeek-Coder-V2 expands its support for programming languages from 86 to 338,while extending the contextlength from 16K to 128K.In standard benchmark evaluations,DeepSeek-Coder-V2 achievessuperior performance compared to closed-source models such as GPT4-Turbo,Claude 3 Opus,and Gemini 1.5 Pro i

6、n coding and math benchmarks.HumanEvalMBPP+MATHGSM8K5060708090100Accuracy(%)90.276.275.794.988.272.273.493.783.574.667.790.884.972.060.195.081.769.050.493.081.168.2AiderLiveCodeBenchSWE-Bench0102030405060708073.743.412.763.945.718.357.134.118.768.434.611.749.228.751.131.02.7DeepSeek-Coder-V2GPT-4-Tu

word格式文档无特别注明外均可编辑修改，预览文件经过压缩，下载原文更清晰！

三个皮匠报告文库所有资源均是客户上传分享，仅供网友学习交流，未经上传用户书面授权，请勿作商用。

本文介绍了DeepSeek-Coder-V2，这是一个开源的混合专家（MoE）代码语言模型，其性能可与GPT4-Turbo相媲美。具体来说，DeepSeek-Coder-V2从DeepSeek-V2的中间检查点进一步预训练，并增加了6万亿个标记的数据。通过继续预训练，DeepSeek-Coder-V2显著提高了DeepSeek-V2的编码和数学推理能力，同时保持了在通用语言任务中的可比性能。与DeepSeek-Coder-33B相比，DeepSeek-Coder-V2在代码相关任务、推理和通用能力方面取得了显著的进步。此外，DeepSeek-Coder-V2支持编程语言从86种增加到338种，并将上下文长度从16K扩展到128K。在标准基准评估中，DeepSeek-Coder-V2在编码和数学基准测试中优于封闭源模型，如GPT4-Turbo、Claude 3 Opus和Gemini 1.5 Pro。

"DeepSeek-Coder-V2如何超越GPT4-Turbo？" "开源代码模型如何缩小与闭源模型之间的差距？" "DeepSeek-Coder-V2在编程语言支持上有哪些突破？"

全行业研究报告分享下载平台

0731-84720580
商务合作：really158d
友链申请 (QQ)：1737380874

关于我们

更多

关于我们

三个皮匠报告微信公众号

三个皮匠报告微信小程序

扫码咨询商务合作事宜

友情链接：

营销自动化亿欧智库微播易阿里妈妈

copyright@2008-2013 长沙思想领动信息技术有限公司版权所有网站备案/许可证号：湘B2-20190120 | 工信部备案号：湘ICP备2023027541号-2 | 公安备案号：湘公网安备43010402001071号

客服

小程序

服务号

折叠