Large Language Models for Code: Exploring the Current State, Opportunities, and Challenges

Large Language Models for Code
Loubna Ben Allal, Machine Learning Engineer, Science team

About me
- ML Engineer, Hugging Face
- Graduated from ENS Paris Saclay & Ecole des Mines de Nancy
- Working on LLMs for code & synthetic data: "The Stack, StarCoder, Cosmopedia."
LoubnaBenAllal1
https://loubnabnl.github.io/

How it started: GitHub Copilot in 2021

ML + Code = Productivity
https:/
- Engines + ML lead to 6% reduction in code iterations
- 3% of code generated by model
But: API; Model: X; Data: X; Code: X

How it's going: Over 1.7k open models trained on code

How did we get here?
Strong instruction-tuned and base models

How are code LLMs trained?

What you need to train (code) LLMs from scratch
Transformer Model
Untrained Model -> Pretrained "Base" Model -> Supervised Finetuned (SFT) Model -> RLHF -> Chat LLM (e.g. GPT-4)
Training Generative AI Models
Untrained Model -> Pretrained "Base" Model -> Supervised Finetuned (SFT) Model -> RLHF -> Chat LLM (e.g. GPT-4)
Training Code LLMs
Instruction dataset for code: "write a function", "solve a bug".

The Landscape of code LLMs
- The Stack dataset
- StarCoder
- StarCoder2: 3B, 7B, 15B sizes
- StarChat2 (with H4 team)
- DeepSeek-Coder: 1B, 7B, 33B
- DeepSeek-Coder-Instruct
- CodeLlama: 7B, 13B, 70B
- CodeLlama-Instruct
- Others: StableCode from StabilityAI, CodeGen from SalesForce, and LLMs like Mixtral, DBRX, Qwen & Yi

BigCode: open-scientific collaboration
We are building LLMs for code in a collaborative way:
- Full data transparency
- Open source processing and training code
- Model weights released with commercial friendly license
1100+ researchers, engineers, lawyers, and policy makers

Closed Source
- Training data and sources not disclosed
- Model weights not public
- Sending data to external APIs
- Not reproducible

Open Source
- Public data with inspection and opt-out tools
- Model weights
