当前位置:首页 > 报告详情

陈争胜-在边缘DC部署大模型:实践和加速 陈争胜.pdf

上传人: 2*** 编号:134844 2023-07-08 14页 2.95MB

1、Deploying Large Model in edge DC:practice and accelerationJack ChenYsemi Computing遇 贤 微 电 子致力于研发高性能计算和数据中心的CPUYSEMI Computing Introduction7/17/20232 YSEMI Computing founded in 2020YSEMI Computing founded in 2020 100+Employees across Shenzhen/Shanghai/Xian100+Employees across Shenzhen/Shanghai/Xian P

2、roviding silicon,platform and system for cloud computing Providing silicon,platform and system for cloud computing data centersdata centers First generation product is 160Cores Armv9 Datacenter First generation product is 160Cores Armv9 Datacenter CPU,3.2GHz and estimatedCPU,3.2GHz and estimated 620

3、+SPECint2017620+SPECint2017 Members of LF Members of LF EdageEdage,OpenEulerOpenEuler,ODCC etc.,ODCC etc.Why Large Language Mode on edge is important?7/17/20233Data PrivacyPervasive AILatencyChatGPT-like LLM user cases in edge DC7/17/20234code explanationTime Complexity Calculationprogram code trans

4、lationFix code bugsparagraph productionstory creationsummary descriptionText CategorizationPerson switchCategoryFAQreview generationText Sentiment AnalysisAdvanced Sentiment Scoringinterview questions and answersText to Emojilanguage chatbotEngineeringCommunicationContent generationMarketingChalleng

5、es in LLM deployment on edge7/17/20235ChallengesDemands are hugeThe recent rise of ChatGPT-like Large Language Model(LLM)has promoted the vigorous development of AI on the application side,which also puts forward unprecedented demands on edge devices.Computing Force is limitedGPT-3 has 175 billion p

6、arameters while GPT-4 has more parameters.Big model size and large training computing force will limit the usage in edge and terminals.smaller model size moderate computing force lower but usable accuracyIP PhoneTablet PCPC ClientMobile ClientThin clientTV screenCollaborationSoftwareComputing force

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
本文探讨了在边缘数据中心部署大型语言模型(LLM)的重要性和实践方法。文章指出,随着类似ChatGPT的大型语言模型的兴起,AI应用对边缘设备提出了前所未有的要求。这些大型模型需要大量的计算资源和存储空间,对边缘设备构成了挑战。文章介绍了一种由YSEMI Computing开发的针对边缘计算优化的处理器,以及如何通过优化模型和采用合适的计算架构来提高LLM在边缘设备上的性能。此外,文章还提到了通过AI优化处理器设计的方法,例如通过电路协同设计来生成满足特定输入输出规格的布尔函数。总体而言,文章强调了在边缘数据中心部署高效、低功耗的LLM模型的重要性,并提出了相应的技术解决方案。
如何实现边缘数据中心的高效推理? 大模型在边缘设备上的挑战与解决方案是什么? YSEMI Computing如何推动边缘数据中心的发展?
客服
商务合作
小程序
服务号
折叠