当前位置:首页 >英文主页 >中英对照 > 报告详情

智谱:ChatGLM技术报告(英文版)(19页).pdf

上传人: 淘*** 编号:650877 2025-04-07 19页 1.15MB

下载:

1、ChatGLM:A Family of Large Language Modelsfrom GLM-130B to GLM-4 All ToolsTeam GLM1Zhipu AI2Tsinghua UniversityAbstractWe introduce ChatGLM,an evolving family of large language models that we havebeen developing over time.This report primarily focuses on the GLM-4 languageseries,which includes GLM-4,

2、GLM-4-Air,and GLM-4-9B.They represent ourmost capable models that are trained with all the insights and lessons gained fromthe preceding three generations of ChatGLM.To date,the GLM-4 models arepre-trained on ten trillions of tokens mostly in Chinese and English,along witha small set of corpus from

3、24 languages,and aligned primarily for Chinese andEnglish usage.The high-quality alignment is achieved via a multi-stage post-training process,which involves supervised fi ne-tuning and learning from humanfeedback.Evaluations show that GLM-4,1)closely rivals or outperforms GPT-4in terms of general m

4、etrics such as MMLU,GSM8K,MATH,BBH,GPQA,andHumanEval,2)gets close to GPT-4-Turbo in instruction following as measured byIFEval,3)matches GPT-4 Turbo(128K)and Claude 3 for long context tasks,and 4)outperforms GPT-4 in Chinese alignments as measured by AlignBench.The GLM-4 All Tools model is further a

5、ligned to understand user intent and autonomouslydecide when and which tool(s)to useincluding web browser,Python interpreter,text-to-image model,and user-defi ned functionsto effectively complete complextasks.In practical applications,it matches and even surpasses GPT-4 All Toolsin tasks like access

6、ing online information via web browsing and solving mathproblems using Python interpreter.Over the course,we have open-sourced a seriesof models,including ChatGLM-6B(three generations),GLM-4-9B(128K,1M),GLM-4V-9B,WebGLM,and CodeGeeX,attracting over 10 million downloads onHugging face in the year 202

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
本文主要介绍了GLM系列语言模型的发展历程,从GLM-130B到GLM-4 All Tools。GLM系列模型在预训练和后训练技术上不断进步,实现了在多种语言任务上的优异表现。GLM-4模型在学术基准测试、代码问题解决、英语环境中的智能代理能力和中文环境下的长文本处理等方面表现出色,与GPT-4和Claude 3 Opus等模型相比具有竞争力。GLM-4 All Tools模型进一步支持智能代理和用户自定义功能,能够自主理解用户意图,计划复杂指令,并调用多种工具完成复杂任务。此外,GLM系列模型在安全性方面也进行了严格评估和风险缓解。总的来说,GLM系列模型在语言模型领域取得了显著的进步,特别是在处理与中文相关的任务上表现突出。
ChatGLM如何实现多语言模型? GLM-4在哪些方面超越了GPT-4? ChatGLM如何处理长文本任务?
客服
商务合作
小程序
服务号
折叠