HC2022.KAIST.SeongminHong.v01.pdf


DFX: A Low-latency Multi-FPGA Appliance for Accelerating Transformer-based Text Generation

Seongmin Hong1, Seungjae Moon1, Junsoo Kim1, Sungjae Lee2, Minsub Kim2, Dongsoo Lee2, and Joo-Young Kim1
1 CastLab, School of EE, KAIST; 2 NAVER CLOVA
HOTCHIPS22 Poster Session

Text Generation

Text generation: automatic generation of human-readable text by a computer. Examples: dialogue systems, topic-to-essay generation, and code generation.

Generative Pre-trained Transformer (GPT): a state-of-the-art model in natural language processing that scales up to 175B parameters. It offers high-quality text generation and remarkable inference accuracy on benchmarks (e.g., LAMBADA).

[Figure: a language model (GPT, a stack of decoder layers) receives the input tokens "Hello, my name is" in the summarization stage and emits the output tokens "James Smith and ..." in the generation stage.]

Transformer-based Text Generation

Transformer-based text generation consists of a summarization stage and a generation stage.
Summarization stage: processes the input words given by the user.
Generation stage: sequentially produces output words with the language model.

[Figure: summarization and generation stages of the model. Recoverable labels: Token Embedding, Positional Encoding, Decoder Layer 1 through Decoder Layer N, LayerNorm, Self-Attention, Softmax, Concat K,V, Residual, Feed-Forward Network, Fully-Connected, GELU, LM head, WTE, Token ID, Emb Vector, Matrix, Vector. The remaining labels are truncated in this preview.]
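The two stages described above can be illustrated with a small, self-contained Python sketch. It is not the DFX implementation and contains no real transformer: `toy_decoder_step`, `VOCAB_SIZE`, and the next-token rule are hypothetical stand-ins chosen only to make the control flow runnable. What it does show is the structure named on the poster: the summarization stage consumes the user's input tokens and fills a key/value cache (the "Concat K,V" label), and the generation stage then feeds each produced token back in, one at a time.

```python
from typing import List, Tuple

VOCAB_SIZE = 50  # hypothetical toy vocabulary size

# Each cache entry stands in for the (key, value) pair of one processed token.
KVCache = List[Tuple[int, int]]


def toy_decoder_step(token: int, position: int, cache: KVCache) -> int:
    """Process one token and return a predicted next-token id.

    Appending to the cache mimics "Concat K,V": earlier tokens are never
    reprocessed, only their cached entries are reused.
    """
    cache.append((position, token))
    # Toy rule (an assumption, not a real model): the prediction depends
    # on every cached token, as attention over the cache would.
    return (sum(value for _, value in cache) + len(cache)) % VOCAB_SIZE


def generate(input_tokens: List[int], num_output_tokens: int) -> List[int]:
    """Run the summarization stage, then the generation stage."""
    if not input_tokens or num_output_tokens < 1:
        raise ValueError("need at least one input token and one output token")

    cache: KVCache = []

    # Summarization stage: consume all input tokens. A real system can
    # process them together; this sketch loops for simplicity. Only the
    # prediction after the last input token is kept.
    next_token = 0
    for position, token in enumerate(input_tokens):
        next_token = toy_decoder_step(token, position, cache)

    # Generation stage: strictly sequential, because each step needs the
    # token produced by the previous step.
    outputs = [next_token]
    while len(outputs) < num_output_tokens:
        next_token = toy_decoder_step(outputs[-1], len(cache), cache)
        outputs.append(next_token)
    return outputs


if __name__ == "__main__":
    print(generate([1, 2, 3], 4))
```

The sequential dependency in the second loop is the point of the sketch: each output token requires one full pass over the model, so generation-stage latency grows with the number of output tokens.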
