3412 - KServe 深度解析:面向生成式人工智能时代的演进型模型服务.pdf

编号:982939 PDF 51页 3.60MB 下载积分:VIP专享
下载报告请您先登录!

1、Yuan TangSenior Principal Software Engineer,Red Hat AIProject Lead:KServe,Argo,and KubeflowCo-chair:K8s WG Serving and WG AI ConformanceCo-chair:CNCF TAG Workloads FoundationKServe Deep Dive:Evolving Model Serving for the Generative AI EraAgenda0102030405What is KServeHistoryOpen Source CommunityArc

2、hitectureKey FeaturesWhat is KServe?Highly scalable,standard,cloud agnostic model inference platform on Kubernetes#IBMTechXchangeWhy KServe?Provides performant,standardized inference protocol across ML frameworks.Supports modern serverless inference workload with autoscaling including Scale to Zero

3、on GPU.Provides high scalability,density packing and intelligent routing using ModelMesh4Simple and pluggableproduction serving for production ML serving including prediction,pre/post processing,monitoring and explainability.Advanced deploymentswith canary rollout,experiments,ensembles and transform

4、ers.GenAI capabilities:Envoy AI Gateway,KEDA,LMCache.Model Cache,vLLM multi-node inference#IBMTechXchange5History of KServeFrom Kubeflow/KFServingto KServeDeveloped collaboratively by Google,IBM,Bloomberg,NVIDIA,and Seldon in 2019 under the Kubeflow project.The project graduated from Kubeflow and wa

5、s rebranded from KFServing to standalone KServe project in Sep 2021.KServe 0.7 released outside of the Kubeflow with migration guide for minimal disruptions in Oct 2021.KServe was donated to LF AI&Data Foundation in Nov 2021.KServe was accepted as a CNCF incubating project Oct 2025#IBMTechXchange6Hi

6、story of KServeKubeflow Ecosystem#IBMTechXchange7Community:Maintainers and Contributors19 maintainers and 300+contributors!#IBMTechXchange8Community:Adopters30+companies varying from vendors to end users!#IBMTechXchange9KServe Architecture#IBMTechXchange10KServe Feature:Serving RuntimesPluggable,reu

友情提示

1、下载报告失败解决办法
2、PDF文件下载后,可能会被浏览器默认打开,此种情况可以点击浏览器菜单,保存网页到桌面,就可以正常下载了。
3、本站不支持迅雷下载,请使用电脑自带的IE浏览器,或者360浏览器、谷歌浏览器下载即可。
4、本站报告下载后的文档和图纸-无水印,预览文档经过压缩,下载后原文更清晰。

本文(3412 - KServe 深度解析:面向生成式人工智能时代的演进型模型服务.pdf)为本站 (竿头日上) 主动上传,三个皮匠报告文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。 若此文所含内容侵犯了您的版权或隐私,请立即通知三个皮匠报告文库(点击联系客服),我们立即给予删除!

温馨提示:如果因为网速或其他原因下载失败请重新下载,重复下载不扣分。
客服
商务合作
小程序
服务号
折叠