In-person + Virtual
16 -20 May
Learn More and Register to Attend

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon Europe 2022 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in Central European Summer Time (UTC +2). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change.
Back To Schedule
Wednesday, May 18 • 14:30 - 15:05
Improving GPU Utilization using Kubernetes - Maulin Patel & Pradeep Venkatachalam, Google

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.

Kubernetes supports efficient utilization of resources by enabling applications to request the precise amounts of resources it needs. Unlike fractional requests for CPUs, fractional requests for GPUs are not allowed in Kubernetes. GPU resources requested in the pod manifest must be an integer number. This means one GPU is fully allocated to one container even if the container only needs a fraction of GPU for its workload. Without the support for fractional GPUs, GPU resources are invariably over provisioned leading to a wastage. This is especially true for inference workloads that process a handful of data samples in real-time. To address this limitation, we have developed user-friendly solutions that allow a single GPU to be shared by multiple containers thereby improving utilization of GPUs and saving cost. In this talk, we will show the demos of our solutions and share performance results.

Click here to view captioning/translation in the MeetingPlay platform!

avatar for Maulin Patel

Maulin Patel

Group Product Manager, Google
Maulin Patel is a Group Product Manager at Google. Prior to his current role, he was a GM at GE and a Director at Philips Research. Maulin has a proven track record of innovations in IoT, AI/ML, cloud and smart buildings. He has experience in executing DoD, DoE, NSF and privately... Read More →

Pradeep Venkatachalam

Software Engineer, Google
Pradeep Venkatachalam is a Senior Software Engineer on the GKE (Google Kubernetes Engine) team at Google Cloud. One of Pradeep’s key focus has been to improve the accelerators ecosystem built on GKE. Pradeep has been involved in a number of reliability initiatives as well as bringing... Read More →

Wednesday May 18, 2022 14:30 - 15:05 CEST
Pavilion 4, Room B | Level 2 | Central Forum