Loading…
In-person + Virtual
16 -20 May
Learn More and Register to Attend

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon Europe 2022 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in Central European Summer Time (UTC +2). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change.
Back To Schedule
Wednesday, May 18 • 16:30 - 17:05
Supporting Long-Lived Pods Using a Simple Kubernetes Webhook - Clément Labbe, Slack

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.


Today's applications strive to boot fast, be stateless, and handle unexpected terminations gracefully. However, some applications like distributed caches can take a while to warm up to a running state, while batch workers would rather avoid being terminated before they're done. At Slack, such applications found their home in Kubernetes thanks to a two-sided system: one one hand an admission webhook injects tolerations in pods to inform their requirement to be long-lived, and on the other hand a custom service taints nodes with their uptime. This results in pods desiring a long life to be scheduled on young nodes less likely to be terminated early. This talk will first describe how to write a simple Kubernetes admission webhook (https://github.com/slackhq/simple-kubernetes-webhook) to inject tolerations in pods, then move onto the symbiotic node tainting system, and end with gotchas and some metrics on how this long-lived pod support is used at Slack.

Click here to view captioning/translation in the MeetingPlay platform!

Speakers
avatar for Clément Labbe

Clément Labbe

Senior Software Engineer, Cloud, Slack
Clem is a cloud engineer approaching a decade of passionately working with distributed systems and web technologies. He loves solving application delivery in DevOps environments by developing tools in Go, and building resilient infrastructure using Kubernetes on AWS or GCP. 18 months... Read More →



Wednesday May 18, 2022 16:30 - 17:05 CEST
Pavilion 4, Room A | Level 2 | Central Forum Feria Valencia