Platform Engineer (AI Infrastructure)
We are hiring a Platform Engineer to help build and evolve the software platform behind large scale AI infrastructure. This is a hands on engineering role for someone who can write strong Python, work deeply with Kubernetes, design and build platform applications, and operate close to bare metal infrastructure. You will help build the systems that make GPU compute easier to provision, operate, secure and scale across AI infrastructure environments. This is not a generic DevOps role. We are not looking for someone who has only maintained pipelines, written Terraform or managed cloud services. We need someone who can build real platform software and understands the infrastructure it runs on. What you will do Design and build platform applications, APIs and services Write production grade Python for infrastructure and platform use cases Work with Kubernetes to build scalable platform capabilities Design and build Kubernetes operators and controllers across compute, storage and networking Build tooling that improves how bare metal and GPU infrastructure is provisioned, operated and monitored Translate operational pain points into scalable platform features Improve platform reliability, observability and performance Work across Linux, networking, storage and distributed systems Collaborate with product, security, infrastructure, networking and compute teams Help build the platform layer for AI infrastructure designed to operate at industrial scale What we are looking for Strong ..... full job details .....
Other jobs of interest...
Perform a fresh search...
-
Create your ideal job search criteria by
completing our quick and simple form and
receive daily job alerts tailored to you!