Search by job, company or skills
We&aposre partnering with a well-funded AI infrastructure startup that specializes in high-performance systems designed to optimize GPU utilization and deliver energy-efficient computing solutions. With substantial Series A backing and strategic partnerships with leading hardware manufacturers, they&aposre seeking a skilled DevOps Engineer to join their dynamic team and help revolutionize the AI computing landscape.
Role
As a DevOps Engineer, you&aposll be at the forefront of developing cutting-edge container technologies and infrastructure solutions that power AI workloads at scale. Your primary focus will involve architecting and implementing robust Kubernetes-based platforms that seamlessly handle AI scheduling, machine learning model training, and inference operations. You&aposll collaborate closely with engineering teams to build comprehensive monitoring ecosystems, develop high-performance storage solutions, and create networking plugins that maximize efficiency across heterogeneous computing environments. This position offers the opportunity to directly impact how organizations deploy and manage AI infrastructure while working with state-of-the-art GPU technologies and cloud platforms.
Requirements
The ideal candidate brings 3+ years of hands-on Kubernetes platform development experience, with demonstrated expertise in container runtimes, storage systems, and networking architectures specifically for AI applications. We&aposre looking for professionals with a background in GPU computing or cloud service provider environments. Technical proficiency should include deep knowledge of Kubernetes, containerd, runc, along with monitoring tools such as Prometheus and Grafana.
To Apply
To apply, please submit your resume to Yien Quek at [Confidential Information]. We regret to inform that only successful shortlisted candidates will be notified. Licence No: 16S8060 | Registration no: R1109830
Date Posted: 05/09/2025
Job ID: 125531575