
Search by job, company or skills
Design, build, and automate the full lifecycle management of Red Hat OpenShift clusters, treating infrastructure as code.
Develop and maintain automation for cluster provisioning, upgrades, scaling, and recovery using Ansible, Terraform, or Python.
Implement, configure, and manage OpenShift components including:
OVN-Kubernetes networking
OpenShift Data Foundation (ODF) / Ceph storage
Optimize platform reliability, performance, and scalability to ensure high availability and resiliency.
Implement and maintain GitOps workflows for cluster configuration and deployment management.
Monitor cluster health and performance, troubleshoot issues, and implement proactive improvements.
Collaborate with engineering and application teams to support containerized workloads and platform best practices.
Experience: Minimum 6 years in DevOps, Platform Engineering, or Site Reliability Engineering roles.
Strong expertise in Red Hat OpenShift and Kubernetes architecture & internals.
Proficiency in scripting and automation using Bash and/or Python.
Hands-on experience with Infrastructure as Code (IaC) tools such as Terraform or Ansible.
Solid understanding of container platform networking (SDN, OVN) and storage (Ceph/ODF).
Experience implementing GitOps practices and related tooling.
Strong troubleshooting, performance tuning, and reliability engineering skills.
Job ID: 142882743