We have partnered with a financial services institution to hire a hands-on Platform Infrastructure Engineer. This role sits within the Infrastructure team and focuses on the day-to-day operation, stability, and reliability of Kubernetes and OpenShift platforms.
This is a core infrastructure role, not a DevOps or application engineering position. The successful candidate will act as the local SME for container platforms and work closely with global infrastructure teams across regions.
Key Responsibilities
- Operate and maintain Kubernetes and Red Hat OpenShift clusters in a production environment
- Perform business-as-usual (BAU) activities including patching, upgrades, monitoring, and troubleshooting
- Support platform stability, availability, and performance across environments
- Manage namespaces, access control, networking, routing, and platform configuration
- Work with monitoring and logging tools to ensure proactive issue detection
- Respond to infrastructure incidents, participate in on-call rotations, and support incident resolution
- Collaborate with global infrastructure teams on standards, changes, and deployments
- Work closely with local infrastructure, network, and server teams
- Support on-site infrastructure activities, including data centre visits when required
- Maintain documentation, runbooks, and operational procedures
Required Experience
- At least 6 years of experience in infrastructure or platform engineering
- Strong experience administering Kubernetes in production
- Hands-on experience with Red Hat OpenShift (on-prem, cloud, or hybrid)
- Solid Linux system administration background
- Good understanding of networking fundamentals (DNS, routing, ingress)
- Experience supporting production environments with operational responsibility
- Comfortable working independently and owning platform outcomes
- Experience with monitoring, logging, and incident management
Nice to Have
- Exposure to hybrid infrastructure environments (on-prem + cloud)
- Familiarity with ITIL-based incident and change processes
- Experience working in regulated or enterprise environments
- Basic scripting or automation experience (Bash, Python, Ansible)
- Understanding of application deployment concepts on container platforms