
Search by job, company or skills
About SHIP-HATS
SHIP-HATS (Secure Hybrid Integration Pipeline - Hive Agile Testing Solutions) is the Singapore Government's centralised CI/CD platform that enables public agencies to adopt DevSecOps practices. As part of the team, you will help ensure the platform is reliable, scalable, and secure for the hundreds of development teams across the whole-of-government that depend on it daily.
What You Will Do
As an SRE on the SHIP-HATS team, you will own the reliability and operational excellence of the platform end-to-end. You will define and track Service Level Objectives (SLOs) and Service Level Indicators (SLIs), and work closely with engineering and product teams to reduce toil and improve system resilience. You will design and implement observability solutions - covering logging, metrics, and distributed tracing - to give the team deep visibility into platform health.
You will lead incident response efforts, conduct thorough post-mortems, and drive systemic improvements to prevent recurrence. Beyond firefighting, you will contribute to the platform's infrastructure-as-code posture, automating provisioning, configuration, and deployment pipelines on cloud and on-premises environments. You will also participate in capacity planning exercises and performance tuning to ensure the platform scales with growing government demand.
Collaboration is central to this role. You will work alongside development teams to embed reliability thinking early in the software development lifecycle, reviewing architectures and advocating for operability best practices.
What We Are Looking For
Good to Have
Job ID: 148371391
We don’t charge any money for job offers