Search by job, company or skills

F

Site Reliability Engineer III

6-8 Years
SGD 8,500 - 17,000 per month
new job description bg glownew job description bg glownew job description bg svg
  • Posted 14 days ago
  • Be among the first 10 applicants
Early Applicant

Job Description

As a Site Reliability Engineer III, part of the Operational Support Systems (OSS) organization under the Security & Distributed Cloud organization, you will develop and manage highly scalable infrastructure and Identity and Access Management (IAM) systems to support business operations and F5 products.

This position is focused on Identity and Access Management (IAM) workflows and related automation tasks. You will lead the design and optimization of automation pipelines, troubleshoot escalated system incidents, write and update team documentation, enhance monitoring processes, and mentor junior engineers. You will collaborate closely with U.S.-based IAM counterparts as part of a follow-the-sun support model to improve global scalability and team alignment.

What You'll Do:

  • IAM Operations Leadership: Manage and oversee IAM workflows, including provisioning, deprovisioning, auditing, and identity-related escalations for systems such as Okta and Entra ID, ensuring SLA adherence and operational excellence

  • Automation Pipeline Design: Lead the design, configuration, and optimization of automation pipelines, integrating IAM workflows with deployment processes to improve scalability and security

  • Advanced Troubleshooting: Resolve escalated issues across IAM systems and automation workflows, including ticket escalations from tools like Jira and ServiceNow, and propose long-term fixes for recurring issues

  • IAM Automation: Develop and manage complex IAM automation workflows using Python, Selenium, Infrastructure-as-Code (IaC) tools like Terraform, Bash, or PowerShell to eliminate manual repetitive tasks, including RBAC updates and compliance reports

  • Documentation: Create and maintain detailed technical documentation covering IAM workflows, automation scripts, pipeline architectures, and troubleshooting playbooks to support operational clarity and team scalability

  • Project Leadership: Drive initiatives such as Identity Provider (IdP) migrations for systems like Okta and Entra ID, system consolidation during mergers and acquisitions (M&A), and integrating IAM systems for improved scalability and security

  • Monitoring and Performance Optimization: Deploy and manage monitoring tools, such as Prometheus, Grafana, and Splunk, to track performance metrics and proactively resolve bottlenecks in IAM workflows and automation pipelines

  • On-Call Support: Participate in the on-call rotation, leading incident response during critical outages and escalations to restore services and ensure operational readiness

  • Mentorship and Collaboration: Mentor SRE engineers, provide guidance on IAM processes, automation workflows, documentation writing, and troubleshooting practices

What You'll Bring:

  • Bachelor's degree in computer science, information technology, or equivalent experience, with 6+ years in relevant technical roles such as Site Reliability Engineering or System Engineering

  • Advanced experience managing IAM workflows and systems, including provisioning, deprovisioning, role-based access control (RBAC), and auditing tools such as Okta, Entra ID, LDAP, or SAML-based systems

  • Expertise in Python scripting for building automation workflows, along with proficiency in Bash or PowerShell

  • Strong experience designing scalable automation pipelines integrated with deployment workflows to improve identity systems

  • Familiarity with IaC tools such as Terraform, Ansible, or CloudFormation to enable provisioning and resource management

  • Advanced knowledge of monitoring platforms including Prometheus, Grafana, or Splunk to deploy metrics tracking and infrastructure observability

  • Solid understanding of ticket workflows and experience using Jira and ServiceNow for managing escalations and system requests

  • Proven ability to write and maintain clear, thorough documentation to support operational workflows, troubleshooting efforts, and organizational onboarding

  • Demonstrated ability to mentor junior SREs and contribute technical leadership in IAM-related projects

Preferred:

  • Certifications such as Certified Identity and Access Manager (CIAM), ITIL, or public cloud certifications (AWS Certified DevOps Engineer, Google Professional DevOps Engineer)

  • Proven experience leading Identity Provider (IdP) migrations and system consolidations due to M&A

  • Hands-on experience with container orchestration platforms, such as Kubernetes or Docker

What you'll get:

  • This is a self-driven role, so career growth and development is available at every stage of your career

  • Competitive pay, , and cool perks

  • Tuition assistance for professional development

  • , strong diversity and inclusion interest groups

More Info

Job Type:
Industry:
Employment Type:

Job ID: 139629359