
Search by job, company or skills
As a Site Reliability Engineer III, part of the Operational Support Systems (OSS) organization under the Security & Distributed Cloud organization, you will develop and manage highly scalable infrastructure and Identity and Access Management (IAM) systems to support business operations and F5 products.
This position is focused on Identity and Access Management (IAM) workflows and related automation tasks. You will lead the design and optimization of automation pipelines, troubleshoot escalated system incidents, write and update team documentation, enhance monitoring processes, and mentor junior engineers. You will collaborate closely with U.S.-based IAM counterparts as part of a follow-the-sun support model to improve global scalability and team alignment.
What You'll Do:
IAM Operations Leadership: Manage and oversee IAM workflows, including provisioning, deprovisioning, auditing, and identity-related escalations for systems such as Okta and Entra ID, ensuring SLA adherence and operational excellence
Automation Pipeline Design: Lead the design, configuration, and optimization of automation pipelines, integrating IAM workflows with deployment processes to improve scalability and security
Advanced Troubleshooting: Resolve escalated issues across IAM systems and automation workflows, including ticket escalations from tools like Jira and ServiceNow, and propose long-term fixes for recurring issues
IAM Automation: Develop and manage complex IAM automation workflows using Python, Selenium, Infrastructure-as-Code (IaC) tools like Terraform, Bash, or PowerShell to eliminate manual repetitive tasks, including RBAC updates and compliance reports
Documentation: Create and maintain detailed technical documentation covering IAM workflows, automation scripts, pipeline architectures, and troubleshooting playbooks to support operational clarity and team scalability
Project Leadership: Drive initiatives such as Identity Provider (IdP) migrations for systems like Okta and Entra ID, system consolidation during mergers and acquisitions (M&A), and integrating IAM systems for improved scalability and security
Monitoring and Performance Optimization: Deploy and manage monitoring tools, such as Prometheus, Grafana, and Splunk, to track performance metrics and proactively resolve bottlenecks in IAM workflows and automation pipelines
On-Call Support: Participate in the on-call rotation, leading incident response during critical outages and escalations to restore services and ensure operational readiness
Mentorship and Collaboration: Mentor SRE engineers, provide guidance on IAM processes, automation workflows, documentation writing, and troubleshooting practices
What You'll Bring:
Bachelor's degree in computer science, information technology, or equivalent experience, with 6+ years in relevant technical roles such as Site Reliability Engineering or System Engineering
Advanced experience managing IAM workflows and systems, including provisioning, deprovisioning, role-based access control (RBAC), and auditing tools such as Okta, Entra ID, LDAP, or SAML-based systems
Expertise in Python scripting for building automation workflows, along with proficiency in Bash or PowerShell
Strong experience designing scalable automation pipelines integrated with deployment workflows to improve identity systems
Familiarity with IaC tools such as Terraform, Ansible, or CloudFormation to enable provisioning and resource management
Advanced knowledge of monitoring platforms including Prometheus, Grafana, or Splunk to deploy metrics tracking and infrastructure observability
Solid understanding of ticket workflows and experience using Jira and ServiceNow for managing escalations and system requests
Proven ability to write and maintain clear, thorough documentation to support operational workflows, troubleshooting efforts, and organizational onboarding
Demonstrated ability to mentor junior SREs and contribute technical leadership in IAM-related projects
Preferred:
Certifications such as Certified Identity and Access Manager (CIAM), ITIL, or public cloud certifications (AWS Certified DevOps Engineer, Google Professional DevOps Engineer)
Proven experience leading Identity Provider (IdP) migrations and system consolidations due to M&A
Hands-on experience with container orchestration platforms, such as Kubernetes or Docker
What you'll get:
This is a self-driven role, so career growth and development is available at every stage of your career
Competitive pay, , and cool perks
Tuition assistance for professional development
, strong diversity and inclusion interest groups
Job ID: 139629359