- Design, develop, and maintain the central automation platform supporting hybrid cloud environments (AWS, Azure, on-prem).
- Build reusable automation frameworks, modules, and APIs that other teams can leverage to design their own automation workflows.
- Define and enforce automation standards, governance models, and best practices across the enterprise.
- Provide enablement and onboarding support for infrastructure, and application teams adopting the automation platform.
- Develop documentation, templates, and reference architectures to accelerate automation adoption across teams.
- Implement Infrastructure-as-Code (IaC) using Terraform and Ansible to ensure consistent, auditable, and secure provisioning.
- Develop orchestration workflows integrating with infrastructure components (Windows, Linux, DNS, AD, vCenter, Databases, etc.) via SDKs and APIs.
- Automate operational tasks, runbooks, and incident response workflows to reduce Mean Time to Detect (MTTD) and Mean Time to Resolve (MTTR).
- Build self-healing and auto-remediation capabilities using event-driven automation frameworks
- Deep understanding of SRE principles, service health modelling, error budgets, and auto-remediation design.
Very Good understanding of Infrastructure and Cyber Services and applications.
- Familiarity with financial sector operational resilience frameworks, regulatory compliance, and incident governance.
Skills:
- Minimum 5 years of experience in Infrastructure automation or SRE roles, with strong automation and platform engineering focus.
- Proven experience designing and delivering enterprise-scale automation and orchestration solutions.
Technical Skills
- Integration & Orchestration: ServiceNow Orchestration, Microsoft System Center Orchestrator, BMC Helix, or equivalent.
- Automation / IaC: Terraform, Ansible, Ansible EDA, Python, PowerShell, Bash.
- Compliance & Policy Enforcement: Terraform Sentinel, Ansible Lint.
- Cloud Platforms: AWS and Azure.
- Observability Tools: DataDog, Dynatrace, Splunk, ELK.
- CI/CD & DevOps: GitHub, Github Actions.
- Security & Governance: Familiarity with encryption, access management, and regulatory frameworks (e.g., MAS TRM, DORA).