Role Overview
Responsible for 24x7 operational support of IT systems hosted in on-premise data centres and the Government Commercial Cloud (GCC/AWS). The role focuses on infrastructure reliability, incident resolution, cloud operations, automation, vendor coordination, and cost management within a government-compliant environment.
Key Responsibilities
- Provide production support for GCC (AWS) and on-prem systems to ensure high availability and minimal downtime.
- Troubleshoot and resolve application, infrastructure, and security incidents in line with SLAs.
- Maintain compliance with government IT policies and security standards.
- Manage cloud resources including compute, networking, security groups, IAM, and monitoring dashboards.
- Execute production deployments, manage release schedules, perform validation and rollback when required, and support zero-downtime deployments.
- Develop and maintain deployment pipelines and automation scripts.
- Use Terraform and Ansible for infrastructure provisioning and configuration management.
- Maintain SOPs, runbooks, and operational documentation.
- Coordinate with vendors and cross-functional teams to resolve issues and support project rollouts.
- Support budgeting, billing, cost tracking, and cost recovery activities.
Requirements
- Bachelor's degree in IT, Computer Science, or related field (or equivalent experience).
- Singapore Citizen.
- 4-6 years experience in IT operations or infrastructure support.
- Hands-on experience with Windows/Linux servers and cloud platforms (GCC/AWS preferred).
- Strong troubleshooting skills across application, infrastructure, and security domains.
- Experience with ITSM tools such as Jira or ServiceNow.
- Familiarity with enterprise IT governance frameworks, especially in regulated or government environments.
- Certifications such as ITIL, AWS Essentials, or AWS Solutions Architect are advantageous.
- Strong analytical skills and ability to manage multiple priorities independently.