
Search by job, company or skills
Responsibilities
Take full ownership of system maintenance, stability, and operational performance.
Ensure the system is production-ready at all times, including post-release support and steady-state operations.
Proactively identify operational risks and drive mitigation plans.
Act as the primary authority for Day-2 matters, decisions, and escalation.
Lead and manage major incidents (Sev 1 / Sev 2) from detection to closure.
Ensure timely escalation, coordination, and resolution across infrastructure, system, and application teams.
Produce structured incident reports, including incident timeline, impact assessment, root cause analysis (RCA), corrective and preventive actions
Present incident findings confidently to external customers and stakeholders.
Track recurring issues and drive problem management initiatives.
Represent the maintenance team in operational meetings, reviews, and briefings with external customers.
Prepare and present maintenance slides, operational dashboard and incident summaries
Communicate system health, risks, and recovery plans clearly to both technical and non-technical audiences.
Build stakeholder trust through transparency, accountability, and delivery.
Work closely with Day-1 project teams
Align expectations on operational readiness and support scope
Review release schedules, cutover plans, and post-go-live support
Ensure proper handover of documentation, SOPs, and system knowledge
Validate that new releases are supportable, operable, maintainable before acceptance into Day-2
Raise gaps early and enforce operational acceptance criteria.
Define, maintain, and enforce maintenance SOPs, escalation paths, support workflows
Ensure disciplined execution of incident management, problem management, change management processes
Drive continuous improvement of operational processes and service quality.
Support audits, reviews, and governance checks required for government projects.
Lead and coordinate Day-2 Infrastructure, System, and Application teams.
Ensure clear roles, responsibilities, and accountability across support functions.
Manage duty rosters, standby arrangements, operational coverage
Coach team members on incident handling, reporting quality, operational discipline
Requirements:
Bachelor's Degree in IT, Computer Science, Engineering, or equivalent
With 8-12 years of IT experience
At least 3-5 years in a role such as Maintenance Manager, Operations Lead, Production Support Lead
Proven experience managing mission-critical production systems
Strong hands-on experience in IT Maintenance / Day-2 Operations
Proven ownership of end-to-end incident management, including major incidents
Ability to produce high-quality incident reports suitable for external customers
Experience presenting maintenance and incident reports/slides to stakeholders
Experience collaborating with Day-1 project teams
If you are keen, please email your updated resume to [Confidential Information]
EA License no.14C7275/Registration no. R1434860
Please take note that only shortlisted candidate will be contacted. Thank you.
Job ID: 144991847