Key Responsibilities
1. Systems & Infrastructure Management
- Manage, monitor, and maintain servers, virtualization platforms, storage devices, firewalls, and related infrastructure.
- Install, configure, test, and maintain operating systems (Windows Server, RHEL), middleware, application software, and system management tools.
- Perform system upgrades, patch management, capacity planning, and performance tuning.
2. Day 2 Operations & Support
- Provide enterprise-wide operational support for Microsoft Windows, Linux, VMware, and middleware platforms.
- Respond to and resolve infrastructure and system incidents, escalating where necessary.
- Participate in 24/7 on-call support rotations and provide timely incident response.
- Conduct daily health checks, log reviews, and preventive maintenance activities.
3. Security & Compliance
- Perform security hardening and testing based on CIS benchmarks and Singapore Government security standards.
- Manage user accounts, access controls, password resets, and system roles securely.
- Ensure compliance with IT governance, audit, and regulatory requirements.
- Support disaster recovery planning, testing, and execution.
4. Collaboration & Projects
- Participate in the design and implementation of new infrastructure solutions and enhancements.
- Liaise with vendors, service providers, and internal IT teams to resolve issues and deliver solutions.
- Support IT audits, capacity planning, and documentation of infrastructure standards and procedures.
5. Patch Management & Upgrades
- Test, stage, and deploy OS patches, firmware updates, and COTS/middleware upgrades.
- Apply change management best practices for controlled deployment of fixes and enhancements.
6. Hardware & Platform Support
- Support and maintain enterprise-grade hardware: Dell servers, Cisco switches, FortiGate firewalls, UPS, storage (Dell EMC), and power management.
- Manage and troubleshoot enterprise platforms such as:
- Middleware & Integration: IBM MQ, IBM ACE, Kafka, WebSphere
- Databases & Data Services: MS SQL, MongoDB
- GIS & Applications: ArcGIS, Elastic Stack, Rocket.Chat
- Security Tools: Symantec, Carbon Black EDR, CipherTrust, Keycloak, Fortify WebInspect
- DevOps & Monitoring: Grafana, Prometheus, GitLab Enterprise, Ansible, OpenShift, Red Hat Satellite
7. Documentation & Continuous Improvement
- Maintain SOPs, system diagrams, maintenance logs, and incident reports.
- Generate reports on incidents, performance, and preventive actions.
- Propose automation and reliability improvements to strengthen system resilience.
Requirements
- Diploma / Bachelor's Degree in Computer Science, Information Technology, or equivalent
- At least 3-5 years of combined experience in system administration and infrastructure support.
- Strong knowledge of Windows Server, RHEL, VMware, and enterprise hardware (servers, storage, networking).
- Hands-on experience with backup and recovery solutions, firewalls, and middleware platforms.
- Familiarity with container platforms (OpenShift/Kubernetes), DevOps tools (Ansible, GitLab), and IT monitoring stacks (Grafana, Prometheus).
- Working knowledge of ITIL processes (incident, change, and problem management).