Search by job, company or skills

T

Server and Storage Management Services (Day-2 Operations) Engineer

4-6 Years
SGD 6,000 - 8,000 per month
new job description bg glownew job description bg glownew job description bg svg
  • Posted 6 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Role: Server and Storage Management Services (Day-2 Operations)

Location: Ang Mo Kio

Type of Role: Contract (12 months renewable)

Remuneration: Base Salary - $6,000 - $8,000

Renumeration: 1 month bonus upon completion of 12 months contract (subjected to attendance, performance etc..)

Service Objective

Provide secure, reliable, and compliant Day-2 operations for enterprise server, virtualization, storage, and core infrastructure platforms, ensuring service continuity, data protection, performance stability, and audit readiness following formal handover from implementation teams.

Service Outcomes

The Server and Storage Management Service delivers the following measurable outcomes:

  • Stable and secure operation of server, virtualization, and storage platforms.
  • Consistent enforcement of operating system, platform, and storage hardening baselines.
  • Timely patching and vulnerability remediation with auditable evidence.
  • Predictable performance, capacity, and availability across compute and storage layers.
  • Verified backup, recovery, and data protection capabilities.
  • Controlled maintenance and changes with minimal service disruption.
  • Clear operational visibility through monitoring, reporting, and compliance artifacts.

Service Scope (In Scope)

Server System Administration

  • Day-2 administration of physical and virtual servers following formal handover.
  • Operating system administration for supported platforms.
  • User, group, and access management in accordance with approved security policies.
  • Configuration management, system utilities maintenance, and job scheduling.
  • Review and management of system logs, alerts, and operational events.
  • Maintenance of server administration documentation, runbooks, and SOPs.

Virtualization and Hypervisor Operations

  • Day-2 operations of virtualization platforms and virtual machine estates.
  • Resource allocation, availability management, and performance monitoring.
  • Support for virtual appliance hosting on approved hypervisor platforms.
  • Coordination of maintenance activities impacting virtualized workloads.
  • Validation of service availability following host or cluster changes.

Storage and SAN Operations (Explicitly In Scope)

  • Day-2 operations of enterprise storage platforms and SAN infrastructure.
  • Storage provisioning, LUN management, zoning coordination, and access control.
  • Performance monitoring, latency analysis, and capacity trend management.
  • Storage availability monitoring and incident response.
  • Coordination of storage-related changes, upgrades, and maintenance activities.
  • Support for data services hosted on storage platforms, including snapshot and replication features where applicable.
  • Documentation and maintenance of storage configuration and operational records.

Server and Storage Security Administration

  • Enforcement of operating system, platform, and storage hardening baselines.
  • Management of certificates, encryption keys, and trust components on server platforms.
  • Compliance with vulnerability management and security remediation requirements.
  • Coordination of mitigation controls where patches are unavailable.
  • Support for audit and compliance activities through evidence generation.

Patch Management

  • End-to-end patch management covering:

o Operating systems

o Hypervisors

o Server firmware and BIOS

o Storage firmware and platform components

  • Patch identification, risk assessment, approval coordination, deployment, and validation.
  • Emergency and high-risk patch handling within defined timelines.
  • Monthly patch compliance reporting and ad-hoc status updates when required.

Backup, Recovery, and Data Protection

  • Administration of enterprise backup and recovery solutions.
  • Scheduling, monitoring, and remediation of backup jobs.
  • Management of backup media, repositories, and retention policies.
  • Periodic restoration testing to validate recoverability of systems and data.
  • Support for recovery activities during incidents, exercises, or audits.

Performance, Fault, and Capacity Management

  • Continuous monitoring of server and storage health indicators.
  • Threshold-based alerting and proactive fault response.
  • Analysis of CPU, memory, disk, I/O, and storage performance metrics.
  • Trend analysis to identify bottlenecks and capacity risks.
  • Input into periodic capacity planning and performance optimization exercises.

Maintenance and Change Execution

  • Planned maintenance with documented risk and impact assessments.
  • Controlled execution of server, hypervisor, and storage maintenance activities.
  • Graceful shutdown and startup procedures in accordance with approved runbooks.
  • Post-maintenance validation to confirm service restoration and data integrity.

Automation

  • Automation of repeatable administrative, compliance, and reporting tasks.
  • Automation must be auditable, secure, and aligned with change governance.
  • Automation shall not bypass approval, security, or service management controls.

Service Scope (Out of Scope)

  • Server or storage architecture design and initial implementation activities.
  • Major platform migrations or technology refreshes unless separately commissioned.
  • Application-level functional support beyond infrastructure responsibilities.
  • Activities outside the defined scope without formal service request or change approval.

Service Controls and Governance

  • Services are delivered in alignment with ITIL practices, including Incident, Problem, Change, Configuration, and Knowledge Management.
  • All server and storage activities are subject to documented approvals, auditability, and evidence retention.
  • Security hardening, patching, and backup validation are mandatory operational controls.
  • Disputes on fault ownership shall be supported by logs, metrics, and documented analysis.

Service Reporting and Evidence

  • Monthly service reports covering:

o Server and storage availability.

o Incident, problem, and change metrics.

o Patch and vulnerability compliance status.

o Backup success rates and restoration test results.

  • Performance and capacity utilization reports for compute and storage.
  • Compliance and audit evidence packs.
  • Ad-hoc reports provided upon request, subject to agreed timelines.

Supported Server, Storage, and Infrastructure Platforms

The Server and Storage Management Service provides Day-2 operational support for the following platforms and technologies. These define the minimum platform competency coverage required for service delivery.

Physical Server Platforms

  • Dell PowerEdge Servers

o Enterprise rack-mounted servers supporting infrastructure and virtualized workloads

  • Cisco UCS Servers

o Compute platforms supporting infrastructure services and virtual appliances

Virtualization and Hypervisors

  • VMware ESXi

o Host, cluster, and virtual machine operations

o Availability, resource, and performance management

  • Virtual Appliance Hosting

o Support for security, monitoring, and infrastructure appliances deployed as virtual machines

Operating Systems

  • Microsoft Windows Server

o OS administration, patching, and security hardening

  • Red Hat Enterprise Linux

o OS administration, patching, and compliance enforcement

Enterprise Storage and SAN

  • Primary Storage Platforms

o Pure Storage arrays

  • SAN Infrastructure

o Fibre Channel fabrics and zoning

o Integration with compute and virtualization platforms

  • Storage Services

o Provisioning, performance optimization, and capacity management

Backup and Recovery Platforms

  • Enterprise Backup Solutions

o Commvault backup appliances and associated components

  • Recovery Capabilities

o System, application, and data recovery support

Security and Trust Infrastructure Hosted on Servers

  • Authentication and Identity Services

o RSA multi-factor authentication platforms

  • Hardware Security Modules

o Thales Luna Network HSM

o Thales Luna Backup HSM

  • Certificate and Key Management

o Certificate lifecycle and cryptographic key operations

Core Infrastructure Services Hosted on Server and Storage Platforms

Operational support includes infrastructure services hosted on managed platforms, including but not limited to:

  • DNS, DHCP, and IP address management (e.g. Infoblox)
  • Identity and authentication services
  • Patch and vulnerability management platforms
  • Log management and event forwarding
  • Backup, recovery, and notification services

Operational Capabilities Across All Server and Storage Platforms

  • 24x7 operational monitoring and incident response (where contracted)
  • Patch and firmware lifecycle management
  • Backup verification and disaster recovery testing
  • Performance monitoring and capacity forecasting
  • Controlled change execution and validation
  • Security posture enforcement and compliance reporting

Assumptions and Dependencies

  • Day-2 operational responsibility commences only after formal handover and acceptance.
  • Platform designs, baselines, and configurations are provided by the implementation phase.
  • Required system access, documentation, and tooling are available at service commencement.

Service Completion Criteria

The Server and Storage Management Service is considered successfully delivered when:

  • Server and storage services meet agreed service levels.
  • Patch, hardening, and backup compliance is demonstrable.
  • Incidents, changes, and problems are managed within defined controls.
  • Audit and reporting requirements are satisfied.

Requirements:

  • Minimum 4 years experience in Server and Storage management.
  • Knowledge of Windows Server 2016/2019/2022.
  • Knowledge of VMware or storage or cloud or Redhat.
  • MCSE/MCSA/VMware certified would be advantageous.
  • Office hours, but rotating standby to work on night or weekend tasks when required.
  • Must be a team player.

For a confidential discussion, interested applicants please kindly send your updated resume.

More Info

Job Type:
Industry:
Employment Type:

Job ID: 137375293