Search by job, company or skills

Digital Edge DC

Senior Manager, DC Operations, Incident Management Office

8-10 Years
new job description bg glownew job description bg glownew job description bg svg
  • Posted 9 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Who we are:

Digital Edge DC (Digital Edge) is a leading data center platform company, established to transform digital infrastructure in Asia. We seek to build the foundation for the world's digital future, helping organizations to grow sustainably and empowering the populations they serve.

Through building and operating state-of-the-art, energy-efficient data centers rich with connectivity options, we bring new colocation and interconnect options to the Asian market, making infrastructure deployment in the region easy, efficient and economical. Backed by Stonepeak, a leading alternative investment firm specializing in infrastructure and real assets, Digital Edge has in excess of US$1.6 billion in committed capital.

Founded in late 2019, the company has grown rapidly across multiple markets in the region, with 30+ data centers in operations and under construction, and this role is an exciting opportunity to join the team as we further expand our footprint across Asia.

The Role

The Senior Manager, Infrastructure Management Office (IMO) is the regional owner of incident (problem) management and operational risk governance across Digital Edge's data center portfolio.

This role is central to driving a shift from reactive incident response to predictive, risk‑based operations, ensuring that incidents, maintenance strategy, CapEx/OpEx decisions, Planned Preventive Maintenance (PPM) contracts, and RFP requirements are systematically linked and continuously improved.

Key Responsibilities

Problem Management & Root Cause Governance

  • Establish and lead a regional problem management framework
  • Ensure major and repeat incidents are thoroughly investigated, with root causes identified and corrective actions tracked to closure

Incident‑to‑Investment Alignment

  • Translate incident trends and failure patterns into actionable insights for CapEx and OpEx decision‑making
  • Partner with maintenance and finance stakeholders to highlight the cost and risk impact of deferred or declined investments

Maintenance Strategy & PPM Governance

  • Review and support standardisation of PPM contracts across all countries
  • Ensure contracts include clear SLAs, response times, and performance accountability aligned to operational risk

RFP & Operational Requirement Review

  • Review RFPs from an operational perspective, including maintenance obligations, response commitments, and change management requirements
  • Ensure operational risks are addressed prior to contractual commitment

Risk Reporting & Leadership Support

  • Provide structured reporting on repeat failures, systemic weaknesses, and emerging operational risks
  • Deliver data‑driven insights to senior leadership to support informed decision‑making

Governance, Safety & Compliance

  • Actively champion and implement the organization's policies on health & safety, environment, energy, quality, information security, and business continuity, ensuring adherence to incident reporting, legal, and regulatory requirements.
  • Ensure compliance with incident reporting standards, legal, and regulatory requirements

What We're Looking For

Required Experience & Capabilities

  • 8–10+ years in data center operations, reliability engineering, or infrastructure governance
  • Strong experience in incident analysis, problem management, and operational risk
  • Proven ability to convert operational data into structured insights and recommendations
  • Comfortable engaging with senior stakeholders at Director level
  • Strong analytical thinking with a disciplined, structured approach

Preferred Attributes

  • Systems thinker with a strong data governance mindset
  • Experience working across multiple countries or regions
  • Clear executive‑level communication and reporting skills
  • Demonstrated ability to standardise processes in complex operational environments

If you're motivated by building resilient operations, learning from incidents, and driving step‑change improvements at scale, we'd love to hear from you.

More Info

Job Type:
Industry:
Employment Type:

About Company

Job ID: 145697845