Search by job, company or skills

G

Program Manager II, Data Center Incidents and Availability - Singapore

2-5 Years
SGD 9,000 - 18,000 per month
Save
new job description bg glownew job description bg glow
  • Posted a day ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Product area

The Data Center team designs and operates some of the most sophisticated electrical and HVAC systems in the world. We are an upbeat, creative, team-oriented group of engineers committed to building and operating powerful data centers.

Job description

A problem isn't truly solved until it's solved for all. That's why Googlers build products that help create opportunities for everyone, whether down the street or across the globe. As a Program Manager at Google, you'll lead complex, multi-disciplinary projects from start to finish - working with stakeholders to plan requirements, manage project schedules, identify risks, and communicate clearly with cross-functional partners across the company. Your projects will often span offices, time zones, and hemispheres. It's your job to coordinate the players and keep them up to date on progress and deadlines.

Additional job description

In this role, you will ensure the Data Center Operations teams have the tools, processes, templates and training required to effectively prevent, detect, escalate, manage, and mitigate incidents. You will require close collaboration with site reliability engineers and global server operations in order to ensure that data center operations related incidents are mitigated.

Success in this role requires a breadth of data center infrastructure knowledge, experience with operational procedures, policies, business continuity plans, and electrical and mechanical maintenance activities. Additionally, this role will require close collaboration with facility managers, plant engineers, and facilities technicians.

Qualifications

Job responsibilities

  • Own the end-to-end incident management process, from real-time response and investigation to the execution of scalable root cause analysis findings and corrective actions.
  • Participate in on-call rotation supporting critical incident response.
  • Liaise with regional counterparts, as well as the program owners on the Data Center Incidents and Availability team to ensure global collaboration.
  • Liaise between Data Center campus members, Tech Incident Response Team (Tech-IRT), network security, and the crisis management groups to ensure incidents are effectively communicated and understood by all stakeholders.

Minimum qualifications

  • Bachelor's degree or equivalent practical experience.
  • 2 years of experience in program or project management.
  • Experience with data center power and cooling infrastructure.
  • Experience in Root Cause Analysis (RCA).
  • Ability to travel up to 30% of the time as required.

Preferred qualifications

  • Bachelor's degree in Mechanical or Electrical Engineering.
  • 2 years of experience managing cross-functional or cross-team projects.
  • Experience in data center operations or similar mission critical experience.
  • Public speaking skills and enthusiasm for teaching/leading multi-day training events.

More Info

Job Type:
Industry:
Employment Type:

Job ID: 147921853