We are looking for an Incident Manager to join our engineering operations team and take ownership of our company-wide incident response program.
Job Description
- Own Supabase's end-to-end incident response process
- Monitor, enforce, and evolve incident best practices across all teams
- Detect patterns and coordinate proactive follow-up actions
- Lead retrospectives and ensure action items are prioritized and completed
- Work with engineering to maintain uptime SLA reporting per product
- Define and improve metrics for responsiveness, recovery, and impact
- Drive improvements in monitoring, alerting, and on-call rotations
- Ensure incident tooling (incident.io, Slack workflows, etc.) is configured and maintained
- Serve as a point of escalation for high-severity incidents and join the Incident Commander on-call rotation
- Mentor and coach teams on effective incident management practices
- Partner with support and customer success to ensure clear external comms during incidents
Job Requirements
- Have a minimum of 3 years of experience in incident management, site reliability engineering, or technical program management
- Have run or participated in on-call rotations and high-severity incident response
- Have used tools like incident.io, Opsgenie, PagerDuty, or Datadog On-Call
- Can facilitate calm, clear incident calls and write blameless retrospectives
- Prior experience in open-source or developer tools companies
- Exposure to regulatory compliance incident response (e.g., SOC 2, GDPR)
- Have experience in async or globally distributed teams