Role Overview
As a Senior Cloud Platform Engineer, you will design, build, and maintain the scalable infrastructure that powers our services. You are a specialist in Infrastructure as Code (IaC) and automation, ensuring our cloud ecosystems are secure, cost-effective, and highly resilient.
Our Platform Engineering Culture
- Customer Obsession: We don't build tech for tech's sake; we build to empower our developers and deliver value to our end users.
- Psychological Safety: We foster an environment where everyone feels safe to voice concerns and challenge the status quo. We embrace fail-safe engineering and prioritize learning from failures through a strictly blameless culture.
- Practical Integrity: We do the next right thing by prioritizing long-term platform health over quick fixes, while remaining pragmatic about delivery timelines.
- Constructive Transparency: We value honest feedback and open communication as the primary drivers of business success and operational reliability.
Key Responsibilities
- Infrastructure as Code: Architect and manage cloud environments using Terraform and Python to ensure repeatable and version-controlled deployments.
- Workload Portability: Leverage Kubernetes (K8s) and Helm charts to build an open, stable application platform that ensures consistent performance and operational flexibility across different environments.
- CI/CD Leadership: Maintain and optimize automated workflows via GitHub to ensure high-velocity engineering.
- Operational Excellence: Provide expert-level management of AWS infrastructure while ensuring technical standards are compatible with diverse architectural requirements.
- Security & FinOps: Implement Security by Design principles and proactively optimize cloud spend to balance performance with cost-efficiency.
- Availability & Reliability: Participate in a shared on-call rotation to support critical production infrastructure, ensuring high uptime and rapid incident response.
Technical Requirements
- Core Tools: Minimum 3+ years of hands-on experience with Terraform, GitHub, and Python.
- Cloud Platforms: Extensive experience with AWS (Required); a strong understanding of building flexible, adaptable infrastructure patterns.
- Containerization: Advanced knowledge of Kubernetes (K8s) focusing on platform stability and standardized deployment strategies.
- Agile Mindset: Deep familiarity with Agile and Scrum methodologies.
- Incident Management: Experience using Jira Service Management (JSM) for tracking incidents and maintaining post-mortem documentation.
Experience: 5+ years total in a DevOps, SRE, or Platform E