Job Summary
We are seeking a highly technical Platform Engineering Manager to lead the design, development, and continuous evolution of enterprise-grade cloud infrastructure, developer platforms, and large-scale distributed systems. This role will translate advanced research and technical requirements into highly scalable, reliable, and secure engineering solutions, lead a high-performing platform team, and drive the long-term technical strategy of the company's core technology platforms.
Key Responsibilities
- Lead, manage, and grow the platform engineering team in building, operating, and scaling cloud infrastructure, developer platforms, and mission-critical core systems.
- Research, design, develop, test, and maintain computer systems, network software, system utilities, and platform-level components.
- Analyze business and technical requirements, then design and implement software solutions based on computer science, engineering, and mathematical analysis principles.
- Improve, optimize, and upgrade existing platforms to enhance system performance, stability, security, reliability, and scalability.
- Establish and oversee software testing, validation, and quality assurance processes to ensure robust and production-ready systems.
- Own end-to-end platform architecture design, including cloud-native, microservices, containerization, and distributed systems.
- Drive and enforce engineering best practices including DevOps, SRE, CI/CD, infrastructure as code (IaC), and automation.
- Monitor system health, performance, and reliability identify bottlenecks and provide technical solutions to ensure stable and efficient operations.
Qualifications
- PhD degree in Computer Science, Software Engineering, Electrical Engineering, or a closely related field (required).
- 8+ years of professional experience in platform engineering, software engineering, systems engineering, or infrastructure development
- 3+ years of formal people management or leadership experience in a technology team.
- Strong foundation in computer science including data structures, algorithms, operating systems, computer networks, and distributed systems.
- Proficiency in at least one core programming language (e.g., Go, C++, Java, Python).
- Deep expertise in system architecture, high-concurrency, high-availability, and large-scale distributed systems.
- Hands-on experience with major cloud platforms (AWS, Azure, GCP) or hybrid/private cloud infrastructure.
- Strong system performance tuning, troubleshooting, and problem-solving capabilities.
- Experience in hardware-software integration is a strong advantage.
- Excellent communication, leadership, and cross-functional collaboration skills.