We are hiring a Systems Engineer. This role focuses on building and provisioning servers at massive scale, developing automation for fleet management, and ensuring server health across the APAC region.
Location: Singapore
Key Responsibilities:
- Automation & Development: Develop back-end services, workflows, and automation for fleet management, including full server lifecycle (network boot, firmware updates, OS provisioning, failure detection, and decommissioning).
- Tooling: Develop out-of-band server management tooling in a multi-vendor environment, managing state and telemetry collection.
- Coding: Write, review, and test code, and automate testing on hardware.
- Troubleshooting: Troubleshoot provisioning, firmware updates, and network boot issues end-to-end.
Requirements:
- Experience: 5-7 years of relevant industry experience in Systems Engineering, DevOps, or Production Engineering.
- Coding: Proficiency in coding and scripting for automation is non-negotiable (Python, Rust, or Go preferred).
- Networking: Strong understanding of TCP/IP network fundamentals.
- Hardware: Experience troubleshooting hardware and firmware issues; familiarity with Dell, HP, or Nvidia DGX servers preferred.
- Systems: Experience with Linux systems, server management, and troubleshooting boot processes.
Preferred Skills:
- Experience with Kubernetes, Docker, or cloud deployment technologies is a big plus.
We regret that only shortlisted candidates will be notified.
Job Reference: R25150511 Cammy Li Xin Hui
Allegis Group Singapore Pte Ltd, Company Reg No. 200909448N, EA License No. 10C4544