Responsibilities:
- Design, develop, and maintain backend services, workflows, and automation supporting server and infrastructure operations
- Support full server lifecycle management, including provisioning, firmware updates, monitoring, and decommissioning
- Build and maintain out-of-band server management tools in multi-vendor environments
- Implement state management, telemetry collection, and system monitoring solutions
- Write, review, and test code, including validation on physical hardware where required
- Automate testing and operational processes to improve system reliability and efficiency
- Troubleshoot server provisioning, firmware, and network boot issues end to end

Requirements:
- Relevant experience in systems engineering, infrastructure, or a related technical role
- Strong understanding of TCP/IP networking fundamentals
- Hands-on experience with Linux systems and server management
- Experience troubleshooting server network boot processes from both network and server perspectives
- Proficiency in coding and automation in one or more of Python, Go, Rust, Bash, or Ruby
- Experience managing and automating large-scale server fleets using out-of-band management interfaces such as Redfish and IPMI
- Proven ability to analyze and troubleshoot server and systems architecture issues
- Experience with Kubernetes, Docker, and/or cloud deployment technologies
- Bachelor's degree in Computer Science, Software Engineering, or a related field, or equivalent practical experience
- Linux and/or TCP/IP networking certifications (preferred)

This is a contract role based in Singapore.