This is a contract role ending in December 2026.
Responsibilities:
- Develop back-end services, workflows, and automation to manage server fleets, including full server lifecycle processes (network boot, firmware updates, OS provisioning, failure detection, and decommissioning).
- Build out-of-band server management tools for multi-vendor environments, including state management and telemetry collection.
- Write, review, and maintain code; test changes on hardware; and automate testing processes.
- Troubleshoot server provisioning, firmware update, and network boot issues end-to-end.
Requirements:
- 5-7 years of experience in systems engineering or related fields.
- Strong understanding of TCP/IP networking fundamentals.
- Proficiency with Linux systems and server systems management.
- Experience troubleshooting server network boot processes from both network and server perspectives.
- Skilled in coding and scripting for automation (Python, Go, Rust, Bash, Ruby).
- Experience automating management of large server hardware fleets (Redfish, IPMI).
- Proven analytical skills for troubleshooting server and system architectures and their components.
- Familiarity with Kubernetes, Docker, and/or cloud deployment technologies.
- Bachelor's degree in Computer Science, Software Engineering, or a related field (or equivalent experience).
- Linux and TCP/IP certifications are preferred.