We're looking for an experienced HPC Systems Engineer to run and evolve our Linux-based high-performance computing (HPC) platforms supporting researchers, academics, and enterprise users.
Job scopes:
- Operate and support large-scale Linux HPC clusters, storage, and high-speed networks
- Manage HPC platforms including Slurm / PBS Pro / LSF and parallel file systems such as Lustre, GPFS, or BeeGFS
- Monitor performance, perform upgrades and patching, and plan for future capacity
- Troubleshoot complex issues across hardware, OS, storage, and networking
- Support AI / deep learning workloads in collaboration with software engineers
- Advise researchers on HPC application performance, debugging, and parallelisation
- Deliver user training and contribute to technical documentation
- Participate in on-call or escalation support when required
Requirements :
- Degree in Computer Science, Engineering, or related field
- 5+ years experience supporting large-scale HPC environments
- Strong hands-on skills with:
-Linux (RHEL, Rocky, SUSE)
-HPC schedulers and resource managers
-Parallel file systems
- Solid understanding of HPC performance tuning
- Nice to have :
-HPC code optimisation and parallel programming
-Fortran, C/C++, MPI, OpenMP
Please send your detailed resume in MS Word format to [Confidential Information] with
- Education Level
- Working experiences
- Each employment background
- Reason for leaving each employment
- Last drawn salary
- Expected salary
- Date of availability