- Work on cutting-edge AI infrastructure and heterogeneous GPU system
- High-impact role shaping next-generation large-scale AI and LLM system
About Our Client
Our client is a globally recognised leader in cloud computing and AI infrastructure, operating large-scale distributed systems that power enterprise and AI workloads worldwide. The organisation is known for its strong engineering culture, innovation in AI platforms, and commitment to building next-generation computing capabilities.
Job Description
- Design and optimise AI system architecture
- Run GPU/system testing and performance benchmarking
- Troubleshoot GPU, CUDA, and system-level issues
- Build monitoring and diagnostic tools
- Optimise LLM training and inference performance
- Drive fast-paced infrastructure deployment
The Successful Applicant
- Strong experience in GPU/AI systems (NVIDIA/AMD)
- Python, C++, Linux, CUDA
- Familiar with AI frameworks (PyTorch, etc.)
- Experience in performance tuning and debugging
- Strong problem-solving and communication skills
What's on Offer
- Competitive salary + bonus
- Work on cutting-edge AI systems
- High ownership and impact
- Strong career growth in a global tech environment
Michael Page International Pte Ltd | Registration No. 199804751N