- Work on GenAI, large‑scale model training, and GPU performance optimization
- Exposure to multi‑node, multi‑GPU systems, low‑level optimization
About Our Client
The client is a global technology manufacturer recognized for innovation in advanced systems, AI, and high‑performance computing. With a strong commitment to research, sustainability, and engineering excellence, they provide an environment where highly technical engineers can solve complex, real‑world problems at scale.
Job Description
- Architect and execute large‑scale model training and fine‑tuning on multi‑node, multi‑GPU clusters
- Optimize training and inference performance using distributed strategies (DDP, FSDP, DeepSpeed, Megatron‑LM)
- Design and develop autonomous AI Agents for complex, multi‑step manufacturing workflows
- Profile and analyze GPU‑intensive workloads to identify compute, memory, and latency bottlenecks
- Develop and optimize high‑performance GPU kernels using CUDA or related GPGPU frameworks
- Partner with hardware architects to shape next‑generation accelerator features
- Build performance regression testing frameworks for drivers, compilers, and runtime system
The Successful Applicant
- At least 5 years experience in GPU computing, performance optimization, or low‑level systems programming
- Deep knowledge of GPU architectures, memory hierarchies, and interconnects
- Strong hands‑on experience with PyTorch and distributed model training techniques
- Expertise in LLM fine‑tuning, inference optimization, and GenAI application development
- Advanced C++ skills and proficiency in CUDA or other GPGPU frameworks
- Solid understanding of end‑to‑end ML systems, CI/CD pipelines, and cloud or on‑prem environments
- Excellent analytical, communication, and cross‑functional collaboration skills
What's on Offer
- Competitive compensation and performance‑linked incentives
- Opportunity to work on industry‑leading AI and GPU technologies
- Exposure to large‑scale, real‑world GenAI and manufacturing systems
- Strong emphasis on learning, innovation, and technical growth
- Collaborative, inclusive culture with long‑term career progression
Contact
Lydia Chen (Lic No: R22108104 / EA no: 18C9065)
Quote job ref
JN-042026-6992136
Phone number
+65 6416 9829
Michael Page (Personnel) Pte Ltd | Registration No.201736642C