Job Overview:
We are seeking a talented and driven AI Computing Platform Software Developer to join our cutting-edge engineering team. In this role, you will be responsible for designing and developing software systems that power AI computing platforms. Your work will focus on optimizing performance, scalability, and efficiency for AI workloads, ensuring our platform meets the growing demands of machine learning and deep learning models. You will collaborate with researchers, data scientists, and engineers to build next-generation AI computing platforms and infrastructure.
Key Responsibilities:
- Design, develop, and optimize the software architecture and source code for AI computing platforms that support machine learning and deep learning workloads.
- Implement and enhance algorithms for distributed computing, parallel processing, and NPU/GPU acceleration.
- Work on software components that interface with hardware accelerators such as NPUs, GPUs, and specialized AI chips.
- Collaborate with AI researchers and developers to improve platform performance for complex AI applications.
- Design and implement tools and frameworks for efficient deployment, monitoring, and scaling of AI workloads.
- Integrate new AI models and algorithms into the computing platform while ensuring scalability, fault tolerance, and efficiency.
- Collaborate with cross-functional teams to troubleshoot and resolve platform-related performance issues.
- Conduct performance profiling, optimization, and benchmarking to ensure the platform delivers high throughput and low latency for AI workloads.
- Stay up to date with the latest developments in AI hardware and software architecture, and continuously improve platform capabilities.
Required Skills and Qualifications:
- Master's degree in Computer Science, Electrical Engineering, or a related field (PhD preferred).
- Extensive experience in developing high-performance software for computing platforms, ideally with a focus on AI/ML workloads.
- Strong programming skills in languages such as C/C++, Python, or Go, with a focus on low-level programming for high-performance systems.
- In-depth knowledge of parallel programming techniques and distributed systems.
- Experience with GPU programming (CUDA, OpenCL) or specialized hardware accelerators (e.g., NPUs, FPGAs).
- Familiarity with machine learning frameworks such as PyTorch, TensorFlow, or MXNet.
- Understanding of AI model deployment, orchestration, and optimization on large-scale platforms.
- Experience with containerization (Docker, Kubernetes), AI computing platforms (Hugging Face, Kaggle), and cloud infrastructure (AWS, GCP, Azure).
- Strong problem-solving skills and the ability to optimize software performance for AI workloads.
Preferred Qualifications:
- Experience with deep learning frameworks and tools for performance profiling and optimization.
- Knowledge of low-level hardware optimization techniques (e.g., memory management, CPU/NPU/GPU instruction tuning).
- Familiarity with system-level software for AI computing platforms, such as TensorRT or ONNX Runtime.
- Background in designing and implementing high-performance distributed systems and storage solutions.
- Strong understanding of networking, I/O, and data management techniques for AI workloads.
Interested candidates, please click APPLY to begin your job search journey.
We regret to inform you that only shortlisted candidates will be notified.
By sending us your personal data and curriculum vitae (CV), you are deemed to consent to PERSOLKELLY Singapore Pte Ltd and its affiliates to collect, use, and disclose your personal data for the purposes set out in the Privacy Policy available at . You acknowledge that you have read, understood, and agree with the Privacy Policy.
PERSOLKELLY Singapore Pte Ltd
RCB No. 200007268E
EA License No. 01C4394
Registration ID Heah Sian Wei