Work with customers to understand their AI workloads, technical requirements, and scaling needs
Collaborate with Engineering and Ops teams to translate high-level architectural designs into actionable deployment plans
Design tailored GPU infrastructure solutions in capacity planning, cluster design, networking, and performance optimization
Provide architectural oversight during the deployment of GPU clusters, ensuring networking, storage, and compute configurations meet performance and reliability benchmarks
Support onboarding, proof-of-concept projects, and GPU cluster deployments
Troubleshoot architecture-level challenges and recommend best practices for performance and cost optimization
Create solution diagrams, architecture documentation, and technical proposals
Own and create BOMs for customer solutions including RFQ
Serve as a subject matter expert on Nscale's platform, features, APIs, and capabilities
About You
Masters/Bachelor Degree in Computer Engineering, Information Technology or equivalent
At least 7 years of experience in Solution architecture, cloud architecture, systems engineering, or similar roles
Strong understanding of GPU compute, HPS, AI/ML workloads, and high performance infrastructure
Strong experience in networking (L2/L3, InfiniBand, RoCE), storage systems and distributed compute patterns
Experience in deployment of data centre networking and connectivity
Ability to translate customer needs into clear, scalable technical solutions
Strong communication skills and able to conduct presentation to technical and non-technical audiences
Ability to work cross-functionally with Engineering, operations, product, and customer teams
Hands-on experience with Linux systems, containers, orchestration tools