
Our client is a cloud computing service provider delivering large-scale, fast, and reliable AI
compute for global customers.
Key responsibilities include:
. Assist the Solution & Technical manager in completing the planning and topology design
of the network, and provide technical suggestions at the on-site implementation level.
. Write detailed network configuration documents, wiring diagrams, and
operation and maintenance manuals.
. Complete the deployment, configuration, and debugging of high-performance
networks.
. Conduct network-layer testing and in-depth optimization, and solve related problems
during testing.
. Troubleshoot and maintain the network, monitor network health, and quickly locate
and resolve network faults.
. Cooperate with hardware engineers, storage engineers, and cloud engineers to solve
cross-domain faults and problems.
Who You Are
. Bachelor's degree or higher in Computer Science, Electronic Engineering, Automation, or
a related field.
. 5+ years of data center network operation and maintenance experience, including experience
in HPC (High-Performance Computing) or AI cluster network maintenance.
. Holders of NCDA (NVIDIA Certified Data Center Associate) or advanced network
certifications such as CCIE/HCIE are preferred.
. Familiar with the configuration and management of NVIDIA Quantum-2 and Spectrum
series switches.
. Proficient in network configuration, problem analysis, and parameter tuning on Linux
systems.
. Scripting skills (Python/Shell), with the ability to write automated
operation and maintenance scripts.
. Familiar with structured cabling systems and cabling standards, and able to
terminate optical fibers and network cables.
. Able to adapt to a high-intensity project delivery pace and on-site working
environment.
Job ID: 138848859