
Tencent

Hunyuan LLM Site Reliability Engineer

Experience: 2-4 Years

Job Description

Business Unit

Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as for the construction and operation of R&D management and data centers. TEG provides users with a full range of customer services. As the operator of the largest networking, devices, and data center in Asia, TEG also leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, constructing new platforms and supporting business innovation.

What The Role Entails

  • Responsible for the operation and maintenance of Hunyuan's overseas model services, ensuring stable, reliable, and efficient service operations;
  • Responsible for capacity management and planning and for resource cost optimization, ensuring reasonable online service capacity and improving resource efficiency;
  • Responsible for continuous integration and delivery and for efficient, automated operational optimization, enhancing service stability and research and development efficiency;
  • Participate in the design of online systems and various service architectures, providing professional solutions for stability and architecture improvement;
  • Analyze the shortcomings of existing systems in depth, use data to identify weak points, and drive the implementation of system optimizations and improvements;
  • Follow cutting-edge industry technology trends, and explore technologies and directions for automation and intelligence in the operation and maintenance of complex business systems.

Who We Look For

  • Bachelor's degree or above, with 2 or more years of experience in internet operations and maintenance;
  • Familiar with the Linux operating system, with solid system administration and networking knowledge;
  • Familiar with deploying, configuring, and tuning components such as Nginx, Redis, and MySQL;
  • Proficient in monitoring systems such as Zabbix, Prometheus, and Grafana, with the ability to track the running status of overseas systems in real time;
  • Proficient in at least one programming language (such as Python, Go, or Shell), with experience in developing automated operational tools to meet the needs of complex and variable overseas operations and maintenance;
  • Familiar with the operation and maintenance of mainstream overseas public clouds (such as AWS and Azure), with experience in containerization and microservices architecture, and able to handle the characteristics and differences of local cloud services;
  • Strong sense of responsibility, good communication skills, learning ability, and team spirit;
  • Proficient in English and Chinese (listening, speaking, reading, and writing), and able to write and update workflow and technical documents promptly as required.

Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

More Info

Industry: Other

Function: Technology

Job Type: Permanent Job

Date Posted: 19/09/2025

Job ID: 126509539

Last Updated: 25-09-2025 01:13:20 PM