Trulyyy

Site Reliability Engineer

Posted 25 days ago
1-3 Years

Job Description

Client:

Global Leader in Connectivity and Smart Technology

Responsibilities:

  • Act as the technical Subject Matter Expert (SME) for the deployment and management of Microservices on Kubernetes-based cloud platforms.
  • Collaborate with Cloud Technical Development and DevOps teams to facilitate the deployment of services across Multi-Cloud Environments.
  • Conduct Load Testing and Chaos Engineering exercises to validate the scalability and resilience of microservices.
  • Develop observability solutions for Microservices and cloud platforms such as AWS, OCI, Azure, and GCP.
  • Create and implement Disaster Recovery plans in partnership with Development and DevOps teams.
  • Analyze and troubleshoot production risks stemming from resource limitations, including node groups, CPU, memory, HPA scheduling, JVM pre-warming, etc.
  • Write and maintain automation scripts using languages like Python, Go, or Bash.
  • Define and monitor KPIs (SLA/SLO/SLI) for all cloud microservices in collaboration with development teams to enhance business insights.
  • Produce and maintain comprehensive technical documentation, including architecture diagrams, design specifications, and operational procedures.
  • Lead incident response efforts to swiftly diagnose and resolve production issues.
  • Conduct post-incident reviews to identify root causes and recommend solutions or mitigations.
  • Support product and technology selection processes, including Proof of Concepts (POCs).

Requirements:

  • Bachelor's degree in Computer Science, Information Technology, or a related discipline.
  • At least 1 year of experience as a Site Reliability Engineer.
  • Proficiency in programming and scripting languages such as Java, Python, Bash, or PowerShell.
  • Practical experience in SRE, DevOps, cloud operations, and cloud security best practices.
  • Strong understanding of security technologies, including Identity and Access Management, Network Security, Application Security, and Data Protection.
  • Excellent problem-solving and analytical capabilities, with the ability to work both independently and collaboratively.

Regrettably, only shortlisted candidates will be notified.

Please note that any data provided will be used for recruitment purposes only.

Business Registration No.: 202004228R | License No.: 20S0118 | EA Registration No.: R1986587

More Info

Industry: Other

Function: Information Technology

Job Type: Permanent Job

Date Posted: 05/09/2025

Job ID: 125530267

Last Updated: 19-09-2025 05:33:04 PM