Search by job, company or skills

V

Operations Engineer (DevOps / SRE)

3-5 Years
SGD 7,000 - 10,000 per month
new job description bg glownew job description bg glownew job description bg svg
  • Posted 6 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

We are looking for an Operations Engineer (DevOps / SRE) to support and enhance the reliability, performance and scalability of our core technology platforms. You will be responsible for day to day operations, system stability and infrastructure optimisation across our production environments.

Our platforms include corporate websites, CDN infrastructure, instant messaging systems, customer service SaaS platforms and cloud based infrastructure. This role offers hands on exposure to production grade SaaS systems, multi cloud environments and high availability architectures.

Key Responsibilities:

System Operations and Reliability

  • Manage daily operations and monitoring of company platforms including website, CDN, IM system and customer service SaaS
  • Ensure 24x7 system stability, high availability and business continuity
  • Maintain servers, containers, networks and storage resources

Cloud Infrastructure Management

  • Deploy and manage cloud resources including compute, networking, load balancing and object storage
  • Support multi cloud or hybrid cloud environments such as AWS, Huawei Cloud, Alibaba Cloud and Cloudflare
  • Perform capacity planning, cost optimisation and resource scaling

CDN and Domain Management

  • Manage CDN configuration, DNS records and HTTPS certificate lifecycle
  • Optimise global access performance and cross region reliability
  • Troubleshoot CDN caching, origin and certificate related issues

Containerisation and Automation

  • Maintain and optimise Docker and Kubernetes clusters
  • Build and maintain CI/CD pipelines for automated deployment
  • Develop automation scripts using Shell, Python or similar tools

Monitoring and Incident Response

  • Build and maintain monitoring, logging and alerting systems such as Prometheus, Grafana and ELK
  • Respond to production incidents, perform root cause analysis and post incident reviews
  • Maintain operational SOPs, emergency plans and system documentation

Security and Compliance

  • Perform system and network hardening
  • Manage firewall rules, security groups, access control and permissions
  • Support security audits, penetration testing and remediation efforts

Requirements :

Basic Requirements
- Bachelor's degree in Computer Science or a related field, or equivalent practical experience
- At least 3 years of experience in Linux system operations, DevOps or SRE related roles
- Strong understanding of Linux systems such as CentOS or Ubuntu
- Solid knowledge of networking fundamentals
- Mandarin is required for work communication with internal or external stakeholders.

Technical Requirements
- Hands on experience with cloud platforms such as AWS, Huawei Cloud, Alibaba Cloud or GCP
- Good understanding of CDN architecture, DNS management and HTTPS or SSL certificate configuration
- Experience with Docker and container based environments
- Kubernetes operational experience is preferred
- Familiarity with web architectures including Nginx, APIs and microservices
- Experience with monitoring, logging and alerting systems

Preferred Qualifications
- Experience supporting SaaS platforms or B2B systems
- Experience with instant messaging systems or customer service platforms
- Experience working in multi tenant system environments
- Exposure to automation scripting using Shell or Python
- Experience in system security hardening or cloud security practices

Personal Attributes
- Strong sense of responsibility and ability to handle production incidents
- Able to work independently and perform under pressure
- Good documentation habits and process awareness
- Good communication skills and ability to work with cross functional teams

What we offer

  • Hands on exposure to production grade SaaS and high availability systems
  • Opportunities to work with multi cloud, CDN, IM and SaaS technology stacks
  • Flat structure with strong technical ownership
  • Competitive remuneration and long term career development

More Info

Job Type:
Industry:
Function:
Employment Type:

Job ID: 137374175