
Search by job, company or skills
About Rakuten Group, Inc.:
Rakuten Group, Inc. (TSE: 4755) is a global leader in internet services that empower individuals, communities, businesses and society. Founded in Tokyo in 1997, Rakuten has expanded to offer services in e-commerce, fintech, digital content and communications to approximately 1.8 billion members around the world. The Rakuten Group has over 30,000 employees and operations in 30 countries and regions. For more information, visit https://global.rakuten.com/corp/(opens in a new tab).
About Rakuten Asia:
Situated in the heart of Singapore's Central Business District, Rakuten Asia Pte. Ltd. is Rakuten's Asia Regional headquarters. Established in August 2012 as part of Rakuten's global expansion strategy, Rakuten Asia comprises various businesses that provide essential value-added services to Rakuten's global ecosystem. Through advertisement product development, product strategy, and data management, among others, Rakuten Asia is strengthening Rakuten Group's core competencies to take the lead in an increasingly digitalized world.
Job Summary:
Overview
We are hiring a DevOps Engineer to strengthen our platform reliability, deployment speed, and observability across cloud environments. You will own CI/CD, infrastructure automation, container orchestration, and monitoring, partnering closely with backend, QA, and product teams.
Responsibilities
Own and improve CI/CD pipelines (Jenkins/GitHub Actions/Bitbucket Pipelines)
Containerize services and manage runtime environments (Docker, Kubernetes on GKE/EKS)
Provision and manage cloud infrastructure (GCP/Azure IaaS VM management)
Implement IaC across environments (Terraform/Helm)
Set up monitoring and alerts (Prometheus/Grafana Cloud Monitoring/CloudWatch) contribute to tracing and logging standards
Support database/storage operations (RDBMS, NoSQL Cloud SQL/RDS backups, replication, performance)
Manage networking/IAM (VPC, LB/DNS, RBAC, secrets management)
Drive reliability improvements and incident response define and improve SLI/SLOs
Collaborate with QA on test environments and automation integration
Support services in production environments to ensure high availability and performance
Align with lifecycle management (LCM) strategy ensure systems, software, and middleware are upgraded regularly
Align with company security requirements prioritize remediation of vulnerabilities in server environments
Drive cost-awareness and optimization across infrastructure and operations
Design and evolve reusable DevOps libraries and CI/CD pipelines that can be applied across projects
Collaborate with DevOps peers and architects to align best practices, standards, and technical direction for infrastructure and DevOps across the organization
Requirements
Education: Bachelor's degree or higher in Computer Science, Computer Engineering, or a related field
Experience: 5+ years in DevOps/SRE with hands-on production ownership
Cloud: AWS/GCP platform experience IaaS VM operations
CI/CD: Jenkins, GitHub Actions, or Bitbucket Pipelines
Containers: Docker Kubernetes operations on GKE/EKS (upgrades, scaling, multi-env)
IaC: Terraform/Helm environment orchestration and standardization
Monitoring/Observability: Prometheus/Grafana, tracing, and cloud alerting familiarity with Cloud Monitoring/CloudWatch define actionable SLOs PagerDuty setup
Scripting: bash or Python for automation practical runbook usage
Databases/Storage: RDBMS (MySQL, Oracle etc.) NoSQL (Redis, MongoDB, Couchbase, Firestore etc.) S3/GCS RDS/Cloud SQL (backups, replication, performance tuning)
Networking/IAM: VPC design, load balancers, DNS, RBAC, secrets management
Scaling patterns: high-traffic services across layers (web, database, logging)
Production operations: commitment to availability, incident response and postmortems, and recovery
Cost-effectiveness: plan/monitor infra usage and optimize costs
Good to have:
Experience with GCP (GKE, Cloud SQL, Cloud Monitoring) in production
Helm charts and Kustomize service mesh familiarity
Secrets management (e.g., HashiCorp Vault, cloud KMS)
Performance testing/optimization and capacity planning
Experience in setting up NoSQL DBs (e.g., Redis, MongoDB, Firestore, Couchbase) and CDN usage
Experience with microservices, and event-driven architectures
Experience with middleware such as distributed caching, message queues, RPC frameworks
Job ID: 147288083
Skills:
System Security, Windows Server, Gpo, PowerShell, Network security, Microsoft Sql Server, Bash, Storage, Linux Ubuntu, Devops, Ansible, Clustering, Python, AD, SSL hardening, Azure Stack Hub, Patching, Commvault backup, VM networking
Skills:
Unix, Vpns, Prometheus, Dns, Tcp Ip, Grafana, Datadog, Firewalls, Javascript, Docker, Terraform, Shell scripting, Proxies, Python, Azure DevOps, AWS, Bash, Elk Stack, Jenkins, Linux, Ansible, Splunk, Load Balancing, Kubernetes, GitHub Actions, GitLab CI CD
Skills:
Dns Configuration, Continuous Delivery, VLAN, Firewalls, Docker, Terraform, Application Servers, Proxies, Web Servers, Node.js, Continuous Integration, Jenkins, Git, Network Administration, Ansible, Reactjs, Load Balancers, Amazon Aws, Puppet, Chef, routing configuration, Shell-scripting, Agile processes, subnets, Atlassian Bamboo, database servers
Skills:
Github, Cloudformation, Prometheus, Grafana, Linux Administration, Solarwinds, Jenkins, Appdynamics, Git, Docker, Terraform, Ansible, Sonarqube, Gitlab, Kubernetes, AWS, CloudFabrix
Skills:
Github, Cloudformation, Prometheus, Grafana, Linux Administration, Solarwinds, Jenkins, Appdynamics, Git, Terraform, Docker, Ansible, Sonarqube, Gitlab, Kubernetes, AWS, CloudFabrix
We don’t charge any money for job offers