
Search by job, company or skills
. Maintain open source-based application monitoring infrastructure. Enhance, optimize, and migrate to new solutions if required.
. Support application teams to migrate to latest OpenShift versions, perform deployment of stateful/stateless apps, and troubleshoot issues in Kubernetes/OpenShift platforms.
. Work with application developers to implement application instrumentation libraries and frameworks.
. Maintain metrics data store using TSDBs like Prometheus. Perform administration and tuning like cardinality optimization, resource optimization.
. Maintain distributing tracing infrastructure like Otel, Jaeger, Zipkin, etc. Perform administrative functions and tuning like sampling strategy. Troubleshoot distributed tracing in microservices.
. Perform production support activities of enterprise logging platforms like ELK stack, Grafana Loki, etc. Work on Index Lifecycle management in Elastic search.
. Implementing alerting infrastructure, integrate with PagerDuty, MS teams and any other software which needs alert-based mitigation/action. Assist application support team to define alerting rules for enterprise business apps.
. Deploy and do administration of visualization tools like Grafana/Elastic. Create dashboarding templates which can be reused, Implement RBAC for the entire userbase.
. Educate and implement observability culture in dev community. Assist them identifying golden signals, defining SLI, SLO for enterprise applications, calculate error budgets, MTTD, and MTTR.
. Troubleshoot the infra issues in the observability infrastructure in Linux VMs and Kubernetes PODs, Setup and secure reverse proxies, secure all application endpoints with TLS, enable MFA, LDAPS, OAuth based on requirement.
. Configure CI/CD pipeline for all the monitoring infrastructure and services. Modify and extend existing pipeline to cater multiple environments/regions.
Required experience:
Minimum 3 yrs experience as a Software Engineer.
The person should have hands-on on below key technologies
. Elasticsearch/Kibana - Cluster Management, Search Optimization
. Prometheus/Grafana
. OpenTelemetry
. Linux OS troubleshooting
. Kubernetes deployments, CI/CD pipelines
. Good understanding of SRE practices
Registration No. / Unique Entity Number: 199801439D
Disclaimer:The company is committed to ensuring the privacy andsecurity of your information. By submitting this form, you consent to thecollection, processing, and retention of the information you provide. The datacollected (which may include your contact details, educational background, workexperience and skills) will be used solely for the purpose of evaluating yourqualifications for the position you're applying for. Your data will be storedsecurely and retained for the duration necessary to fulfill our hiring process.If you are not selected for the position, your data will be kept on file for alimited period in case future opportunities arise. You have the right toaccess, correct, or delete your data at any time by contacting us at QuessSingapore | A Leading Staffing Services Provider in Singapore (quesscorp.sg)
This is in partnership with the Employment andEmployability Institute Pte Ltd (e2i).
e2i is the empowering network for workers and employersseeking employment and employability solutions. e2i serves as a bridge betweenworkers and employers, connecting with workers to offer job security throughjob-matching, career guidance and skills upgrading services, and partneringemployers to address their manpower needs through recruitment, training, andjob redesign solutions. e2i is a tripartite initiative of the National TradesUnion Congress set up to support nation-wide manpower and skills upgradinginitiatives. By applying for this role, you consent to QuesscorpSingapore's PDPA and e2i's PDPA.
Job ID: 147797903
Skills:
Java, Prometheus, Node.js, Grafana, Elk Stack, Gcp, Docker, Terraform, Azure, Python, Kubernetes, AWS
Skills:
.NET, .Net Core, Waterfall, Prometheus, React, Git, Javascript, Docker, Agile, Kubernetes, Golang, GitHub Actions, Cloud Kubernetes platforms, ArgoCD
Skills:
network security, Docker, Cloud Infrastructure, Kubernetes, security measures, CI CD pipelines, logging systems, data protection protocols
Skills:
Openshift, Elk, Openstack, cloud, Kubernetes, Grafana, Docker, "Application support", "Application operations", "application security", Site Reliability Engineering, "solution architecture", containerisation, Observability, CI/CD, "SRE"
Skills:
Oracle Database, Maven, Restful Api, Elk Stack, Grafana, Microservices, Docker, Openshift, Shell scripting, Java, Mq, Openstack, JBOSS EAP, Autosys, Sql, Web Services, Sftp, Rhel, Datawarehouse, Kubernetes, Document Management System, CI CD, File transfers, Observability platforms, Server-side Java, Watermelon
We don’t charge any money for job offers