DevOps Field Engineer

2-4 Years
SGD 3,000 - 5,000 per month
  • Posted 8 days ago

Job Description

Company Overview

Blue Silo is a technology company specialising in mission-critical, enterprise-grade software solutions for defense, government, and industrial clients. We build, deploy, and operate robust, secure, and scalable systems across both cloud and on-premises (including air-gapped) environments.

To support the full lifecycle of our deployed systems, we are expanding our Operations & Engineering team with a DevOps Field Engineer. This role is central to bridging cloud-based DevOps practices with hands-on, on-site deployment and support in secure client environments where availability is non-negotiable.

Role Summary

The DevOps Field Engineer is responsible for the deployment, configuration, operation, maintenance, and troubleshooting of Blue Silo systems across two distinct environments: cloud-based deployments (AWS, Azure, or GCP) and on-premises, often air-gapped, client sites. The role covers both software and hardware, including application platforms, servers, networking, RFID systems, and CCTV/video systems.

The engineer is the named first responder for production incidents under contracted Service Level Agreements (SLAs), and is expected to be on-site at affected client locations within 8 hours for critical issues. The role works closely with software, infrastructure, operations, and customer teams to ensure stable operations, secure deployments, and continuous improvement across development, staging, and production environments.

This is a hybrid role. When DevOps and field workload is heavy, the engineer focuses on deployments, incident response, and operations. When DevOps workload is lighter, the engineer contributes as a JavaScript / Node.js developer on backend utilities, APIs, application modules, and system integrations. Candidates should be comfortable wearing both hats.

Key Responsibilities

1. Service Level Ownership & Incident Response

  • Act as the named first responder for production incidents on assigned client systems under contracted SLAs.

  • Attend the affected client site within 8 hours for critical (Severity 1) incidents, and within agreed timelines for lower severities.

  • Participate in an on-call rotation, including evenings, weekends, and public holidays as required by client SLAs.

  • Lead diagnosis, mitigation, and resolution of incidents, and own root cause analysis and post-incident reviews.

  • Coordinate escalation to the remote development team, vendors, and system integrators where required.

2. Cloud Deployment & DevOps

  • Deploy, configure, and maintain applications and supporting infrastructure on a major cloud platform (AWS, Azure, or GCP) across development, staging, and production environments.

  • Design, implement, and maintain CI/CD pipelines for automated build, test, and release management.

  • Implement Infrastructure as Code (IaC) using tools such as Terraform or CloudFormation for reproducible environment builds.

  • Support containerised deployments using Docker, Kubernetes, or equivalent orchestration platforms.

  • Implement and manage cloud-native monitoring, logging, and security services.

3. On-Premises & Air-Gapped Site Deployment

  • Perform on-site installation, commissioning, integration, and operational readiness activities at client premises, including air-gapped and security-controlled facilities.

  • Manage offline software distribution, patching, and dependency mirroring for environments without internet access.

  • Administer Linux and Windows server environments, virtualisation platforms (VMware, Hyper-V), and physical/virtual networking.

  • Implement and verify backup, replication, and disaster recovery jobs (Veeam or equivalent).

  • Monitor system health and performance using tools such as Zabbix, addressing alerts proactively.

4. Hardware & Integrated Systems Support

  • Configure, test, and troubleshoot RFID devices, readers, antennas, and associated middleware, and support RFID asset-tracking integration with applications.

  • Configure and maintain CCTV cameras, Network Video Recorders (NVR), Video Management Systems (VMS), and related network components.

  • Troubleshoot connectivity, streaming, storage, recording, and hardware-related issues across integrated systems.

  • Support integration between RFID/CCTV systems and operational workflows where applicable.

5. Software Development (JavaScript / Node.js)

  • Contribute as a developer on backend utilities, internal tools, REST APIs, and integration components using JavaScript and Node.js.

  • Support troubleshooting, bug fixes, and feature enhancements of existing application modules.

  • Step up to a primary developer role on assigned features or modules during periods of lower DevOps and field workload.

  • Automate routine operational tasks and deployments using Bash, PowerShell, or Python.

  • Maintain source control, deployment scripts, and configuration management repositories.

6. Documentation, Compliance & Knowledge Custodianship

  • Maintain comprehensive technical documentation: runbooks, architecture diagrams, configuration records, troubleshooting guides, and incident logs.

  • Ensure operational procedures and configurations comply with project, client, and security requirements.

  • Support audit, inventory, and asset management activities.

  • Translate operational insights into updated documentation, automation, and preventive improvements.

Required Qualifications

  • 2+ years of hands-on experience in a DevOps, site reliability, system administration, field engineering, or software development role. Direct experience supporting production systems under SLA is a strong plus but not mandatory.

  • Working experience or solid project exposure with at least one major cloud platform (AWS, Azure, or GCP), including deploying, configuring, or operating systems.

  • Comfort working on customer premises, including environments with restricted or no internet connectivity. Prior on-site deployment experience is preferred but not required.

  • Working knowledge of Linux and Windows Server administration.

  • Familiarity with networking fundamentals: TCP/IP, VLAN, routing, firewall, and switch configuration.

  • Exposure to CI/CD pipelines, Docker, and Infrastructure as Code (Terraform or equivalent), with a willingness to deepen these skills on the job.

  • Proficiency in JavaScript and Node.js, with the ability to contribute as a developer on backend utilities, APIs, integrations, or application modules during periods of lower DevOps workload.

  • Working knowledge of at least one additional scripting language (Bash, Python, or PowerShell) for automation.

  • Familiarity with system monitoring tools (Zabbix preferred) and backup/recovery concepts (Veeam or equivalent).

  • Strong analytical and troubleshooting skills across both hardware and software.

  • Excellent written and verbal communication skills, with the ability to work effectively with clients, developers, and vendors under pressure.

  • Willingness and ability to be on-site at client locations within 8 hours for critical incidents, including outside standard business hours. A valid driving licence and own transport are preferred.

  • Eligibility to work on defense, government, or other security-sensitive projects, including any clearance, vetting, or background-check requirements specified by the client.

Preferred Qualifications

  • Diploma or Degree in Computer Engineering, Information Technology, Computer Science, Software Engineering, or a related discipline.

  • Experience with RFID systems, asset tracking, or related hardware integration.

  • Experience configuring and troubleshooting CCTV systems, IP cameras, NVRs, and VMS platforms.

  • Experience supporting mission-critical or operational technology (OT) systems in defense, government, or industrial settings.

  • Experience with container orchestration (Kubernetes) and configuration management (Ansible).

  • Familiarity with MQTT, IoT, or edge computing environments.

  • Certifications in cloud platforms (AWS / Azure / GCP), Linux, Kubernetes, networking, or ITIL.

  • Familiarity with documentation and collaboration tools (Confluence, SharePoint, or similar).

What We Offer

  • A foundational role in Blue Silo's Operations & Engineering team, with direct impact on client outcomes.

  • Exposure to both modern cloud platforms and high-stakes on-premises environments.

  • Backing from an experienced development team for complex troubleshooting and escalations.

  • Competitive compensation, on-call allowance where applicable, and professional development support.

How to Apply

Please send your resume and a cover letter outlining your experience with both cloud and on-site deployments, and any SLA-bound support roles you have held, to [Confidential Information].

Include "DevOps Field Engineer Application" in the subject line.

Job ID: 147455287