Provide advanced Level 3 (L3) production engineering support for enterprise Security Operations, Facility Management, Incident Monitoring, Workforce Management, and other operational support platforms within 24×7 mission-critical environments.
Manage complex production incidents involving access control systems, surveillance integrations, operational monitoring platforms, workforce scheduling systems, reporting engines, and document/output management solutions.
Perform in-depth root cause analysis (RCA), log analysis, transaction tracing, batch failure analysis, and service restoration across distributed operational ecosystems.
Support enterprise integrations involving security monitoring platforms, facility support applications, middleware interfaces, APIs, Kafka messaging, and automated operational workflows.
Execute production deployments, middleware upgrades, OS patching, certificate renewals, controlled data fixes, and release validation activities across Linux/UNIX production environments.
Administer and support operational batch processing, automated scheduling, reconciliation workflows, output distribution, and enterprise reporting services.
Develop Shell, Perl, Python, and SQL automation scripts for operational monitoring, alerting, health checks, incident recovery, and process optimization.
Lead major incident bridges, operational escalations, DR/BCP exercises, failover validation, and service recovery coordination with regional and global support teams.
Drive operational resilience, platform stability, observability enhancements, proactive monitoring, and automation initiatives for enterprise support systems.
Ensure compliance with enterprise security standards, audit controls, operational governance, and regulated service management processes.
Requirements
10+ years of hands-on L2/L3 production support experience in enterprise operational environments, security operations, facility management systems, banking, or other highly regulated industries.
Strong expertise in UNIX/Linux administration (RHEL, Solaris, OpenBSD) and enterprise operational platform troubleshooting.
Advanced experience in Oracle PL/SQL, Sybase ASA, PostgreSQL, MS SQL Server, Azure SQL, and production data remediation activities.
Proven experience supporting enterprise operational systems, workforce management platforms, monitoring systems, reporting engines, output/document management, or integrated operational support environments.
Strong exposure to Splunk, AppDynamics, Control-M, AutoSys, Kafka, WebLogic, Apache Tomcat, and distributed monitoring ecosystems.
Deep understanding of enterprise batch scheduling, operational workflow orchestration, system integrations, and production governance frameworks.
Expertise in incident, problem, and change management, RCA methodologies, SLA management, and high-severity operational recovery handling.
Hands-on scripting and automation experience using Shell, Perl, Python, PowerShell, or SQL in enterprise support environments.
Experience supporting HA/DR architectures, operational resiliency frameworks, and mission-critical service continuity operations.
Ability to operate as an SME or escalation lead during critical outages involving infrastructure, application, database, middleware, and vendor coordination teams.