Role and Responsibilities:
The Level 3 (L3) Support team provides the highest level of technical expertise for the HEALIX Data Streaming platform. This role is responsible for resolving complex technical issues escalated from Level 2 (L2) support, performing in-depth root cause analysis, and driving long-term solutions to enhance system stability, performance, and scalability. The L3 team also supports ongoing platform improvements, integration enhancements, and environment optimization
Key Responsibilities:
- Resolve critical or complex incidents escalated from L2 support.
- Conduct deep-dive analysis of application, data pipeline, and infrastructure issues.
- Work with vendors and engineering teams to implement permanent fixes.
Root Cause Analysis & Problem Management:
- Lead technical investigations to identify underlying causes of recurring or major issues.
- Develop and implement preventive measures and corrective action plans.
- Maintain technical documentation and RCA reports for audit and knowledge sharing.
System Enhancement & Optimization:
- Support design and implementation of enhancements to improve system reliability and performance.
- Review and optimize data streaming workflows, resource utilization, and integration configurations.
- Participate in code reviews, configuration tuning, and performance testing.
Change & Release Management:
- Support deployment of software updates, patches, and configuration changes.
- Collaborate with DevOps and engineering teams to ensure smooth rollouts and rollback plans.
- Validate system stability post-deployment.
Collaboration & Knowledge Transfer:
- Work closely with development, data engineering, and architecture teams on system improvements.
- Provide technical guidance and training to L2 support.
- Contribute to the creation and maintenance of the technical knowledge base.
Requirements / Qualifications Requirements:
- Degree in Computer Science, Information Technology, Engineering, or related field.
- Experience in system, data platform, or application support.
- Hands-on experience with cloud platforms (e.g., AWS), data streaming frameworks (e.g., Kafka, Databricks), and automation tools.
- Strong understanding of APIs, data pipelines, monitoring tools, and incident management processes.
- Strong communication and collaboration skills across technical and business teams.
- Familiarity with ITIL or similar service management frameworks is preferred.