Assist in the design, implementation, and maintenance of observability solutions using ELK stack
Monitor and analyze system performance, identify bottlenecks, and troubleshoot issues across distributed systems.
Collaborate with development and operations teams to instrument applications and infrastructure for better visibility.
Develop and maintain dashboards, alerts, and reports to provide actionable insights to stakeholders.
Participate in incident response and root cause analysis to ensure system reliability and performance.
Stay up-to-date with industry trends and best practices in observability and monitoring.
Support the migration from legacy monitoring tools to modern observability platforms.
Document processes, configurations, and best practices to ensure knowledge sharing and continuity.
Degree or Diploma in Computer Science, Information Technology, or a related field
Hands-on experience with modern observability tools and frameworks is preferred but open to candidates with prior experience in monitoring solutions who are looking to transition into modern observability practices.
Candidate should have a strong foundation in monitoring and troubleshooting.
Experience with scripting languages (e.g., Python, Bash) for automation and data processing is preferred.
Strong problem-solving skills and a proactive attitude towards learning new technologies.
Excellent communication and teamwork skills, with the ability to work effectively in a collaborative environment