Professional Experience:
- 4–7+ years of relevant experience in application support, systems support, or operations roles.
- Experience supporting production systems in a high-availability or mission-critical environment.
Technical Expertise:
- ● Strong hands-on experience with: Application log analysis and monitoring tools (e.g. AWS CloudWatch, Grafana, ELK, Google Analytics, etc)
- Linux/Unix environments
- Working knowledge of cloud platforms (e.g. AWS services such as ECS, Lambda, S3, RDS).
- Basic database knowledge (MySQL, PostgreSQL) for health checks and simple queries.
- Basic knowledge on REST APIs, system integrations and authentication design
- Understanding of incident, problem, and change management processes.
Problem-Solving Skills:
- Strong analytical and troubleshooting skills.
- Ability to break down complex incidents into clear, actionable steps.
- Calm and methodical approach when handling production issues under pressure.
Operational Practices:
- Familiarity with ticketing and incident management tools (e.g. Jira, PagerDuty).
- Experience working with runbooks, SOPs, and on-call support rotations (if applicable).
Additional Skills (Bonus Points):
- Experience supporting cloud-native or microservices-based systems.
- Basic scripting skills (e.g. Bash, Python) for automation.
- Experience working in government, regulated, or large-scale enterprise environments.
- Knowledge of disaster recovery and business continuity planning.