Key Responsibilities:
Linux / RHEL Administration:
- Install, configure, and maintain Linux servers (RHEL/CentOS).
- Perform system upgrades, patching, and security hardening.
- Manage users, groups, file systems, and permissions.
- Configure network services (DNS, DHCP, SSH, NFS, FTP, etc.).
- Perform backup, recovery, and server performance optimization.
- Troubleshoot kernel, boot, and OS-related issues.
Monitoring / Observability:
- Deploy, configure, and manage monitoring tools such as Nagios, Zabbix, Prometheus, Grafana, Splunk, ELK, Dynatrace, or Datadog.
- Create dashboards, alerts, and metric-based monitoring for systems, networks, and applications.
- Monitor server health, performance, capacity, and resource utilization.
- Implement log aggregation, log analysis, and incident monitoring frameworks.
- Develop automation scripts for monitoring integration using Shell, Bash, or Python.
Infrastructure & Operations:
- Manage virtualization technologies (VMware, KVM, Hyper-V).
- Maintain configuration management tools (Ansible, Puppet, Chef).
- Support cloud environments (AWS, Azure, GCP - optional).
- Participate in on-call support and incident response.
- Document configurations, procedures, and change management activities.
Required Skills:
- Strong expertise in RHEL / Linux system administration.
- Hands-on experience in monitoring and observability platforms.
- Knowledge of system performance tuning, patch management, and troubleshooting.
- Experience with Scripting (Shell, Bash, or Python).
- Familiarity with configuration management and automation tools.
- Understanding of network concepts, firewalls, and security protocols.
EA License # 14C6941